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By Marston Morse and Gustav À. HEDLUND. °°: 


1. Introduction. In a recent paper! we initiated a theory of symbolic 
dynamics. In this theory we consider unending sequences of symbols or 
symbolic trajectories and devote attention to those properties of symbolic 
trajectories which are suggested by dynamical considerations. A symbolic 
trajectory is formed from symbols taken from a finite set of generating symbols 
subject to certain rules of admissibility. In SD admissibility conditions were 
formulated of such generality that the resulting symbolic trajectories include 
in particular those which ‘arise in the geodesic problem on surfaces which 
satisfy the condition of uniform geodesic instability. 

However, no surface of the topological type of a torus satisfies the con- 
dition of uniform geodesic instability and the admissibility conditions of SD 
do not include those which, arise in the case of-the torus. | 

In the present paper we consider a class of symbolic trajectories formed 
from two generating symbols subject to admissibility conditions defined by a 
simple comparison property. These are the symbolic trajectories which char- 
acterize the geodesics on a flat torus. They may be used to characterize the 
distribution of the zeros of the solutions of a differential equation of the form 
y” + f(x)y — 0, where f(z) is a periodic function of s. We term the tra 
jectories of this class Sturmian. A first fundamental result is as follows: 


Sturmian trajectories possess certain numerical characteristics, namely, 
a frequency, a pole, and a type index, and admit mechanical constructions 
uniquely determined by these characteristics. 


There are three types of Sturmian trajectories,—irrational, skew and 
periodic. The trajectories of irrational type are recurrent but not periodic; 
those of skew type are not recurrent. The recurrency function of a recur- 
rent Sturmian trajectory is completely determined by the frequency «æ of the 
trajectory and may be denoted by R(n,a). We introduce the variable 
y==a(1+ a). Let Cy/Dy be the convergents in a continued fraction repre- 
sentation of y. We have the following fundamental theorem: 


*Received June 19, 1939. - . 

+ Cf. Morse and Hedlund. (References will be-found in the bibliography at the end 
of the paper.) This paper will hereafter be referred to as SD. Numerous references 
will be found in the bibliography at the end of SD. 
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1 a I 
When a is irrational, R(n, a) increases by unity when n increases, from 


fn —lton except when n—=Dy,v—=0,1,-- +. Fon these exceptional values 
of n we have the relation 
R(Dy, a) == Dr + 2D) — 1 | (v—0/1,2,: °°) 


starting with v = 1 in the special case Do = Di. | 


The preceding theorem thus gives a simple mode of evaluating ae a) 
when æ is irrational. Previous to this the only non-periodic recurrent tra- 
.jectory of which the recurrency function had been determined was the Morse 
recurrent trajectory (cf. SD §8). . i 4s 
The evaluation of R(n, a) permits various extensions of our|knowledge of 
recurrency functions. In particular we are able to solve one of the problems posed 
at the end of SD., We had shown in SD § 7 that if R(n) is the recurrency func- 
tion of a general non-periodie recurrent trajectory, dim inf R(n)/n = 2. The 
, h #00 ' 
"results of the present paper show that the constant 2 cannot be replaced by a 
greater constant. | 
The proper choice of a yields a recurrency function R(n, a) such that 


R(n, a) LEE, | 


with R(n,«) becoming infinite more slowly than for any other previously 
known non-periodic trajectory. A final result on the asymptotic behavior of 
© R(n, a) is as follows. Let $(«) be a positive monotonically increasing func- 
tion of + defined for æ > 0. As n becomes infinite the lim. sup. of 
| Bn, a) | | 
| ng (log n) | 
is finite or infinite for almost all values of. a according as the series 
= i 
À #0) 
converges or diverges. In particular the lim. sup. of 


R(n, a) 
n log n 


. is infinite for almost all values of n while the lim. sup. of | i 
E(n, a) ; i 
. n (logn)°* B. A 1) 
is zero for almost all values of «. 
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` The results of this paper and its predecessor will presently be given its 
appropriate dynamical setting (cf. G. D. Birkhoff) in terms of trajectories on 
a space form. 
I CLASSIFIOATION AND REPRESENTATION. 


2. The comparison condition and general theorems. We shall consider 
sequences X of two symbols a and b of the forms 


(2. 1) te Ber Te aoe, 

(22) S aBaB.ua ++, 

(a3) ++ aBaaBe, | 

(2.3) a aB,a : - + aB,a | (=) 


in which B, is a finite block of bs. We admit that B, may be the null set, 
We term B, the cell of Æ of indez n, and term X a cell-sequence. A cell- 
sequence X of the form (2.1), (2.2) or (2.3) will be respectively termed a 
cell-series, a cell-beam or a chain. A chain ‘which contains n cells will be 
termed an n-chain. If the chain (2.3) appears in X it will be termed the 
chain [r,s] of Z. | 

Two cell-series (cell-beams) X and Y will be regarded as identical if and 
only if the cells of X and Y have the same index range and if cells with the 
same index are identical. On the other hand, two n-chains of the form 


aByua Ses Bin, 
aB’ q@1&*** OB’ qusa, 
will be regarded as identical if and only if 
Bees hey: à (i, =1,2,-+-,n). 


| The number of symbols b in an n-chain z will be called the b-length of z. 
We shall be concerned with cell-sequences X which satisfy the following 
condition. 


C. Under Condition C the b-lengths of any two n-chains of X with ihe 
same n shall differ by at most one. 


We term this condition the comparison condition. Cell-sequences which 
satisfy the comparison condition will be called Siurmian.. As we shall see, 
Sturmian sequences FRE in the theory of linear second order differential 
equations. t , . 


THEOREM 2.1. The b- lengths bm and by of arbitrary m- and n- -chains 
of a Sturmian chain satisfy the relation ; 


(2.4) n(bm +1) > m(ba— 1). 
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We shall give an inductive proof of the theorem, first noting that (2.4) : 
holds when m—n—1. Let N be‘any positive integer at most the maximum ` 


x 


of m and n in the theorem. ‘We assume the truth of (2.4) for n and m both -> 


less than N, and shall prove that (2.4) holds when m and n.are at most N. 


Since (2.4) holds for m = n, there remain two cases to eqtuider: 


Case I. N—=m>n. We here set m==pn+q with OSq<cn An . | 
m-chain z may be regarded as a sequence of p successive n-chains followed by . - 
The b-lengths © 


a g-chain, two successive chains having a symbol a in common, 
of our p n-chains are each at least ba — 1 so that 


' (2.5) by = p(ba—1) +, 


where bg is the b-length of the final g-chain of s. We arẹ assuming that 


(2.4) holds when m and n ave both less than Ñ, so that 
(2.6) nba 1) > q(ba—1).# 


- Adding 1 to both members of (2.5) and multiplying by n we find that 


(2.7) ‘(bm + 1) 2 np (ba —1) + n (ba +1). 


Upon using (2.6), (2. Y takes the form (2. a and the proof is s completo 


in Case I. 


Case II. N=n >m. Bet n— pin tg with 0 qi Dante 


as in Case I we find that 
(2. 8) k ` ; b, = p(bm + 1) + bg, 


i 
i 


where ba is the b-length of an arbitrary n-chain y and bg is| the b-length of- 


the final q-chain of y. By virtue of our inductive hypothesis 


L 
i 


seit 1 from both members of (2.8), than eee m and using 


(2.9), relation (2.4) results again as in Case I. N | 
The proof of the theorem is complete. 


THEOREM 2.2. If ba is the b-length of an arbitrary n-dhoin of a Stur-. 


mian beam or series, then b,/n tends to a finite limit a as n becomes infinite. 


It follows from (2.4) that 





pee ee ee ~ + = E 
and the theorem follows Pa ` | | 


We term a the frequency of the Sturmidn beam or series. 


we set 8 — 1/a and term £ the i rotation number. 
É shall be œ by convention. ; 





1 
| 
g 
1 


When 4540 
When «= 0, 


ff 


CES 
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when Zi is a Sturmian chain we shall get < 


Ga e-mai], e-ma], 


| De ranging over all b-lengths of m-cbeins of X. 


THEOREM 2.3. (a) A necessary and sufficient condition that a cell- 


| | sequence T be Sturmian ts that there evist a.constant a = 0 such that the 
` b-lengths ba of n-chains of T satisfy one of the two following sets of conditions 


‘for each n: 
(2.11) | l . na—l<b Snl, 
(2.11)” Fa Mae he 


(b) If T is a Sturmian chain, conditions (2.11) are satisfied if and only if 
a is on the interval d S Sa”, the right and left equalities prevailing in: 


_ (2. 11) at most when a =a” and.« respectively. (c) If T is a Sturmian 
beam or series, (2: 11) ts. antiafied if and only if a is the frequency. 


The conditions (2. 13) or:the conditions (2. 11)” ate sufficient that T 
be Sturmian since there are at most two integral values of bn which satisfy 
(2:11) or (2.11)” respectively. for a en n, and these integral values differ 


_ by at most ‘one. 


To prove the conditions (2. 11) necessary we suppose T' Sturmian. 
: We a with mig case in yoi Ti is a chain. It follows from (2.4) that 
rie moomo n | 
50 Gmr a < a”. Moreover. «= 20 except in the trivial case in which bm == 0 
for each m. It is easily seen that (È 11) holds ior’ <a’. For in 
such a cage 


(2:18) na +1 >ne tin a TEES 
Similarly ` . - 

: 1 
(2.14) na—1<n—1£<n +=] —1-%,. 


For # < a< a”, (2.11) thus holds with the equalities, excluded. It is also 
clear from (2.13) and (2.14) that (2.11)’ holds when æ = « and (2. 11)” 
when, a = g” but that (2. LÉ does not.hold in general when «== a” nor 


- (2.11)” when a =g.. 


The preceding analysis indtadées a proof of (b), as well as a proof of the 
necessity of (2.11) when T is a finite chain. 
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l ' | 
We come to the case in which T is a Sturmian beam or; series with fre- 
| quency a. That bm ma + 1 follows at once from (2.12)] upon letting n 
become infinite. Similarly we see from (2.12) that bn = na — 1 upon letting 
m beéome infinite. The conditions (2.11) taken as a whole are accordingly 

` satisfied by T. But it'is impossible that bm = ma + 1 and ba = na — 1 for 


ihe same beam or cell-series T. For in such a case we find that 
| (bn +1) = n(ba—1), | 


contrary to (2.4). Thus conditions (2.11) are necessary in one of the two 

forms. ` 
That (2.11) holds at a when a is the frequency follows upon Had 

the respective members of (2.11) by n and letting n becomelinfinite. | 
The proof of the theorem is complete. 


3. The classification of Sturmian beams and series according to fre- 
quencies. Let R be a Sturmian beam with a rational frequency a. When 
. &@>0 we set a —g/p where q and p are relatively prime integers. When 
a==0 we understand that q == 0 and p—1. 





LEMMA 8.1. "In a Sturmian beam R with a satoa ener a/b, 
ias cannot exist two p-chains with b-lengths different from % with Been 
cell indices. 


The lemma is illustrated by the Sturmian series 
(8.1) ` + -aB aByaBya > + : 


` in which the cell-series obtained by omitting Boa is periodié with cell lengths’ 
elternatingly 2 and 3, starting with b(B,) —3. The b-length of Bo shall be 3. 
_ Here p— 2, q=. The 2-chains in:general have the b-length q=6. But 
the 2-chain aB,aB,a has the b-length 6. 
We come to the proof of the lemma. | 
The b-lengths bp of p-chains of R satisfy (2.11)’ or (2.11). We con- 
sider the case in which (2,11)’ is satisfied. Then bp must be g or g-+1. 
We suppose the lemma false. There then exist two, p chains zand y of | 
E with b-lengths g +1 and with different cell indices., Without loss of : 
generality we can suppose that x precedes y in R. Let w be the m-chain of R 
with æ as its initial p-chain and y its terminal p-chain. We distinguish two 
cases: CaseI. m = 2p; CaselIl. p< m < 2p. i 





Case I. Letz be the subchain of w whose cells are not cells of £ or y, 
and suppose that z is an r-chain. We understand that r mäy be zero. Let br 
and bm be the b-lengths of z and w respectively. We have m — 2p +- r and 
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bm = br +- 2q +2. Upon applying (2.11) to br and to bm respectively, we 
find that 


(8.2) Ti<d< A+, oy age 
(3. 8) BANAL 1 <b +4 2S Opt en. 


Upon subtracting 2q from each member of (3. 3) we see that (3.2) is satis- 
fied with br replaced by br -++ 2. Since this is impossible we infer that Case I 
18 impossible. 


Case II. Let z be the subchain of w whose cells occur in both x and y, 
and suppose that z is an r-chain. Let b- and bm be the b-lengths of z and w 
respectively. We have m<=2p—r and bm = 2q + 2—b,. Upon applying 
(2. 11)” to br and ba respectively, we obtain (3.2) and the relation 


(8.3) Cr Gp) 5 ti 


Upon formally adding me respective members of (3. 8) a (8. 3)’ we obtain 
the relation Mae 
—2 <2 5&2, 


from which we infer that the equality holds in (8.2). Hence r must be a 

multiple of p. But this is impossible ifp<r< 2p. Thus Case IT is equally 

impossible. 

The case where (2. B holds is similarly treated, and the a is 

complete. A 
A Sturmian beam or series will be said to have the cell-period p if its 

cells satisfy the relation 

(3. 4) Bi = Bi 

for each admissible +. 


_ THEOREM 3.1. À periodic Sturmian series T or beam R with rational 
frequency a = q/p, where? (q, p) == 1, has the minimum cell-period p. The. 
b-lengths bn of tts n-chains satisfy the condition ` 
(3.5) na—1< dp < na +1, 
assuming each integral value by which satisfies (3.56). 


The p-chains of T have the constant b-length q. Otherwise there would 
be infinitely many p-chains with different cell-indices and with b-lengths 
different from g contrary to Lenfma 3.1. Hence T has the cell-period p. 


3 The notation (q, p) =1 shall mean that q and p are relatively prime integers.: 
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‘Let s be an arbitrary cell-period of T and let r be the b-length of an ` 
gue of T. Then i 
r 
(r = = 7 | 
This is poini only if s is a multiple of R Hence p is the minimum cell- 
period of T. 
That (3.5) is satisfied will follow from (2.11) once ‘the anih signs 
are excluded from (2.11). But an equality can hold in (2.11) only if na ` 
is an integer or zero, and this implies that n is a multiple of p. When n—rp, 
bn = rq since a p-chain of T has the b-length g, and we conclude that ba = na. 
Hence the equality never prevails in-(2. 11) and (3.5) holds as stated. 
To see that 6, assumes each integral value which satisfies (3.5) we first 
. note that when n is a multiple of p, na is the only value of b, which satisfies 
(3.5). There remains the case where n is not a multiple of p. But if for 
the given n, ba had but one value, n would be a period of T, and hence a 
multiple of p, contrary to hypothesis.: Hence bn assumes each integral value 
which satisfies (3.5). | . 
A similar proof applies Lo beams. 


The proof of the theorem is complete. 
Sturmian series with irrational frequencies will be termed AA 





_ Tamonex 3.2. The b-lengths b» of the n-chain of an irrational Stur- 
mian series with frequency a satisfy the condition - ! 


(3. 6) na—1 < ba< nat, 





j 
assuming each integral value bn which satisfies (3.6). : f 
` t 

The numbers ba satisfy (2.11)’ or (2. 11)” as we have seen. But when 
a. is irrational the equality can never prevail in (2.11). Moreover, for each 
` n, by assumes the two values defined by (3.6). Otherwise F would have the - 
cell-period n, and hence a rational frequency. | 

The proof of the theorem is complete. i 

Sturmian series which have rational frequencies but which are not Er 
will be termed skew. The appropriateness of the term will appear later. An 
example of a akew Sturmian series has already been given in! (3.1): Another 
example T* will be given here. To define T* we first define a cell-series Z. 
The beam of Z whose first cell is Bam shall have the cell-period 5. The b-lengths 
of its cells shall have a period block 21211., The beam of Z whose final cell 
is Bm shall also have the cell-period 5. The b-lengths of ith cells shall have 


a period block 11212. The b-lengths of cells of Z thus form a sequence 
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- (11212) (11212) (21211) (21211) - 


To obtain T* from Z we replace Bm in Z by a cell of bleah: 1. The series 
T* is seen to be Sturmian. Its frequency is 7/5. _ 
To analyze skew Sturmian series we introduce several terms. „An n-chain 
of a Sturmian series T' whose b-length is the maximum or minimum among 
b-lengths of n-chains of T will be said to be of max or min type respectively. 
A cell B, of T will be said to be of maz or min type if the chain aB,a is of 
max or min type -respectively. 
It follows from Lemma 8. 1'that in any skew Sturmian series T vith 
frequency q/p there exists one and only one p-chain whose b-length is different 
from g. This chain will be called the critical chain of T. | 


THEOREM 3.3. Let T be a skew Sturmian series with critical p-chain C. 
The beam following (pregeding) the initial (final) cell of C has the cell- 
period p. The initial cell B of C ts identical with the final cell of C while 
the cells immediately preceding and Tong C are identical and opposite in 
type to B. 


Let X and F be respectively the beams preceding the final and following 
the initial cell of C. The beams X and Y have the cell-period p since each 
of their p-chains has the b-length q. 

Suppose for simplicity that C is of max type. Let B’ be the terminal 
cell of C. The beams | | 
(8.7) aBX, YBa 


are not periodic for their terminal p-chains C have b-length qg + 1. All the 
p-chains in the two beams (3.7%) will have the b-length q provided their 
terminal cells are reduced by a unit in b-length, following which reduction 
both beams have the cell-period p. The cells thereby replacing B and B’ have © 
copies in T and must be of minimum type. Hence B and B’ are of maximum 
type and identical. 

That the cell preceding (following) C in T is of type opposite to B 
follows from the fact that the beams (3.7) are not periodic. 

The case where C is of minimum type is similarly treated. 


THEOREM 8.4. The b-length ba of the n-chains of a skew Sturmian 
sertes T with frequency « satisfy one of the conditions (2.11). Condition 
(2.11) {(2.11)”} is satisfied tf the critical chain is of maz type {min type}. 
The integers ba assume all integral values satisfying (2.11)’ {(2.11)”}. 


That the numbers bn satisfp one of the conditions (2.11)’ or (2.11)” 


follows from Theorem 2.3. Let us suppose that the conditions (2.11)’ are 


satisfied. Since T is not periodic, 6, must assume two distinct values for each 


| 
| 
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positive integer n. Corresponding.to å given integer n there are only two ` 
integral values which satisfy .(2.11)’ and it follows that b, must assume all 
integral values which satisfy (2.11)’. If a — a), where (q, p) ==1, then 
bp=q org+1. The critical p-chain of T must have b-length g + 1 and 
hence is of max type. 

Analogous arguments hold if the’ inttgers bn satisfy ; (2.11)” and the 
proof of the theorem is complete. | à 


Conprrion A. À set of m-chains will be À fo satisfy Condition A if, 
n being any positive integer not exceeding m, the b-lengths of the sub n-chains 
of the given set of m-chains assume at most two values. | 


; : r, ee Cl ; 
Lemma 3.2. If œ set of m-chains satisfies Condition À, the number of 
chains in the set cannot exceed m + 1. | ; 





If X and F are m- and n-chains respectively, XY shall mean the (m + 1)- 
_ chain whose first m-chain and last-n-chain are X and Y respectively. 

The lemma is obviously true if m1. We assume the lemma true for 
- . integers not exceeding m—1 and prove that-the lemma holds for the integer 
m> 1. If the lemmia were not true"there would exist a set of m + 2 different 
m-chains satisfying Condition A. Let us denote such a set by 


Ci, C2, FPE Omis. a i 


; l ; 
By the hypothesis of the induction, this set contains at most two different 
1-chains B and B* and therefore the set can be written in the form 


(0) | Ci DB, , Cms = Dania Binsa, 
where By, i= 1,2, : -,m +2, is either B or B* and the chains of the set - 
(D) | Dy Dz, + +; Dias | 
are (m—-1)-chains. By the hypothesis of the induction, there can be at most 
m different (m—1)-chains in the set (D). Since the members of the set (C) 
are assumed to be all different, there cannot be three identical (m — 1)-chains 
in the set (D). It follows that there are at least two pairs in the set (D) such 


„that members of the same pair are identical. We can assume the notation so 


chosen that 
- Die Dr, Di D, 
Cı = D,B, Ca = D,B*, ' C; = DB, Ci = D,B*. 


Since the members of the set (C) are all different, it follows that D, and D, 
“must differ in some cell and can be written in the form 


D; ES EB, Ds = E;:B:F:, | 
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where E., F,, Es are chains, EEA one of the pair B., B3 is B and the other 
is B*. The chains 
BRB. BP,B*, B:F,B, B;F,B* 


are subchains of the ‘given set of m-chains. These four chains contain the 
- same number of cells and their b-kengths assume three different values, From 
. this contradiction we infer the truth of the lemma. 


LEMMA 3.3. Corresponding to a given constant « = 0 the set of Stur- 
mian chatris satisfying (2.11)’ {(2.11)”} contains at most n +1 different 
n-chains. 


For the set of Sturmian n-chains sétisfying (2. 11)’ {(2.11)”} is a set 
satisfying Condition A and it follows from Lemma 3.2 that there can be at 
most n + 1 different n-chgins in the get. 

Let T be a Sturmian series and r an integer, positive, negative or zero. 
The Sturmian series 7” which results upon adding r to the index of each cell 
of T will be said to be similar to T. -We write T” ~T. 


THEOREM 3.5. A periodic Sturmian series T {beam R} with frequency 
a = q/p, where (q, p) —1, contains n + 1 different n-chains if 0<n<p 
and p different n-chains if n= p. Two periodic Sturmian series with the 
same frequency are similar. 


The series T has the minimum cell-period p (cf. Theorem 8. It fol- 
lows that if n < p, T must contain at least n + 1 different n-chains, for 
otherwise (cf. SD § 7; the arguments given in SD concern blocks, but similar 
arguments apply to chains) T would have a cell-period less than p. The 
b-lengths of the n-chains of T satisfy one of the conditions (2.11) and we 
infer from Lemma 3.3 that T contains at most n + 1 different n-chains. Thus 
T contains n +1 different n-chains if 0 <n < p. Since the number of 
different n-chains of T is a non-decreasing function of n, T contains at least 
p different n-chains if n= p. The periodicity of T implies that T contains 
at most p different n-chains. 

Let T and 7” be periodic Sturmian series with the same a 
a= q/p, (q,p) —1. The b-lengths of the n-chains of T and T” satisfy 
(3.5) and hence (2.11)’. It follows from Lenima 3.3 that the totality of- 
(p—1)-chains in T and T” form a set containing at most p different (p—1)- 
chains. But it has been shown that each of the cell-series T and T” contains p ' 
different (p — 1)-chains and we, infer that T and T” contain the same (p—1)- 
chains. In particular, T and T? contain identical (p—1)-chains and since 
the b-length of any p-chain of T or T” is q, it follows that T and T” contain. 
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identical p-chains. Since T and T” are periodic with cell;period p, they are 
similar. - 
Thé proof of the theorem is complete. 


THEOREM 3.6. A skew Sturmtan series U contains n +1 diferent 
n-chains for every positive integer n. Tyo skew Sturmian series with the 
same frequency and with critical chains of the same b-length are similar. 


Let n be a positive integer. Since U is not periodic it must contain at 
least n + 1 different n-chains and it follows from Theorem 3.4 and Lemma 
3.3 that U contains exactly n + 1 different n-chains. 

Let U and V be skew Sturmian series with the same frequency a = q/p, 
where (q, p) = 1, and with critical p-chains of the same length. The critical 
p-chains of U and V are either both of b-length q + 1 or both of b-length 
g—1. In the former case U and V satisfy (2.11)*and in the latter (2.11). 
It follows from Lemma 8.3 that U and V contain the same n-chains and in 
particular the same critical p-chains. By virtue of the relation of a skew 
Sturmian trajectory to its critical chain, as given in Theorem 3.3, we infer 
that U and V are similar. 

Let X be a cell-series and let Y be the cell-series obtained from X by 
inverting the order of the indices of the cells of X. We term F the inverse 
of X. If X is a Sturmian series, the inverse of Y satisfies Condition C and 
hence is Sturmian. If the inverse Y of a Sturmian series|X is similar to X,. 
we term X symmetric. 3 





4 





COROLLARY. A skew Sinima series is symmetric. 


For if X is a skew Sturmian series, its inverse Y is aida skew Stur- 
mian with frequency equal to that of X and with critical chain of b-length 
equal to that of the critical chain of X. It follows from | Theorem 3.6 that 
Y is similar to X and hence Xi is symmetric. | 


THEOREM 3.7. Two irrational Sturmian trajectories with the same fre- 
quency comian the same n + 1 diferent n-chains for each; positive integer n. 


Let T and T” be irrational Sturmian trajectories with the same frequency 
a. Since neither T nor T” is periodic, each must contain at least n +1 
` different n-chains for each positive integer n. Since T and T” have the same 
frequency « the b-lengths of the n-chains of both T and T“ satisfy (3.6) and 
hence (2.11)’. We infer from Lemma 8.3 that the totality of n-chains in 
both F and T” cannot consist of more than p + 1 different n-chains. It fol- 
lows that T and T” contain the same n + 1 different n-chains. The proof of 
the theorem is complete. 
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4 Mechanical sequences. Let & be a positive real number and c an 
arbitrary real number. On the real axis — œ < z < -+-œ we introduce the 
set of points à f : 

(4. 0) + +.1,6—28,c—B,c,¢e+f,c+ 28,°-- (B = 1/a). 
We term c the pole of this set of points. | | 

Let T(c,a) {T'(c, «)} denote the cell-series of the form (2.1) in which 
the i-th cell B; contains as many b’s as there are points (4.0) in the interval 
iS<eciti {fic eSi+1}. The b-length of an n-chain of T(c, a) or 
T” (c, æ) is either 8, or 8, + 1, where 


(4.1) © B= 808 + Tn (0Z rn <B). 
` It follows that T'(c, «) and F’(c,a) are Sturmian series. Observe that 


Lis 5 Sn n | 
TES 
so that æ is the frequency of both T (c, a) ‘and T'(c, a). 

‘When « == 0 we understand that (4. 0) is a null set ofpoints. The corre- 
sponding cell-series contains 1 no 6’s and will be denoted by either T(c,0) or 
T’(c, 0). 

If c, = c, mod £ the TERT ais (4.0) are identical and T (c1, a) 
=T (ca, a), T'(c,a) == T’(c:, a). The set of points congruent to smod 8 
will be denoted by P(z). The domain of P(z) will be regarded as a. circle T. 
The function P(x) maps the z-axis onto the circle T. In this map the image 
on T of a neighborhood of a point c on the z-axis will be regarded as a neigh- . 
borhood of P(c) on F. The circle T will be taken in the sense which cor- 
responds locally to the sense of increasing c. The interval PQ on T, P £Q, 
shall mean the segment of T which begins with P and ends with Q, taking 
T in its positive sense, and including P but not Q. When P == Q, the interval 
PQ shall be the whole of T. We term T the B-ctrole. 


Lea 4.1. If P(r) sé P(n 41) and œ > 0, an m-chain Tr, n] of 
T(c,a) is of max or min type respectively according as P(c) is on the | 
interval P(r)P(n + 1) or the complementary interval P(n + 1)P(r) of the | 
B-circle. If P(r) =P(n +1) all m-chains have the same b-length. 


This follows at once from the conventions upon noting that the type 
of an m-chain [r,n] of T(c,a) decreases when P(c) leaves the interval’ 
P(r)P(n +1) and remains invariant as P(c) varies on this interval, or its 
complement. In particular, suppose s is an integer between r and n +1 
inclusive. Suppose P(r) =£P(n +1). Then P(s) #P(s-+1) whatever. 
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the integer s. As P(c) enters an ca beginning with PC. P(c) vary- . 
ing in the positive sense on T the cells By and B, change thoir : types to min 
. and max, respectively. 

We define the alternate interval PQ, PQ, of T as the! P of T 
which begins with P and ends with Q, taking T in its positive sense, and 
including Q but not P. When P=Q, the alternate interval PQ shall be 
the whole of T.. The proof of the following: -lemma is analogous to that of 


© Lemma 4.1. 


Leama 4.1. Read Lemma 4.1 with T'(c, a) Gua ne T'(c,«) and 
_with the term interval replaced by alternate interval. | 


l 
THEOREM 4.1. If a is irrational, two series T (c, a) and T(a, a) 
{T (e, a) and T'(a,a)} are identical if and only if c=a mod p. ‘ 


If a-is irrational, the points P(n); (n—d,2,:-°-), ‘are everywhere 
dense on the B-cirele T, If ca mod 8, P(e) A Pld). There accordingly 
exist integers r and n with r<n such that P(c) lies on the interval 
P(r)P(n +1) of T while P(a) lies on the complementary interval. It 
follows from Lemma 4.1 that the chains [r, n] of T(c, a) and T(a, a) are 
. different. If ac mod £, T (a, a) =T (c,«). The proof} of the theorem 
is complete for the Sturmian series T (c; «) and T (a, a). i 

The proof of the theorem for the series T” (c, a«a) and T” (a, a) is similar. 

The residue intervals. “Suppose a==q/p with q > 0, p>0, „and 
(q, p) = 1. Let m be an arbitrary integer. Then ae 

qm =sp+r, OSr<p, 
where s and r are integers. Hence 
Pai 
| M == 8 q + ma | 
It follows that the numbers r/q with r==0,1,: °°, p— 1, form a complete 
‘set of residues mod 8 of the rational integers, and the point set P(n) on the 
_ B-cirele T reduces to the set of p points P(r/g). The latter set is identical 
with the set of points | i ; 
(4.2) POLPO) a PI) | 


. since no two integers for which 0 S nS p— 1 are congruent mod £. 

The points (4,2) divide the 8-circle T into p successive intervals termed 
residue intervals if the initial but not the terminal point | jis included in an 
interval, and the alternate residus intervals if the terminai but not the initial 
point is included in an interval. 

The following theorem is an easy consequence of Lemrits 4,1 and 4. 1’. 


Å% 
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THEOREM 4.2. When a is positive and rational, ‘two cell-series T (c, a) 
{T’(c,a)} are identical if and only if the corresponding points P(c) lie on 
the same residue interval {alternate residue interval}. ` 


THEOREM 4. 3.. When a is irrational, T (a, a) = T'(c, a) if and only if. 
ac=c mod B and cs&m mod B where m is an integer. If a= c= m mod B, 
m an integer, the corresponding cells of TF (a, «) and T’(c, a) are equal except 
that Bm and Bn are of max and min type respectively in T(a,«) and of 
opposite types in T’(c, a). l 


If cm mod 8, where m is an integer, the number of points of the set 
(4.0) in the interval i& x <t<+ 1 is identical with the number in the 
interval i< e++1. Thus the cell B; of T(c,a) is identical with the 
cell B; of T’(c,a) and T (c, «) =T (c,a). If a= c mod £ it follows from 
Theorem 4,1 that T (a, a) =T (c, a) and hence T (a, a) = T’ (c, a). 

- Conversely, let us assume that T (a, «) = T’ (c,a). Since a is irrational 
the points P(n), n==1,2,: : -, are everywhere dense on the B-circle T and 
by arguments similar to those given in the proof of Theorem 4. 1, it is easily - 
shown that a==c mod 8. If c=2m mod £, the interval m—1< em 
contains one more point of the. set (4.0) than does the interval 
m— 1 v < m, namely the point m. It follows that the cell Bu_, is of min 
type in T(a, a) and of max type in 7’(c,a@). Similarly, the cell Bm is of 
max type in T (a, a) and of min type in 7’(c,a). Thus if T (a; a) == T (0, a) 
we-must have a= c >£ m mod 8 — 

- The second statement of the theorem follows readily. 

The following theorem is easily derived with the aid of Theorem 4. 2. 


THEOREM 4.4. When, a is positive and rational, the cell-series T (a, a) 
and T’(c,a) are identical if and only if the residue interval in which P (a) 
lies, coincides, except for end points, with the alternate residue interval in 
which P(c) lies. 


THEORÐM 4. 5. If a is irrational, T (c, a) a a) {T (6, a) ~T (a, a)} 
if and only if c— a + pp + q, where p and q are integers. 


If c= at+pB-+q it follows from Theorem 4.1 that T(c, a) 
=T(a+q,2). But the chain [r,s] of T(a,a) is identical with the chain 


fr+gs+ al of T(a+q, a), independently of the values of r and s, and 


hence T (a, a) ~T(a+q, a) =T(c, a). ' 

To prove the converse we assume that T (e, a) ~T (a, a). It follows 
that there exists an integer q such that the cell B, of T u a) is identical 
with the cell Big of T(a,a) for all integral values of +. The cell By of 
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T(a+q,) is identical with the cell B, of Tic, x), Thus T(c, a) 
` =T (a4 q,a) and we infer from Theorem 4.1 that c==a + gq, mod 8, or 
c =a + pB +q, where p and q are integers. 

A similar proof applies to the pair T” (c, a) and T’(a,%). The proof of 
the theorem is complete. | 


| THEOREM 4.6. If ais rational the chi T(c,%) and T'(c, a) are 
periodic and any two of these cell-sertes are similar, whatever the value of c. 





The periodicity of these cell-series is evident. They are all Sturmian 
series with the same rational frequency a. It follows from Theorem 8. 5 that . 
any two of these cell-series are similar. The proof of the theorem: is complete. 

If a is irrational and c>4m, mod B where m is an integer, the cell-series 
T(c,a) and T’(c,a) are identical. Thus the class of cell- series T(c, a) 

. corresponding to a given value of « includes most of the cell-series T” (c, a). 


1 


However, as stated in the following theorem, there are exceptions. 


THROREM 4.7. If a is irrational and c= m, mod B, where m is an 
integer, the cell-seties T’(c, a) is not similar to any cell-series T (a, a). 


Let us suppose that 7’(c,a) and T'(a,a) are similar. It follows that - 
there exists an integer q such that T(a+ q,a) =T (c,a). We infer from 
Theorem 4.3 that cé m, mod £, where m is an integer. From'this contra- 
diction we infer the truth of the theorem. | a | 

The cell-series S(m, a) and S’ (m,a). The preceding cell-series T'(c, a) 
and T’(c,«) include no skew Sturmian series. To obtain such cell-series we 
introduée new mechanical sequences as follows. Let c be a rational integer m 
and a positive and rational. In S(m,a) the number of b’s in By shall equal 
the number of points of the set (4.0) on the intervals | 





n<Tr£<n+i, m<r<m+il,. Es ni 
| 


according as n < m, n= m, or n>m. In S’(m,a) the number of bein B, 
shall equal the number of points of the set (4. 0) on the intervals | 


n<iSnti, mSeSm+l, n<eSn+t, 


according as n <m, n=m, orn>m. As a special convention we under- 
stand that S’(m,0) shall-consist of null cells except that Bm ‘shall be b. 
S(m, 0) will not be defined. 


24 

THEOREM 4.8. The cell-sequences & (m,a) and S’(m,a) are skew 
Sturmian with a critical chain of min and maz type respectively. The cell 
Bm is the initial cell of the critical chain of S(m, a) {S’(m,a)}. | 


I 
‘ 





Sf 
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In case a == 0, all the cells of S (m, «) are null save Bn and the theorem 
is obvious. We accordingly assume that a > 0. . 

To establish the theorem it is sufficient to show that S {S’} is Sturmian. 
_ To that end let-@ be the b-length of an n-chain of S {S’} whose initial cell is 
B,. Then 9 is the number of points of the set (4.0) on an interval Z(S) 
{I(8’)} of length n beginning at the point z == r. Various cases arise accord- 
ing to the nature of n according as J includes one, both, or neither of its end 
points. We represent in the form 


i n= SnB H tn Or, <B, 
and distinguish between the two following cases. 


Case I. r,£0. Here I contains at most one of the points (4.0) as an 
end point and O == Sn Or 8x +1. The comparison Condition C is accordingly 
satisfied. | 


Case II. r,—0. When rs4m mod £, no point of (4.0) ‘is at an 
end point of I and 8—s,.. When r==m mod £; there are points of (4.0) at 
both points of Z(S) {1(S’)}. We see that 6 = 5, or sr>— 1 in S and Ss, or 
Sn +1in 8’. The Condition C is accordingly satisfied. 

Thus 8 and S’ are Sturmian. They have the frequency a. They are not 
periodic by virtue of the definition of Bm. Moreover S and S’ are skew 
Sturmian with critical p-chain [m, m + p— 1]. For the b-lengths of this 
p-chain-in S(m, a) or S’(m, a), respectively, are the number of points of the 
‘set (4.0) on the intervals 


m<acm+op, mSeSm+qp, © (P=), 


and in either case are different from q, the length of every other p-chain. 
‘Hence Bm is the initial cell of the critical chain of S or 8”. 


5. On the representation of Sturmian series by mechanical sequences. 
The mechanical sequences are Sturmian. We show conversely that any Stur- 
mian series is identical with a properly chosen mechanical ‘sequence. 


THEOREM 5.1. A poriedic Sturmian series U with fraguonoy a a is identi- 
cal with T (c, a) for suitable choice of c. 


If a = 0 the only symbol appearing in p and T (e, H is a and the theorem 
is evident. 

We assume «==q/p>0, where (q, p) =1. According to Theorem 
4.6, T(c,a) is a periodic Sturmihn series. Since U and T(c,a) are periodic 
Sturmian series with the same frequency, we infer from Theorem 3.5 that U 

2 
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‘and T(c,a) are similar, The series U has the cell-period pai thus there 
are at most p different Sturmian series which are similar to Ọ and such that 
no two are identical. According to Theorem 4. 2, the cell-series T'(c,a) and 
T(a, a) are ‘identical only if the points P(c) and P(a) lie on the same resi- 
. due interval. Since there are p residue intervals corresponding to & — q/p 
` it follows that there’are p cell-series T'(c,.#), no two of which are identical. 
Since all of these cell-series are similar to U, we infer that one of them is 

identical with U. The proof of the theorem is complete. 


THEOREM 5.2, <A skew Sturmian series U with frequency ais identical 
with one of the cell-series S(n,a) or ae) for suitable choices of the 
integer n. l | 


In the case «== 0 the cell-series Ọ is “evidently identical. with F (m, 0). 
We assume a = g/p > 0, where (q, p) = Í,. Let U be aiskew Sturmian 
series with frequency @, with critical chain of min type and with Bm as the 

‘initial cell of its critical chain. Then U and 8 (m,a) are skew Sturmian 
series with the same frequency and with critical chains of the same b!lengths. 
It follows from Theorem 3.6 that U and S(m,«) are similar, According to 
Theorem 4. 8, Bm is the initial cell of the critical chain of S(m, a). Tt follows 

` from Theorem 8.8 that: S(m, a) and U are identical. | 
Similar arguments show that if the critical chain of U is of max! type, U 

-is identical with S’(m,%) for suitable choice of m. | 

LEMMA 5.1. An m-chain B whose n-chains satisfy the condition 


j l 


(5.1) ‘ na—1<bn< na +1 


. for some a = 0 has a copy in the cell-series T (c, a) for each ial of] 





a 
. 


The case a = 0 is trivial since b, = 0 for each n when a= 0. 

If æ is irrational the cell series T (c, a)‘ contains m +1 different m-chains 
(cf. Theorem 3.7) whose n-chains satisfy (5.1) (cf. Theorem 3.2). If the 
n-chains of a set of m-chains satisfy (5.'1), they satisfy (2. 11)” land are » 
Sturmian. It follows from Lemma 3. 3 that the number of different m-chains - 
in such a set cannot exceed m +1. Thus T (c, a) contains all : ‘à -chains whose 
n-chains satisfy (5.1) and in particular T(c, &) contains B. : 

-If a=-q/p0, where (q, p) 1, arguments similar to those of the 
irrational case ‘apply if m < p. It follows from (5.1) that'all p-chains of 
B have b-length q; B is periodic with cell-period p and is completely deter- 
-mined by its initial (p—1)-chain B* and the integer q. Since T'(c, q/p) 
contains all the p possible (p — 1)-chains, Whose n-chains satisfy (511) (cf. 
Theorem 3. 5 and Fe it contains B*. But T'(c, q/p) is periodic with cell- 


+ 
1 
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period p and each of its p-chains is of b-length g. It follows that T (e, ¢/p) 
contains B. : 
The proof of the lemma is complete. 


THEOREM 5.3. A Sturmian series U with irrational frequency a 1 
identical with T'(c,«) or T’(c, a), for at least one value of c. 


Let [r,s:a] denote the chain [r,s] of T(a,a) and let [r,s:a]’ denote 
the chain [r,s] of T’(a,a). If n is any positive integer it follows from 
Lemma 5. 1 that the chain [— n, n] of U is identical with a chain [— n -+ y, 
n+ Pn:c] of T(c,«) and hence that [—n,n] is identical with the chair 
[— n, n:an] of T(as, x) where an —c— px The points P(an) of T have 
a cluster point P(a) and we can assume the sequence ni, (i = 1,2,- +>), sc 
chosen that the points Gn,,@n,° °° vary in one sense on the a-axis and 
approach «=a as a lim point. With increasing 1, the points 
(5. 2) an, — mB, "ty Guy — Ê, Guy ani + B5° * +, an, + MB, 
approach the points 
(5.3) ` a—mp,: ` `,a—p, a, a +B, `+, a+ mp, 
respectively, either from the right or from the left. If a==k mod £ for nc 
integer k, no point of the set (5.3) is integral. For a given m and i suff- 
ciently large the chain [— m, m:a» ] is identical with the chain [— m, m:a] 
If i is also chosen so large that m S mx, the chain [— m, m] of U and the 
chain [— m, m: an,} are identical. It follows that the chain [— m, m] of U 
is identical with the chain [— m, m:a] of T (a, a) for every positive integer 
m and hence U is identical with T (a, a). 

If a= k mod, where k is an integer, and the points (5.2) approach 
the points (5.8) from the right, the chains [— m, m:an] and [— m, m:a] 
are again identical for fixed m and for sufficiently large î. Again U is identical 
with T(a, a). 

If ak, mod £, where k is an integer and the points (5.2) approach 
the points (5.3) from the left it is easily seen that the chain [— m, m: an,] 
and the chain [—m,m:a]’ of T’(a,a) are identical for fixed m and for i 
sufficiently large. In this case the chain [— m,m] of U is identical with the 
chain [—m,m:a]’ of T’(a,a) for every positive integer m. But this 
implies the identity of U and T” (a, a). 

The proof of the theorem is complete. 

6. The continuation of Sturmian series. A Sturmian n-chain which 


appears as a subchain of a Sturmian series T will be said to admit T as a 
Sturmian continuation. 
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THEOREM 6.1. ‘Each Sturmian chain z admits aleph continuations. 


By virtue of Theorem 2.3 there ‘exists an open interval of values of « 
such that the n-chains of x satisfy the relations (5.1). It follows from 
© Lemma 5.1 that for each such value of a there exists a constant c such that 
_ T(c, a) contains a copy of x. If aa’ the cell series T(c, a)i and Ti(a, a’) 
are not identical so that there are aleph Sturmian continuations of z. 

A Sturmian beam whose cells B, are respectively identical | with the cells 

B’; with the same index in a Sturmian series T will be said to} admit 7’ as a 
Sturmian ‘continuation. 


THEOREM 6. 2. Bath Barwa beam R with na frequency a 
admits at least one and at most two Sturmian continuations.. In the case 
where R admits different continuations these continuations; ‘are identical 
respectively with the cell- -series D a) and T’(m, a) PE a suitable choice of 
the integer m. 


If the Sturmian bo R with irrational tee admite. a Sturmian 
continuation, we infer from Theorem 5.3 that this continuation is identical 
with one of the cell series T (c, a) or 7’(c,a) for a suitable choice of c. The 
Sturmian beam R does not admit two distinct Sturmian continuations| of the 
form T(c,a) and T(a,a). For T'(c, a) and T (a, a) would then have a com- 
. mon beam and it would follow as in the proof of Theorem 4.1 that: 
-c= a mod £, and hence T'(c, a) = T (a, a). Similarly R admita at most one 
Sturmian continuation of the form T”(c, a): 
i Essentially as in the proof of Theorem 5. 3, 'so here it follows that & 

. possesses at least one continuation of the form T (c, «) or T” (c, a). If T (c, a) 
and T”(a, a) are different Sturmian ‘continuations of R, it would follow as in © 
the proof of Theorem 4.1 that a==cmod 8. But since T(c, a) =Æ Ti (a, a) 
we conclude. from Theorem 4. 3 that c= m mod £ where m, iis a suitably 
_ chosen’ integer. But then- T(c,a) and T'(a, a) are identical with T(m, à) 
and T”(m, a) respectively. The proof of the theorem is complete. 


THEOREM 6.3. A non-periodic Sturmian beam R urth. rational frequency 
a admits a unique Sturmian. continuation. i 


Without loss of generálity we can assume that Æ is of the form 
| OB QB ya “-. | 


i 


If a = q/p, he (q, P — 1, it follows from Lemma 3.1 that + there] exists 
cne and only one p-chain of R whose b-length is different from g. We term 
this chain the critical chain C of'R. Exactly as in the proof of Theorem 3. 3, 
so here it follows that the beam following the initial cell B’m of C and the 
chain preceding the terminal cell of C are periodic with cell-period p. 


! i 
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The b-length of the n-chains of R satisfy one of the conditions (2.11), 
` say (2.11)”. The b-lengths of the n-chains of S(m, a) also satisfy (2. 11)” 
and it follows as in the proof of Theorem 8. 6 that the critical pchains of E 
and S(m,a) are identical, That their cells have the same indices follows 
from our choice of m. In view of the relation of S (m, a) to its critical p-chain 
as disclosed in Theorem 3.3 and the relation of R to its critical p-chain, we 
can affirm that À appears as a beam of S(m, a) and of no other skew Sturmian 
series. 
In the case where R-satisfies (2. 11)’ ‘similar arguments apply and show | 
that S’(m, «) is the unique continuation of R. 
The proof of the theorem is complete. 


THEOREM 6.4. A periodic Sturmian beam R with rational frequency 
a == q/p > 0 admits three dissimilar Sturmian continuations of which one is 
periodic, and two are skew with critical chains of different type. If a = 0, 
E admits two Sturmian continuations of which one ts periodic and the other 
is skew. 


The proof of Theorem 3.5 shows that if two periodic Sturmian beams 
have the same frequency a == q/p where (g, p) ==1, they have the same cell- 
period p and contain the same p-chains. If «> 0, each of the cell-series 
T(c,a), S(m,a) and S’(m, a) contains a periodic beam similar to R. If c 
and m are suitably chosen, each of these cell-series will be a continuation of R. 
The cell-series T'(c,a), S(m,a) and S’(m, a) are dissimilar and it follows 
from Theorems 3. 5 and 3.6 that any other Sturmian series with frequency x 
is similar to one of these. i 

If a = 0 it is easily seen that T (c, 0) and S’(m, 0), where m is a suitably 

` chosen integer, are the only dissimilar Sturmian continuations of R. 

The proof of the theorem is complete. _ 

We distinguish between T(c,a) and T’(c, «) by assigning a type- -indez 
+1 or — 1 respectively to these series. We can similarly assign a type-tndex 
1 or — 1 to the series S(m, a) and S’(m, «) respectively. Thus-every Stur- 
mian series T possesses a frequency, at least one pole and a type-index. As we 
have seen, T admits a mechanical continuation i as determined by these 
numerical characteristics. 

In Part TI a class of similar Sturmian series will be called a iinet 
trajectory. We note that the members of such a elass admit the same numerical 
characteristics. 

II. Tae REOURRENOY FUNOTION. 


7. Sturmian trajectories and rays. We return to the concept of tra- 
Jectories and I-trajectories of SD, using the preceding symbols a and 6 as 


rk 
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generating aaa Recall that an I- “trajectory A is an indexed eee of 
the form i í 
(7. 1) EET 96 CoCiCa * 


in’which the symbol c, is a or b. The class of I- recois " s similar 2 to A 
is a trajectory Q represented by. x 
Let z be an m-block of Q. The numbêr of symbols a or b in x will be 
termed the a-length or b-length respectively of z and written a(z) jor b(z). 
Corresponding to the comparison condition of $ 2 we here introduce the fol-. 
lowing condition. 


, S. Under Condition S the a-lengths (b-lengths) of two m-blocks gk 
‘the same m shall differ by at most one. ` 


A trajectory whose'blocks satisty Condition S will be termed a Sturmian 
‘trajectory. e 

Prior to the present section we have been considering Sturmian series. 
Such series are sequencés in which. the cells are indexed rather than the sym- 
bols a and b. They gre accordingly logically distinct from indexed trajectories. 
In the latter the individual symbols a and b are indexed. Each Sturmian 
series T however defines a trajectory Q consisting of the symbols g and b 
appearing in T ordered as in T. We shall say that Q is represented by T. 

In any Sturmian trajectory at least one of the symbols a or b | appears 
infinitely many times preceding and following any given symbol. If lin par- 
ticular the symbol a does not so appear, the trajectory T must have one of the 
two following special forms: ! 
(7.2) | + -bbbbb-;:, 


(7.3) | -> bbabb- >- 


1 





These trajectories will be called b-trajectories. 
The trajéctories defined by Sturmian series always include infinitel many 
as and so never include the b-trajectories. More ‘precisely, ' we haye the 


following theorem.. i 


THEOREM 7. 1. The conne defined by Siurmian sertes satisfy Con- 
dition S and include all such trajectories except the b- -trajectories. 


-Let T be a Sturmian series and let Q be the trajectory defined by T} Let 
z and y be arbitrary m-blocks of Q. Let u be the chain of maximum a-length 
in z, and v a chain of minimum a-length in F containing y. ‘We seé that 
a(v) = a(y) +2, and that a(u) —=a(z). If it were true that: 


(7.4) a(y) +2Sa(z), ° 
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it would fotlow that a(v) = a(u) and there would be a subchain of u with 
the a-length of v. We could then infer from Condition C that 


(7.5) b(v) Sbu) +1. 
From (7.4) and (7.5) we find that 
a(y) + (09 Z a(z) + bu) —1. 
But this is impossible since 
m£ a(y) +b(v), a(z) +b(u) Sm. 


Relation (7.4) is accordingly false. We infer | a(x) —a(y)| <1. Upon recall- 
ing that + and y have the same length m we conclude that | b(z) —b(y)| 1. 
The trajectory Q thus satisfies S. 

Conversely, let Q be an arbitrary Sturmian trajectory which is not a 
b-trajectory. It is clear ‘that there are no unending sequences of b’s in Q. 
The symbols b of Q can therefore be grouped into maximal blocks of symbols 
D each preceded and followed by a symbol a. There accordingly exists a cell- 
sequence 7’ whose symbols a and b appear in T in their orter in Q. It remains 
to prove that the chains of T satisfy Condition C. 

Let x and y be two s-chains of T. If b(y) < b(x) — 1, the subblock of 
x obtained by dropping the two terminal a’s of x would contain a subblock z 
of the length of y. Then a(y) —a(z) = 2, contrary to the fact that Q satis- 
fies Condition S. Hence b(y) = b(xz) —1. Similarly b(x) = b(y) — 1. 
Thus T satisfies Condition C. 

We have seen that a Sturmian trajectory Q which is not a b-trajectory is 
representable by a Sturmian series T. The n-chains of T will be termed 
n-chains of Q. It is clear that the class of n-chains of Q is independent of 
the choice of the Sturmian series 7 representing 2. 

A non-special Sturmian trajectory will be said to have the frequency « 
of any Sturmian series T representing Q. It is clear that « is independent 
of the choice of Sturmian series T representing Q. A special Sturmian tra- 
jectory will be said to have the frequency a = co. 

A Sturmian trajectory Q defined by an irrational or skew Sturmian series, 
respectively, will be termed irrational or skew. The special trajectory (7.3) 
will also be termed skew Sturmian. Sturmian trajectories defined by periodic 
Sturmian series are periodic in the sense of SD. They include all periodic 
Sturmian trajectories except (7.2). 

A trajectory Q is recurrent if corresponding to any positive integer n 
there exists an integer m such that each m-block of 2 contains a copy of every 
n-block of Q. The least such value of m is called the n-th recurrency index 


` 


, l 
è | 
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Æ(n).of Q and the Foci R(n) is T the recurrency funeiidn of Q (cf. 
SD, p.-827). It is clear that a skew Sturmian trajectory is not recurrent. 
We shall show that all other Sturmian trajectories are recurrent. 

The Sturmian trajectory >+ bbb- +- has the recutrency function 
R(n)-=n. Any other periodic Sturmian trajectory has a finite rational - 
frequency « and is represented by 7'(0,a). It is clear that Q isi recurrent. 
The rectxrency function of 2 does merely on a and will be denoted by 
R(n, a). 

To show that irrational Sturmian trajectories are recurrent we is need 


the following lemma. l 


Lemma 7.1. If Q is a Sturmian suai with frequency aland OF is 
a limit trajectory of Q, then Q ig a Sturmian trajectory with frequency a. 
Since 0’ is a limit trajectory of Q, every block of O appears in @. Hence | 
0’ satisfies Condition S and is Sturmian. Every #-chain of 0’ appears in Q, 
and since the definition of the frequency « leaves the choice of n-chains arbi- 
trary subject to the condition that n become infinite, it appears that Q and ©” 
have the same frequency. The proof of the lemma is complete. 
Now consider a Sturmian trajectory Q with irrational frequency a. Any 
limit trajectory 9’ of Q is Sturmian and has the frequency a. ‘It follows from 
Theorem 3..7 that Q and Q contain the same chains and hence the same blocks. 
The permutation number P(n) of Q (cf. SD, 86) is accordingly identical 
with that of Q’, so that © is a minimal trajectory. A minimal trajectory is 
recurrent as stated in Theorem 7.2 of SD. Finally, the recurrency function ` 
of Q depends only on a. For the chains and blocks of Q are exactly those 
of T(0,«) so that the recurrency function of Q is that of +. + We thus 
have the following theorem. , | | | 
THEOREM 7.2: dey Sturmian trajectory with irrational frequency is 18 
recurrent with a recurrenc y function R(n,a) uniquely determined by a. 


8. The derivation of Sturmian trajectories. Let Q be a re 
trajectory represented by a cell-sequence 


representation 


| 
(8.1) - aB_,aB,aB,a- | 
Corresponding to @ we introduce a new trajectory 0’ with an i 
i (8. 2) A + C-2C_100C1C2 * one x | i | 


defined as follows. Let ċ;==« if B; is of minimum type, and let c; = b if 
B; is of maximum type. If all cells B; are of the same type let c; =a fot all i. 


| ‘The trajectory © will be said to be dertved from © and the I- representation 


| 
| 
E 
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_ (8.2) of Q” will be said to correspond to the representation (8.1) of Q. We 
proceed with a proof of the following theorem. 


THEOREM 8.1. Let 0’ be a trajectory derived from a recurrent Sturmian 
trajectory Q with a frequency a. The trajectory Y is Sturmian and has a 
frequency | | 
(8.3) Of = 


where w == a — [a]. 


w 


1— ov 





Let k be the number of b’s in a cell of Q of minimum type. Suppose Q 
represented by (8.1). Let x be an arbitrary n-chain of (8.1) and let y be 
the corresponding n-block of (8.2). Let b, denote the b-length of æ and let 

- Na and my denote the a-length and b-length respectively of y. Each symbol a 
in (8.2) corresponds to a cell of (8.1) of b-length k, and each symbol b of 
(8.2) corresponds to a cell of (8.1) of b-length k + 1. -Hence 


ba — Ena + (k + 1) ns. 
But na + no = n so that bn may.be given the forms 
(8. 4) bn = (k + 1)n — ns = kn + m. 


Since (8.1) is a Sturmian series, b, varies by at most one for different 
n-chains x of (8.1). Hence the values of na {rw} differ by at most one for 
different n-blocks y of (8.2). Thus 0’ is Sturmian. K 

It remains to evaluate the frequency @ of Q. First observe that the 
symbol a occurs infinitely many times preceding and following each symbol of 
(8.2) since Q is recurrent. Suppose the block y of (8.2) is an m-chain. 
Then m = ne — 1 and 





d = lim 2 = lim — = lim © 
moo M no Ta — 1 no Na 
Upon making use of (8.4) we see that 
by — kn a— k 


sa n e ee ee 


If a is not an integer it follows from (2.11) that k is the least integer 


such that 
f ' a—l<k<a+1. 


Hence k =— [a]. If æ is an integer each cell of Q has the b-length a and 
k = a = [«]. Hence (8.3) holds as stated. 


3 [a] is the maximum integer not exceeding a. 
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COROLLARY. If Q is periodic or irrational, then ©’ is respectively periodic 


- or irrational. 


For is recurrent and «’ is rational or ivrational according as a is 
rational or irrational. 

Let Q be a recurrent Sturmian trajectory with frequency a andilet a be 
the trajectory derived from ©. Since Qe+is recurrent it has the following 
property of chain recurrence. Given any positive integer n there exists an 
integer m such that every m-chain of Q contains a copy of every #-chain of ©. 
The least such integer m will be denoted by p(n,a) and termed the chain 
recurrency function of Q. If o is the frequency of © it is clear that 


7 =E (», is) : 








where o = a — [«]. In particular, if OS «< 1, then [«] —0 and o = o 

so that | | 

(8.5) l (ma) =R (nz), (0=a<1). 
Upon setting | ; o! 
ksi < ‘ 

8 ta? (OSaK 1) 
we find that 
è 1 
= ——___— < + 

am | osek 20 ) 


so that (8.5) takes the form 
i ` § i 
(8.6) RnB ons), o Oska). 


The recurrency function R(n,8) of recurrent Sturmian trajectories will 
accordingly be known once we have determined the chain recurreney: function 
p(n, a) for 0a < 1. We proceed with a study of chain recurrency functions. 
9. The determination of p(n,a) in terms of the functions E(e, a) 
and I(n,a).. The function p(n, «) is the chain recurrency function of T (e, a) 
and is independent of c. We shall suppose that a is irrational inasmuch as 
the recurrency function of a trajectory with period w is o + n— 1 for n = w. 
‘We introduce the points 


(90) P(e +4), P(o+2),-- P(e +n), (n 21) 


on the B-circle T. These points are all distinct since æ is nat They 
` determine n'non-overlapping intervals PQ on T, where in the spécial case 
n == l, P.== Q and PQ is the whole of T. (For the conventions concerning 
tervals PQ see 84.) Let this set of intervals be denoted by I(c,n, a). 
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Since the set (9.0) can be obtained from a similar set with c replaced by č 
by a rotation of I into itself, the lengths of the shortest and longest intervals 
of I(c,n,«) are independent of c and will be denoted by L(n, a) and L(n, a) 
respectively. When n is infinite in (9.0) the points (9.0) are everywhere 
dense on T and it follows that | 

lim 1(n, a) «lim L(n, a) = 0. 

nw n>0o 

Let ¢ be a constant such that 0 < e5 8. Let E(e, a) be the least integer 
m such that the maximum of the lengths of intervals of I (c, m, a) is at most e. 
It is clear that H(e,«) is independent of c in (9.0). We term Els a) the 
ergodic function belonging to a. 

Recall that [r,s:c] denotes the chain [r,s] of T (c, a). 

LEMMA 9.1. A set of n-chains 

[1, 7: ai], (t= 1,2,---,%), 
contains all n-chains of T (c, a) if and only if there is a point of the set P (a), 
(i= 1,: ; :, k), in each of the intervals of the set 1(0,n + 1,2). 

An arbitrary n-chain [r,s] of T(c, &) is identical with the chain [1,7] 
of T(c—r,«). Hence the n-chains of T(c, a) are found among the chains 
[1, n] of T(a, «) for suitable choices of a. Observe that two n-chains [1, n] 
of Ta, a) and T(a@’,a) will be identical if corresponding subchains have the 
same type. Lemma 4.1 gives the conditions under which two such subchains 
are of the same type, stating these conditions in terms of the intervals of T 
defined by the points 

P(1),---,P(n+1). 


Lemma 9.1 follows from Lemma 4. 1. 


THEOREM 9.1. The chain recurrency function p(n, a) of a recurrent 
Sturmian trajectory has the value 


(9.1) p(n, a) = E[l(n + 1,4), a] + 2—1. 

We shall begin by proving the following: 

(a) If m is an integer which equals the right member of (9.1), then 
any m-chain x of T(c,a) contains every n-chain of T(c, a). 


The m-chain 2 is identical with an m-chain [m:a] for a suitable 
choice of a. Since m =n, the chain [1, m:a] contains the n-chains 


[1, n:a], [2n + 1:0], o, [m— n+ 1, m:a]. 
These chains are respectively identical with the chains 
(9.2) [L n:a], [1,n:a—1],---, [Ln:a—m +n]. 
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According to Lemma 9.1, the set (9.2) contains all n-chains| of T(c, a) 
provided there is a point of the set 
(9.3) u P(a), P(a—1),---,P(a—m+n) 
in each of the intervals of [(0,n-+-1, a). | 
By SPA NS in (a) : 


| m8 Lt E a) a] 
and it follows from the definition of Æ (e, «) that the maximum length of the 
non-overlapping intervals on T defined by the points (9.3) ig at most 
_U(n el 1,@). But the length of the shortest of the intervals Z(0,n + 1, a) 
is L(n + 1,a) and we conclude that there is a point of the set (9. 3) in each 
of the intervals of I(0,n + 1,a). The set (9.2) and hence [1, m:a] and z 
contain each n-chain of T (c, a). The proof of (a) is complete. : . 

-It follows from (a) that when m Site the right member of (9.1) _ 
p(n, a) Sm. Hence . 


(9.4) | p(n, a) S Eln +1 a), a] +n—1. 











We shall now suppose that m is an integer such that 
(9. 5) 0<m—n+1<F[l(n+1,2), 2] 


and show that there exists an m-chain of T'(c,«) which does not contain 
every n-chain of T (c, a). 
When (9.5) holds there is an interval A(a) of r of length I(n —- 1, a) 
containing none of the points (9.3). By choosing. a properly, the interval 
A(a) can be brought into coincidence with any. given, interval of length 
I(n-+-1,a) of r. There is an interval A* of length Un +1, a) in the set 
I(0,n + 1, a) and we can assume a 80 chosen that the interval A(a) coincides 
with A*. But then the set (9.2) of n-chains does not contain all jn-chains 
of T'(c,a) so that the m-chain [1, m: a] does not contain all n- ie of 
T(c,a). But the chain [1, m: a] is identical with an m-chain of iT (c, a), 
for the cell-series T(c,a) and T(a,«) contain the same set of m-chains. 
Thus, if (9.5) holds, there is an m-chain of T (c, a) which does not contain 
“all n-chains of T (c, a). We conclude that: 


(9.6) p(n, à) Z Ell(n +1, a), a] +n—1. 
The theorem follows from (9.4) and (9.6). 
19. The evaluation of the recurrency function of Sturmian tra- 
jectories. According to (8.6) the recurrency function R(n, a) of a recurrent 


Sturmian trajectory with frequency a > 0 is the chain recurrency function 
of a Sturmian trajectory with frequency a(1-+a)*—y. As given by (9.1), 


I 
i 
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the chain recurrency function p(n, y) is completely determined by the fanc- 
tion E[l(n +1,y),y]. We shall show that the latter function bears a simple 
relation to the denominators Dy of the successive convergents of the continued 
fraction representing y. | 
According to its definition (n,a), n = 1, æ irrational, is the length of 
the shortest of the intervals of T détermined by the points 
(10. 0) P(c-+1),P(c+2),---,P(e+n). 
Thus (nr +1, a), where n = 1, is the length of an interval P(c +-+)P(¢ + 5) 
of T where +547 and + and 7 lie between 1 and n-+1 inclusive. But the: 
length of such an interval is |s—r8 | 40, where s=|t—j| and r is a 
properly chosen integer. Thus 


l(n+1,a) —|s—r | 
where s is an integer such that 0 <s<n. Conversely, if s is an integer such 
that 0< ssn and r is any integer, then either the length of the interval 
P(c+1)P(c+s+1) or that of its ie aaa on T does not exceed 
|s—rB | and hence 
i l(n +1, wRr 
Hence we have the following lemma. 


Lemara 10.0. The function l(n + 1,a) is the least positive value of 


[s—r8 |=} ser] (a > 0) 


as r ranges over all integral values and s assumes the values 1,2,° ` >, n 


. We are concerned with the beliavior of R(n,a) for large values of n. 
If æ == q/p where (p,q) —1, the Sturmian trajectory has the period p-+-q 
and i . . 
E(na)—p+q+n—l1, n2p+g. 
We turn to the case in which a is irrational. Let 
a= [bo dsb] | 


be the development of æ as a continued fraction (cf. Perron, p. 39). The 
integers b; are uniquely determined by « and with the. possible exception of bo 
are positive. The successive convergents Ay/By, v= 0, of « are determined 
recursively by the formulas 
(10. 1) 1 A..==1, Ag=bo, Av byAv: + Av, (v1), 

" B. 0, Bol, Br—brBri+ Bro ` (v1). 
As is well known, the integers ‘Ay and By are relatively prime and 
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(10.2) | 1=Bo<B <Bi<'::. 
Set 
M, —By— Arp. 


| 
Leara 10.1. Corresponding to a given irrational a, ne + Tia) ts con- 
stant on each interval of the form Bra Se n< By, (v> De and there has the 


value | My | . 


If n is an integer satisfying the conan of the lemma and s is an 
integer such that 0<sin, it follows that 0<s< By. Recall that 
(Av, By) = 1, 80 that r/s ~Av/By no matter what the integral choice of r. 


It follows from a theorem of Lagrange (cf. Perron, p. 52) that 

(10. 3) Í | sa —r | EL] Braa — Ava | = | Ma |. | 

It follows from Lemma 10.0 that ° 
i(n + Í; a) =| Mya | . 


The function 1(n + 1, a) decreases monotonically with n so that for in = By.s, 


I(n+1,a) S1(Bv.+1,¢). 


By virtue .of Lemma 10. 0, J(Bv..+1,a) does not exceed the 
|s—rB| when s = By, and r— Ay. so that ; 





(n +1, a) S| Bra— Araf | = | Ma |. 


The proof of the lemma is complete. 


value of 


Recall that L(n, «) is the maximum length of the nr inter- 
vals of T determined by the points do, 0), and that it is ea of ¢ 


in (10.0). 

Lemma 10.2. For a irrational and vy > 0 
(10. 5) © L(Bva + By, @) S | Mra l, . 
(10. 5)’ (Bra + By — La) > [Mil, — 


except when v = 1 and: Bo = By. 


The left member of (10.5) is the maximum length of the non-overlap- 


~ ping intervals of T determined by the points 
(10.6) - P(1), P(2),- +>, P(Bva + By). 


To prove (10.5) it is sufficient to show that each point of the set (10.6) is 
followed on T by a point of (10.6) at a distance not Gros | My =l . We 


distinguish two cases according to the sign of My... 
Case I. My. > 00 ‘It follows from Perron, p. 42, that 


0 < Mvi = È (Bria — Avs) < . E: KAE E 


aN 
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On the axis of reals the point By, + t follows the point à at a distance Br. 
But By. == My. mod 8 and My lies between 0 and 8 so that P(Bv..+ 1+) 
follows P(t) on T at a distance equal to Mfy1. Since Mv, > 0, it follows from 
Perron, p. 42, that M, lies between 0 and — £ so that the point P (i) follows 
P(B» + i) on T at a distance equal to | My | < My.. Thus each point of the 
set (10.6) is followed on T by a point of (10.6) at a distance not exceeding 
Ay... The proof of (10.5) is complete in Case I. 


Case TI. y: <0. Arguments similar to those given in Case I show 
that each point of the set (10.6) is preceded (and hence followed) by a point 
of (10.6) at a distance not exceeding | Mv. |. 

The proof of (10.5) is complete. 

We turn to the proof of (10.5)’. It follows from Lemma 10. 1 that 

(By, CA) = | My | == (Brv + 1, a) 

so that there is no point of the set 
(10. 7) P(1),P(2),- --,P(B;—1) 
in the interior of an interval I of length 2 | Mtv. | with P(B») as midpoint. 
Both end points of J cannot belong to the set (10.7). For if this were the 
case and these end points were P(i) and P(j) where 1Si<jSBv—l, 
then 

B,—i=)—B, mod £. 
But this is impossible if @ and hence 8 is irrational. Thus it is clear that 
there is an interval I” of T of length exceeding | Mv. |, with P(B») as one 
of its end points, and with no point of the set (10.7) or P(B») in its interior. 

Consider the set 


(10. 8) P(B»), PURE Tes er Peat eH: 
There are By. points in this set and if y = 1, or if ==? and Bo = B,—1, 
the set consists of the single point P(B,) which is not in the interior of J*. 
In any other case it follows from Lemma 10.1 that the shortest distance on T 
between points of the set (10.8) is | My | > | Mv,]. But then a suitably 
chosen subinterval Z** of I* contains no points of the set (10.8) in its 
interior and has a length exceeding | Wv-ı |. There are no points of the sets 
(10. 7). or (10.8) in J**. This implies (10. 6)’. 

The proof of the lemma is complete. 

Leama 10.38. Ifv>0 and Br: = n < By, 
(10.9) E{l(n +1,a),a}= By. + By. 

For if By, n < By, it follows from Lemma 10.1 that 
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In + 1,2) = | Mvi |. 


According to (10. 5)’ there exists an interval of F oF length exceeding | 
which contains none of the points 





(10.10) . P(1), P(2),- © <, P(Bya+ By—1) 
and hence ` 
(10. 11) B{l(n + 1,4), a} > Brı + By — 1. 


It follows from (10.5) that every interval of T of length aha 
tains a point of the set (10.6) and thus. 


` (10.12) E{l(n + 1, a), a} S By. + By. 
The equality (10.9) is implied by (10.11) and (10.12). :- 


Recall that yma (1+a)t. Let y= [do, di, de,- * +] be ithe con- .: 


M. | 


| con- 


tinued fraction representation of y and let Cv/Dy be the corresponding v-th 





convergent. 

THEOREM 10.1. The recurrency function R(n,a) increases by unity 
when n increases from n—1 to n except when n is a denominator D, of 
y—=a(1+ a)". For these exceptional values of n, 

(10. 13) R(Dy, a) = Dr + 2Dy—1 (v= 0) 


starting with D, when Do== Di. R(n,a) ts thereby uniquely determined 


for all posttwe integers n. 

According to (8.6) and (9.1) 
(10.14) S (my) = BU (n-+1y),9) $81. 
If Dra Sn < Dy, it follows from (10.14) and (10. 9) that 
(10.15) R(n, a) = Dy+Dratn—1 
and consequently if Dr: < n < Dy; 
(10.16) R(n, a) = Dr + Dis +n—1—=R(n—1, 4) +1. 


But when a and hence y is irrational, each positive integer n not a denomina- 


tor of y lies between two successive denominators of y. Thus (10.16), 


holds 


if n is not a denominator of y. Upon setting n — D, in (10.15) we find 


that | | 
R(D», a) — Dy + 2Dv4 —, 1 
excepting the case where Dy. = D, = D. 

We infer the truth of the theorem. 


THEOREM 10. 2. For irrational « the recurrency functions R(n, a) and 
R(n,1/a) are identical; conversely if R(n, a) and R(n,«) are equal for all 


values of n, either a = a or d = g>. 
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Let Q be the Sturmian’ ee defined By T(c,«). The function 
R(n,«) is the recurrency function of G. The trajectory O* obtained from | 
T(c, «) by replacing a by b and b by a has the frequency at. But the defini- 
tion of the recurrency function of a Sturmian trajectory is symmetric’ with, 
respect to a and b and it follows that the recurrency function R(n, w+) of 
O* is identical with R(n, a). ° 

To prove the converse let us assume that R(n, «) and R(n, a! ) are equal 
for all values of n. It follows from Theorem 10.1 that the denominators: of 
a(1 +a) and of «(1 + )~ are identical as sets of numbers. Let Dy and 
D’, be respectively the »th denominators of «(1+ «)~ and of a’(1-+a’)* 
with y20. We distinguish between four cases: 

Case I. D, <D,, Do < D. i 

Case IIL. Do=D, Do =D. 

Case IIL D= Di D'o € D. 

Case IV. D, <D,, D'o D’. 

The values assumed by the denominators D, form a set of numbers identical 
with the set of numbers assumed by the denominators D’y. In Cases I and II 
` it follows that Di == D’, for all admissible values of ù But this implies that 
the continued fraction developments of a(1-+ a) and a’(1-+ a’)-- are 
identical. Consequently these members ‘are equal and a = g’, 

In Case III it is clear that Din == D’; for each non-negative integer LA 
It follows that 


a 
Te [0, 1, da, ds ~ +], 


a 
Tew et a de I 

_ It-is easily shown that ag’ == 1. The proof in Case IV is similar to the proof 
in Cage III. 


11. The asymptotic behavior. of R(n, a). We continue with the case 
of an irrational frequency «a. The constant y — a(1 -+ «)~ is then irrational 
and the denominators of y form an infinite sequence l 


(11.1) | 1=D SD: <Da<:: ` . 

Given a positive integer n there exists a unique non-negative integer v such 

that Dy = n < Dyn and according to (10.15), for these values of n 

(11. 2) R(n, EE E (Dy Sn < D). 

This implies that i 

Dry TE: 1 
D 2 


y 


(13) E E Sim ce (DEn X Du), 
Dya — 1 nh 


3 
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where the equalities on the left and right hold respectively when n = Dry —1 





and n = Dy. Let 
| Z 1, 





‘and recall that A (a) may be infinite. It follows from (11. 3) that 
Baa) a)" 


(11.4) . mmp 3% 2 + a(a), 
(IRB) . lim inf Rma) —2 +2 (a), 


understanding that x(a) = 0 if xt) == oo. If A(a) is finite the closed 


interval 
2Ha) SeS2-4X(2) 


will be called the limit range of R(n, a)/n corrèsponding to a. If 
‘infinite this limit range shall be 2 S x < ‘co. 


Aa) is 


Tarorew 11.1. . The limit range corresponding to any irrational a ts of 


length at least 1.and on this range the numbers E(n,a)/n, (n= 1, 
are everywhere dense. . 


It follows from (10. 1) that 
Dyas Dy 

















REP 


— = = > 
Dy Dy dys =l (v= 0) 
and hence if A(«) is finite ae 
Don az, D, 1 > 
. Him sup = D, lim inf D, A(a) Nay = 1 


| 
The length of the limit range in this case is least L If À (a) = = œ, the length 


of the limit range is evidently infinite. : 


To prove that the.set R(n, a) /n is everywhere dense on the limit range, | 
let x be any point in the (open) interior of the limit range. ‘It follows from 


(11. 3) that there exist arbitrarily ae values of y such that 
| Dy Do — 1 
2 cee D. rs <e<2 Tep De 


According to (11.2), if n is an integer such that Dp Sn<n+1 
then aS 
R(n +1, @) < Bla, a) a) 
n +1 on 


and thus if n is a properly chosen integer - 


n +1 


AO LS LED, (DEn<n+1< D) 


< D V+ls 
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Since R(n + 1,&) exceeds R(n,a) by 1, the length of this interval is 


R(n,a) R(n+1,¢) _ B(n+1, 4) —(n+1) z! 
n n +1 n(n +1) ae ae 
But n becomes infinite with v so that the length of this interval approaches 
zero. This implies that the set R(n,a)/n is everywhere dense on the limit 
range, and the proof of the theorem is complete. 

The Sturmian trajectories thus yield no example of a non-periodic Stur- 
mian ‘trajectory with recurrency function R(n) such that R(n)/n has a finite 
limit as n becomes infinite. Whether or not there exist more general non- 
periodie recurrent trajectories such that this limit exists is at the present 
unknown. | 

The numbers « and a’ will be said to be equivalent if there exist integers 
a, b, c, d with ad — bc = + 1 such that 

aa +b 
éd ca + ca+d” 

THEOREM 11.2. The limit ranges corresponding to equivalent trrational 
values of æ are identical. 


Let & and @ be equivalent irrational numbers. Then «(1-+ «)1 and 

a (1-+ a’) are irrational and can be represented by continued fractions 

a(lta)*= [do dy, du * * “I, 

@ (1 + a’) == [do d'ai d'a, re “|. 
It is readily shown that the equivalence of æ and a’ implies the equivalence of 
a(1+ a) and o’(1-+ a’) and -consequently (Perron, p. 65) there exist 
integers k and 7 such that 

deu = Ahris 421. 
¢ 

Let Dy be the denominator of the y-th convergent of #(1-+ «)~ and let D’, 
be the denominator of the rth convergent of (1 -++ a). Then (cf. Perron, 


p- 32) 


Dry 
nee = [dui des +, dun, di + +, dl], 
k+i-1 
D + {à [à j d 
D DE mm Hide . Wa, d'y" i *, di] 
+1 


as [diss dksi-i» Ta dis, LEP AS di]. 


D D444 ) 
z -0 
y im (Fee D'ira 


It follows that 








and hence 
A(a) =A (g). 
The theorem follows directly. 
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Taxorem 11.3. The limit range corresponding to (V5 + 1)/2 ts the 
interval E . 

: 5 b+ V5 
(11.6) te << PV 


The limit range corresponding to any irrational a which is not equivalent to 
(V5+1)/2 contains the interval (11.6) in its interior. 


It a= (V6 +1)/2, 








| penp a — [0,1,1,1,: - +] = [do di, du, |]. 
But then ; 


= [dva, dr, ` eee da, dy] = {1, 1, Pr ; 1, 1] 
and igs 


= [1,1,1,: . j= yoo 





lim 21 
À me 
(a) 7A 


Thus the limit range corresponding to a = ( V5 + 1)/2 is | 


ENS 24At (a) £a 2 + À(a) VE, 
It follows from Theorem 11.2 that the limit range corresponding to any 

a which is equivalent to (V5 + 1)/2 is also the interval (11.6). | 
Suppose & is not equivalent to (V5 +1)/2. Let [do d * 4] repre- . 

sent the new value of y'and let Cv/Dy be the »-th convergent of y. It follows 

‘from Perron, p. 65, that there exists no integer & such that l 


"à ne dy = du == 
Consequently there exists an infinite sequence vı <v, <“: - such that 
dv Z 2, (t= 1,2, -). But then 


Dry, 
Dy, 


3 


= [dress dy, wets da, d] z= 2, 
and hence | 


A(a) = lim sup DE 2 2. 





_ It follows that the limit range of a contains the interval 
2 Srs 4 


which includes (11. 6 ).in its interior. 
The proof of the theorem is complete. 


THEOREM 11.4. If a— (V5 -+ 1) /2, 





# be. 
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3+ V5 Em a) 5+ VE 


j z n 8 
For the given value of a ` . 
a eg ire ota J. 


The relations (10.1) corresponding to the convergents C,/D, of a(1 + a) 
become 























(11.7) Ca=1, Co=0,. Dim ts eo (v2 0), 
j D = 0, D, = 1, Dyas = Dy, + Dray ‘ (x = 0). 
It is an easy consequence of these relations that i 
(11.8) Cy == Dvi, 7 (v= 0). 
Since Do == D; == 1 each positive integer n lies on án interval of the form 
IDSE n < Dru, | : (v > 0). 
It follows from (11.3), (11. K and (11.8) that 
Ele a) a) < Pract ee Cy—1 5 Cv 1 1 
(11.9) ——=— S$ 2 + — p — =8 + -5 Bat pT on 
By virtue i Perron, p. 48, a the relation &(1 + a) = at, 
1 1 3 | & Ge ed 
t>- Te De aie. PA 
It follows that 
x +9 1 1 
| DP a D, S% (> 0), 
and-upon using (11.9) that 
R öt Vö 
l 209) CPP RTE IX VS | 
Similarly it follows from (11.3) and (11.8) that 
R(n, a) > Dy Ora 1 Ova 
n 2t Dal ÉD S PE E 
But | 
Cv 1 Crus 
. | Dyas ~~ = DaD S (Dvn = + 1)Dy, 1 (v > 0), 
and hence , 
- BOD sits -3+5 


The proof of the theorem is PE 
It is clear that R(n, «) becomes infinite with n. For an rene value 
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of a it follows from SD, p. 830, that R(n, a) = 2n without ee The 


following theorem is concerned with a more precise description of 
in which R(n, a) becomes infinite with n. It is natural in analysi 


e manner 


of this 


character to be concerned with R(n,«) for “ almost all values” of & rather 


than with the function R(n, «) for individual values of a. 


Let (z) be a function which is postive and non-decrensing for += 0, 


and such that Jim p(T) — + œ. 





Tu i. 5. For almost all a 
| | R(n, a) 
fn ice nh (log ft) 
ts is finite or infinite according as the series 
ere 
ao p(n) | s 


ts convergent or divergent. 





Since the rational values of a form a ‘set of measure zero, it is Sufficient 


to prove the theorem for almost all irrational values of a. 
Let i an . 





R(n, a) 
. nd (logn) ` ' 
If a is mahora. it follows E (11.3) that 
R(D,, a) Don 
D$ (log Dy) de Dalle Dy y’ 


29 Sanm sup 


8 (a) — lim sup 


_ where D, is the v-th denominetor of «(1 + a) — == y. With the aid of 
we infer that 
| | | Docs 
. (11.10) g(a) es i it sP Hog Dy) Flog DI > 


1 


where the integers dy are TAN appearing in the continued 
[do, da, * * > ] representing y. 


(20. 1) 


a 


But it is known (cf. Lévy, p. 289) that for almost all values of y and 


hence for almost all irrational values of a: : 
(11.11) ' . limVD=k, 


ver 


where K is an absolute constant such that 3 < K<4 On applying or 11) 


to (11.10) we infer that for almost all irrational values of 


(11. 12) dim sup ac S 8(«) Slim m sup 
: 772 


dns: 
e(r log 4) = ( log 8)" 


According to a theorem of Borel and Bernstein (cf. ick p. 


a 
= 
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Theorem 14), if y(x), 2 = 1, is a function which is positive and non-decreas- 
ing, the relation 


by = O (40) ) 
is true for almost all y or false for almost all y according as the series 


æ% ıl 


is convergent or divergent. If the series 
(11. 13) S 
| v0 p(v) 
is convergent, it follows that the series 
eo 1 

_ À Ces) 
is convergent. On setting y(x +1) — g(xlog3), c=0, we infer from 
the Borel-Bernstein theorem that for almost all y 

dra = O ($ (v log 3)). | 
It follows from (11.12) that S(a) is finite for almost all a. 

If the series (11.13) is divergent, it is easily shown that due to the 

hypotheses on (a) the series 





S 1 
v=o $ (v log 4) 
is divergent. On setting y(z + 1) == ¢ (zlog 4), «= 0, we infer from the 
Borel-Bernstein theorem that for almost all y the relation 
du == O(6(v log 4)) 
does not hold. It follows from (11.12) that for almost all a, S(a) is not 
finite. 
The proof of the theorem is complete. 
If we set (4) =x, c21, and (rx) —1, 0S z& 1, the following 
corollary is an immediate consequence of Theorem 11. 5. 


COROLLARY 11.1. For almost all a 


This implies in particular that for almost all « 


lim sup ZD. L + co. 


no 
If a > 1 and we set ¢ (x) =, zt 21, ọ (z) = 1, 0 S x S 1, in Theorem 
11. 5 we obtain the following corollary. 


40 MARSTON MORSE AND GUSTAV A. HEDLUND. 


Cornottary 11.2. Ifa>1, 


‘R(n, a) 
Him su Mires n(log n)¢ TR | | 


- “except for a set of values of a of measure zero. 


We have excluded the case in which a is rational because for a = q/p 
with (q, p) == 1 the periodic Sturmian trajectory with frequency « has the 
period w = p + q and: i 
(1.14) ° R(n, a) =w+n—1, nZo 
If a is an integer,. (11.14) holds for n > 0. If a is rational but not an 
integer, (11. 14). does pot hold for all n < w, as we shall see. However, the 
values of R(n,a) for # <w can be determined by methods similar! to those: 
applicable to the case when a is irrational. 
If a= g/p, where (g p) = 1 and a is not an integer, and if we set 


t 





E ipa aaa 
‘y is not an integer and admits a unique representation in the form of a con- 
tinued fraction (cf. Perron, p. 30) | 
ym [do diss, dul, eZl, dz2. 
The recursion formulas (10.1) determining the successive convergents Cy/Dy 
of y are valid forvSy. We state the following theorem without proof. 
TexoRem 11.6. If a—q/p where (q,p) 1 and p#1, Theorem 
10.1 holds for n < Dus. For n= D: 
R(n, a) =p+q—1. 
- By means of this theorem it is easily shown that Theorem 10.8 is valid 
for positive rational as well as irrational values of a. 


12. Sturmian sequences in differential equation theory: We are con- 
cerned here with linear homogeneous second order differential equations with 
coefficients which are continuous in the independent variable + We shall 

‘make use of the important canonical form . 
(12. 1) = Y+é(z)y=0. 
We assume that (x) has the period 1. Corresponding to an arbitrary solu- 
tion u(x) of (12.1) with us 0, let T (u) and T’(u) be respectively cell-series 

À ig aB_,aB,aB,a B23 
in which B, is the number of zeros of u on the intervals 


n£Etr<n+l, n<oeSn+l. 
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It follows from the well-known Sturmian separation theorem that T'(u) and 
T'(u) are Sturmian in the sense of §2. -Moreover the frequency a of T(u); 
or of T’(u) depends only on (z) and not upon the choice of the solution u. 
We may refer to a as the frequency of (12.1). 

The cell-series T(u) and T’(u) include all of the bpe which we ia 
in the general study. Consider fo example the equation 

y’ + a’y—0 

sien a is a positive constant. The corresponding cell-series T (u) and T (uY 
have the frequency a/r = « and, as is easily seen, include all of the types 
T'(c,a) and T’(c,.a), respectively, for suitable choices of a and of u. When 
a = 0 we have the solution s. The series T(z) is skew Sturmian of the form 
8’(0,0). To obtain skew Sturmian series T (u) nioïe general in form it is 
necessary to go somewhat deeper. 

We recall a few facts*in the classical theory of : differential equations of — 
the type (12.1). Let y(z) and w(x) be solutions of (12.1). Keeping z 
real we admit solutions of (12.1) of the form Ay(z) + Bw(x), where A 
and B are complex constants. Let p be an: arbitrary positive integer. As is 
well known, there exists at least one solution u(x) of (12.1) such that 


(12. 2) ula + p) = pu(z), | (#0), 


where p is a real or complex constant. We term p a characteristic root of 
index p. The roots p satisfy a quadratic equation, the product of whose roots 
‘ig 1. There are two principal cases according as the roots p are real and 
positive, or not real and complex: There is also the degenerate case in which 
the roots are equal. The equation (12.1) possesses a canonical pair of inde- 
pendent solutions whose properties depend upon the classification of the roots p. 
Tt would not be difficult to show the precise connection between these canonical 
forms and the types of trajectories T'(u) and T’(u) defined by the solutions 
of (12.1). We shall not go into details beyond proving the following theorem. 


THEOREM 12.1. In case the differential equation (12.1) possesses two 
real positive unequal characteristic roots of index p, then for a suitable chotce 
of the origin and of the solution u(x), the series T(u) and T’(u) are skew 
Sturmian trajectories with a frequency of the form q/p. 


Let c be one of the roots of index p. The reciprocal of c is another such 
root. We seta==p*loge. It is easy to prove that there are two independent 
solutions of (12.1) of the form 


| fy (a) = oA (2), 
one) w(a) <= eB (2), 
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where A(x) and B(x) have the period p. Moreover 


y(c+ p) = oy (7), 
w(z+p)=Żu(2), 


and the only solutions u(x) of (12.1) which satisfy a relation of the form 
(12.2) are the constant multiples of y(x) ‘and w(x). 

Suppose that A(z) vanishes q times on the interval OS s <ip. The 
function B(s) likewise vanishes q times on 0 z < p since the zeros of y(z) 
and w(x) mutually separate each other. The series T (y) has the frequency 
q/p as do the series T’(y), T'(w), ete. 

Let w be a point which is not a zero of y(z) nor of w(x), and let u(x) 
be a solution which vanishes at œ without being identically zero. Then 
w(o-+ p) £0. Otherwise for some constant p oa 0 


u(z + p) = pu(2). 


As we have seen, u(x) would then be a constant multiple of y(x) or of w(x), 
contrary to the hypothesis that w is not a zero of y(x) or of w(z). The q-th 
zero w of u(x) following w is such that w — o Ap. If o’—w <'p, then 
‘after a suitable change of codrdinates of the form a’ == x + x, the interval 
0 <2’ <p will include both w and wo’ and T(u) will possess a p-chain of 
b-length g+1. If o&’—o>p and v is suitably chosen, the interval 
0S y S p will include just g —1 zeros of u(x), and T(u) will possess a 
p-chain of b-length g—1. In either case the series T (u) as well as the series 
T’(w) is skew Sturmian, and the proof of the theorem is complete. | 
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ON THE METHOD OF FINDING ISOTROPIC STATIC SOLUTIONS 
OF EINSTEIN’S FIELD EQUATIONS OF GRAVITATION.* 


By P. Y. Cov. 


1. Introduction. The isotropic static solutions of Hinstein’s field equa- 
tions can be obtained by solving a set of differential equations given by the 
author? For the type of problem which involves the determination of iso- 
tropic fields of a single body the complete solution of the problem reduces 
first to the solution of a non-linear partial differential equation of the second 
order (I, (3.4)), and secondly to the transformation of ds? in the (u, v, w) 
coôrdinates to the canonical form (I, (2.4)). When the problems are simple 
such as the examples given in I, where the non-linear partial differential 
equation degenerates into afi ordinary equation, these two steps can be accom- 
plished with ease. But in the general problem the solution of the partial 
differential equation and the transformation of codrdinates are both difficult. 

An alternative way of approach is to compute the equations (2.5) in I 
in terms of the (x,y,z) codrdinates in the canonical form (2.4). Then we 
have two dependent variables, U and o, satisfying seven partial differential 
equations with three independent variables, x,y,z. Unfortunately these equa- 
tions are non-linear in U and o and the determination of their general 
solution is by no means simple. 

In the present paper we shall derive from (2.5) in I, as a further neces- 
sary condition, another set of partial differential equations whose solutions also 
satisfy the field equations. The advantage of the present treatment over the 
previous one is, as we shall show presently, that we have to solve a set of seven 
non-linear partial differential equations of the third order satisfied by only 
one dependent variable e while the other function U can be constructed out 
of o and the partial derivatives of a. Out of the seven partial differential 
equations we shall derive the well-known Laplace’s equation. Hence as a 
method of procedure we can choose a harmonic function and this harmonic 
function will define an isotropic static gravitational field provided it satisfies 
the remaining six partial differential equations simultaneously. As an illus- 
tration of the present method we shall show that Kasner’s solution (I, (3.5) ) 
and the field of the semi-infinite plane (I, (3.9)) are the only two-dimensional 
isotropic static fields in Einstein’s theory of gravitation. 


* Received October 30, 1938. . 
1P, Y. Chou, American Journal of Mathematics, vol. 59 (1937), p. 754, which will 
be referred to as “I”, eq. (2.6). 
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2. Equations determining ge Woe write down some of the important 

equations in I. The field equations (I, (2.8)) are 

Gay Cor UU SO Gis, Cut 

The canonical form of the isotropic static arc element (I, (2. 4)) is 

(2. 2) dst = Utdt? — ee (da + dy? + dé). 


The equations due the isotropic solutions of the field equations can 
_ be written as (I, (2. ha 


(2.8) U, = (D, — à 9 UD); and R==0.. 


C 7 U 
' From (2.3) we can derive as an integral (I, (2. 6)), 
(2. 4) OU, = — k? (c + U’)$, 


We can eliminate U4,; between (2.1) and (2.8). Then 


(2. 5) ; Rij —— aoa (UU; — 5 90" Ux), U it om 0, 


In fact (2.5) was obtained when we proved the sufficiency of (2.8) determin- . 
ing the isotropic solutions of (2.1) (I, (2.17)). Now let us form the | 
invariant BoB from (2.5): 


(2. 6) | - R”*”Rmn — 24 (Ù*Un)?/ (c + UES 
By using (2. 4) we obtain as a consequence 
(2.7) | (c + Dr) RnB /24kt, 7 


Equations (2.5) and (2.7) are tensor equations and, accordingly, hold 
in any system of codrdinates. If we compute Ry; from the form (2.2), we 
find ? | | 
(2.8) Riy = — 0,45 + 040, — guy (Aro + Aro) | 
where | 

Agr == go, g Ayo = gto, 40,3. 


If we calculate o.4; explicitly, we get 


i Bia = — Gen + op + 05 (st + ay H o), i | 

(2.9) R 
12 = — Gey — Osy; | 

Ses we ‘uk ‘Ose — Pa/8z", gy = ĝo /ðy, etc., and the other components of 

the tensor Ay; can be obtained by cyclic permutations of the codrdinates T, Ys 2. 

2L, P. Bisenhart, Riemannian Geometry (1026), p. 90, eq. (28.6). le 
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_ Now we can eliminate U between (2.5) and (2.7) with the under- 
standing that the components of Ry; are given by (2.9). Then we see that 
(2.5) are seven non-linear partial differential equations of the third order in 
o. Since o has to satisfy seven equations simultaneously, its number of 
solutions must be limited. But if we can obtain one solution, then the 
corresponding function U is determined automatically by (2.7). 

The conditions (2.5) and (2.7), with Ry given by (2.9), are thus 
necessary for the field equations (2.1) to possess isotropic solutions in empty 
space. They are also sufficient. For from (2.5) and (9.8) we have 


e 2 2 
“CLG F A -Wubi — UixU; +3 90 Uir— z gxU*Un,4) 


== TE; UU a(g Ur — gaU). 

If we contract (2.10) by g# and U* separately and eliminate the intermediate 
expression U*Us,x, we find (2.8) again. Then the field equations (2.1) are 
also satisfied by the theorem proved in I. Hence we may summarize the above 
results in the following theorem: 


THEOREM. A necessary and suficient condition for the static field equa- 
tions (2.1) to possess isotropic solutions in empty space is that the function o 
should satisfy equations (2.5) where Ry; and U are given by (2.9) and (2.7) 
respectively. 


The theorem proved in I lays emphasis on the function U and the ‘present | 
theorem deals primarily with o. The method of obtaining o is as follows: 
From (2.5) and (2.9) we find 


(2. 11) eee [oes + oyy + ors — $ (0s + oy? + os) ] 
—— 6 (foo + fy + fo) = 0; fmol 


- In other words f satisfies the classical Laplace’s equation. Since Laplace’s 
equation possesses a large variety of solutions, the isotropic static fields are 
defined by those which will also satisfy the other equations in (2.5). Hence 
we may test whether any harmonic function f defines an isotropic static field 
by simply constructing Ry and U according to (2.9) and (2.7) and see 
whether every member of the equations in (2.5) is verified. 
We have dealt with the case in empty space only. ‘ A corresponding theorem 

within matter can also be proved in à similar way. | | . 


3. Two-dimensional problem. Although s we have given the general 
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method of finding isotropic static fields in the previous section, it is sti rather 
laborious to solve the general three-dimensional problem. On the other hand ’— 
when we restrict ourselves to the two-dimensional case, the present method ` 
with slight modifications can give us all the isotropic fields ; it is decidedly 
simpler than trying to solve the set of equations (2.3) directly, for. even in. 
the canonical form (2.2) when the coürdimate z is absent from the functions’ 
U and o, (2.8) are still non-linear and not much information can be obtained 
from these equations. 
Since both o and U are aani to be independent of es we have from 
(2.9), 





Rumo Hot — hla + oy") = — Gy lU —4 (Ue + D) 


: 2 2 P D S 6 a 12. 2 
fais Big — oy + 02° — à (oe age ggr lUs US Vy 13 





Us Uy, 


Bra m= — osy Oy == — 


TA U’. 
Bas Be) = giy Ty : (Ua? +- U; D 


the other components of R; all vanishing identically. According to ithe given 
procedure we should eliminate U from (3.1) by using (2.7). But this is 
clumsy and we shall avoid using it. Instead, from (2.4) and Rss in (3. 1) 
we find 


(8.2) | (c 4: U*)* = 67? (oa? + De 


Furthermore we can drop out the common expression Rss in Ry, \and Raa 
and get 
Bis: ogy de mle A/ ot Ue), 
(8.3). . Ra: oy + oÿ = 6U,2/(c + U*), 
Ra: Ogy + Tag "= 6UsUy/(c + U?).. 
The harmonic function f in (2.11) can now be taken to be 
(3.4) feet F(s H iy) + F(e—iy) =F + F*, 


where F is an analytic function of the complex variable x + à. Then. the 
partial derivatives of o with respect to z and y can be computed. If we denote 
the dena of F re Teper to its argument by F, we find (3. 2) to be ` 


(3. 5) (e+ T3) = 4P ER.. 


By means of (8.4) and (3.5) we can put (8.3) in the following form : 
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0+ U? oF pe 6F* 
Rua: TA +)? == LR je fa?’ 
UES fe 602 E- F Es f 
+ per CFU? -e a 
(3.6) Ra: -3 + +% (P— ay = eu LE ae | 
Ru ns de oa CHU F/Ë_6F\: (+ 6m 
. Ra: F (F—F*) F 8 (as —F*) — 602 [G h) (È f |. 


Then we form linear combinations of the above equations. The expression 
Biz + Res gives 








(3.7) RTE shy (fe vi) 


which in combination with (3.5) also gives (2.4). From Ri,— Ra: we find 
CORTE — A (Pi) | 
CHU CF 6F\! (E*N. 
Se E-E-E] 


From Ris in (3.6) and (8.8) we can easily see that the following equation 
and its conjugate must hold: 


oe . / 2 ‘a ; 2 
es pate ae 





which can be simplified into 
U*f2#?/F* +. o(fF/F? — 6)? —0. 


Since only U? presents itself in the arc element (2.2), we may take the 
‘positive sign in front of U, namely, 


(3.10) fP/E = 6V— c/(U + V—c). 

Taking G 9) and ita conjugate, we can eliminate U* from (3.7) and obtain 
(É—)(É- 3), or 

(3. 11) F F* 


dfi\ a [1 =) 
ma Ea jt). SS as) 
f dz (3) da* (a) oe ee aor dz* F* 
Inserting F from (3.10) into (3.11) and simplifying, we find finally 


(3.12) [V—e+ (V—o)*]U —0. 
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In other words, the arbitrary constant c must either be zero or |a 


positive 


constant which can be taken to be unity without the loss of generality. 


Case (1). c—0.. Then from (3.10), we find 
(8.13). F—0, F = bz/2, f= be and 6° bit 


From (3.5) we get U? = 1/b°r? so that k= b. This is Kasner’s solution 


for an infinite plane (I, (3.5)). 


Case (2). c==1. Then F must be different from zero and (8. 


be written as: 
dz* d?z* : | 
, * JE es 
ee) + Fe 3 (4/5 amt ape oe) 


Since z and #* are actually uidependan we may eonclude that: 


` ds tet 
(3.15) : ; F+3% Ir = 1, 


where a must be a real constant. In fact a can be put equal to zero 


11) can 


i 


without 


any loss of generality, for it only adds an imaginary constant to F and in the : 


final expression of f in (3.4), it does not appear. Hence integrating 
we find 
(3, 16) PF (Br), 


_in which £ is a constant of integration. 


(3.15), 


We have two expressions of U from (3. 10) and (8.6). They must be . 


identical so that BB* — 64k°. Then 





(3.17) D= iant ($ +8)/23, et = fi Tin cost ($ E8), 


where 8 is the amplitude of the complex number 8, $ and p are the amplitude 
and modulus of s -+ iy. This represents the field of the semi-infinite plane 
(I, (3.9)). In other words Kasner’s solution and the field of the semi- 
infinite plane are the only two-dimensional isotropic static fields inj empty 


space according to Einstein’s theory of gravitation. 


Tue NATIONAL SOUTH-WEST ASSOCIATED UNIvERsiTy (being a wartime union of 
National Tsing Hua University and National Peking University of Peiping, and 


Nankai University of Tientsin), 
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ON THE ALMOST PERIODIC BEHAVIOR OF THE LUNAR NODE* 


By AUREL WINTWER. 


Introduction. In view of Newton’s deduction of his approximation 
formula for the mean motion, w, of the ascending node of the lunar path (cf, 
e. g., Tisserand [6], pp. 42-44), the constant » may be characterized, from 
the astronomical point of view, not only as the average velocity of the nodal 
angle Ÿ = Ÿ(t), but also in terms of the relative number of times the Moon 
passes through the ecliptic (on the average). Correspondingly, the precise 
form of Newton’s approximation theory, as developed by Adams by means of 
infinite determinants, is directly based on the Jacobian differential equation 
which determines the ordinate z — z(t), if the ecliptic is the (a, y)-plane 
(cf., e. g., Tisserand [6], pp. 286-288). 

From the mathematical point of view, there immediately arise several 
questions. Some of these have been investigated by Levi-Civita [4], who 
proved that the two definitions of w (those based on Ÿ(#) and z(t), respec- 
tively) are equivalent, and that the limit which is supposed to represent 
actually exists; so that, in the theory of Adams, 


(1) #(t) ot + y(t), where w = const. 40 and | w(t) | < Const. 


The present paper deals with certain analytical refinements of Levi- 
Civit4’s result; refinements which, though of apparent astronomical significa- 
tion, can only be treated by using analytical tools developed recently (Levi- 
Civita’s paper appeared in 1911). 

The modern theory of the Moon, as originated by Hill and further 
developed by Brown (cf., e. g., Poincaré [5]), is based on certain tacit assump- 
tions which, for the case at hand, imply that the non-secular part of #(é), 
i.e. the remainder term y(t) of (1), may be analyzed into an anharmonic 
Fourier series. This assumption will be justified by proving that y(t) is 
almost periodic (almost periodicity will always be meant in the sense of Bohr). 
The formal situation is as follows: 

The variational equation of Adams for the ordinate z is of the form 


(2) z” + f(t)z = 0, 
where f(z) is a given periodic function of the time. One can write this 
differential equation of the second order in the form 

* Received January 3, 1939. 
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(3) w == a(t)u + b(t)», v = c(t)u+ d(t}v 


. of two differential equations of the first order. Then, on introducing into the 
(u, v)-plane polar codrdinates by placing / i 


(4) u= (us +0)? cos ù, v= (u? + v°) sin ð, 


one can conclude from a general theorem, that not only is (t) of the form 

(1) but, in addition, y(t) is almost periodic (cf. Wintner [9]). 
| However, this approach to the problem is based on the assumption that. 
the codrdinate Ÿ = (t) of the ascending node of the Moon is idéntical with 
the angle Ÿ = #Ÿ(t) which is defined by (4). Now, the latter #, being the 
polar, angle in the (u,v)-plane, clearly is not the lunar node, if one writes 
(2) in the form (3) by placing u =z, v =z (or u—2’,v=z). Fortunately, 
it turns out that the polar angle in the (u,v)-plane becomes identical with 
the nodal codrdinate # = (t) of the Moon if, instead of identifying u, v with 
z, 7, one subjects the pair z, 2’ to a suitable linear substitution whose matrix 
is a certain persed: function of ¢, and then defines u, v as the reine linear 
combinations of z, z 

Due to the relations which Levi-Civita used when identifying the two 
definitions of w (cf. the beginning of this paper), the linear transformation 
defining u,v is quite explicit. Correspondingly, the existence of œ and the 
almost periodicity of y(t) will be proved directly. This direct proof will not 
involve an actual modification of the program sketched above. In fact, the 
proof depends, in either case, on an application of the following theprem, 
_ formulated as a conjecture by the present author, and subsequently proved 

by Bohr [1]: 

If #(¢) is real and i(t) almost periodic, then there exist a constant 
w and an almost periodic function y(t) such that #(¢) = ot + y(t). (The 
converse of this theorem is obvious.) 

Since the proofs will be based on the theory of almost periodic functions, 
the proofs are independent of the results of Levi-Civita (cf. the ening of 
this paper), which, therefore, follow as corollaries. 

The almost periodicity of the remainder term y(t) will also imply the 
existence of an asymptotic distribution function for the angular variable D(t). 
This means that there exists an asymptotic probability p == pla B ), that the 
lunar node #(¢) will lie on a given arc a = Ÿ(t) = B, where }— V(t) is 
thought of as reduced mod 27. Needless to say, (1) in itself would be insuff- 
cient to guarantee the existence of such an asymptotic distribution function. 


1. The considerations of this section are more general than those actually 
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needed for the problem at hand, and concern the explicit characterization of 
all (non-conservative) linear canonical transformations in case of n degrees 
of freedom. The result, though of algebraic simplicity, does not seem to occur ` 
in the classical literature of the subject, apparently because the verification 
requires some effort, if one starts out with the standard Pfaffian criterion of 
canonical transformations. On the other hand, the result follows quite 
naturally by using a method developed recently (cf. Wintner [8], van Kampen 
and Wintner [7]). 

Reserving the sign ’ for d/dt, let A* denote the transposed matrix of the 
matrix A (although all matrices occurring will be real); so that A’* = A*’, 
where A = A (t). Correspondingly, the bilinear form belonging to a matrix 
B will be denoted by F*BX, where X, Y are real column vectors. Let*H be 
the n-rowed unit matrix, O the n-rowed zero matrix, and I the 2n-rowed 
skew-symmetric matrix « | 


(5) 1=(_9 AL so that I = If = — I, det I == + 1. 
Then the most general linear (homogeneous) Hamiltonian system with n ` 
degrees of freedom is | | 
(6) IX" = 8(t)X, 


where S(t) is an arbitrarily given, 2n-rowed, symmetric, possibly singular, 
continuous matrix function of the time ¢. In fact, if 21,- ` *, ın denote the 
components of the vector X, one can write (6) as 


d'in — OH tun ‘T'en = OH dtn no Ree ee 


where H = H(X;t) is the quadratic form H = 4X*S(t)X; so that Tiun, 
where t == 1,: - -, n, is the i-th codrdinate, and x, the momentum canonically 
conjugate to Tisn. . 

If one subjects Y to an arbitrary linear substitution 


(7) XY =m T(t)X, 
where T(t) is a matrix function of (2n)? elements which have continuous 
first derivatives and a non-vanishing determinant, then (6) clearly is trans- 


formed into a system of 2n differential equations of the first order which are 
again homogeneous and linear and can, therefore, be written in the form 


(8) IX! = §(t)X, 


the matrix (5) being non-singular; so that S(t) is uniquely determined by 
S(t), T(t) and the derivative matrix T’(t). However, the transform (8) of 
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the canonical system (6), is not, in general, again canonical, since S(t) need 
not be symmetric matrix whenever S(t) is symmetric. Correspondingly, (7) 
may be defined to be a canonical transformation if it has the property that 
(8) is a canonical system for every canonical system (6), i. e., if S*(t) == S(t) 
whenever S*(i) = S(t). 

Now, T(t) will have this property èf and only if there exists. a scalar 
constant u Æ 0 such that ' 


(9) T*(t)IT(t)—=pI for every t . (#=0). 
(It may be mentioned that (9) implies that 
(9 bis) det T(t) =p”,  (detT(t) #0), 


and not only that |det T(t) |=]|a|”). Furthermore, if the necessary and 
sufficient condition (9)—(9 bis) for a canonical linear transformation (7) is 
satisfied, the matrix S(t) of the transformed Hamiltonian function, i.e., of 
the quadratic form $4*8(t)X, follows from 


(10) T*ST = pS + T*IT’, where S = S(t), T T(t), det T(t) g 0; 


(so that (10) is a symmetric matrix for every symmetric S(t) if, and only 
if, (9) is satisfied). 

The proof of the statements (9), (10) will be omitted. For, on the one 
hand, these statements may be verified from the general (non-linear)! results, 
obtained loc. cit. [7], at least if one disregards the fact that, this time, only 
the existence of a first continuous derivative T”(t) is required (the problem 
being linear). And, on the other hand, a direct verification proceeds in! exactly 
the same way as in the particular case T (t) = const, treated loc. cit. [8]; 


2. Suppose, in particular, that (7) transforms momenta into momenta 
and coördinates into codrdinates; so that 


an) OS A a) 


t 
| where A(t), B(t) are non-singular n-rowed matrices. It is easily verified from 
(5) that (9) is satisfied by (11) and »=-+-1 if and only if A*(t) =B (t). 
It follows that in the particular case A — B of a cogredient transforma- 
tion of the momenta and coördinates, the condition is that A (t) be, for avery t, 
an n-rowed orthogonal matrix (of determinant +1). Application of (10) 
to this particular case shows that the transformed Hamiltonian function is 


(12) 4¥*5(t)X —4X*9(t)X + HUA (4) A*(t) — V*A'(t) A* (4)*0}, 
| 
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where U and V are the vectors with n components, (#;) and (Zin), which 
are formed by the n first and n last components of the vector (7) with 2n 
components. 

For instance, if the degree of freedom is an even number, say n = 2m, 
condition (8) is satisfied (with »—+1) if T(t) is the 2n-rowed matrix 
which one obtains by repeating, 2m times along the principal diagonal, any 
two-rowed rotation matrix 


cos $ — sin ¢ 
13 ® =f. = 
(13) DRE eee), where =el) 
is any given scalar function which has a continuous derivative ¢’(t). In this 
case, the quadratic form ${ }, which in (12) represents the deviation of the 
new and old Hamiltonian functions, readily reduces to ` 


(14) $2451) 2 —$X*5()X = 4'(t) X (& Hi — mE), 
c= 1 
if Zi, Za, Zs, Las * +, Zona, Zen are respectively denoted by ©, Hi, Be, He: :, 


Ém, nmy Where m == $n. 


3. Let the degree of freedom be n == 1, and write p,q; u,v for 21, 223 
Tı, Z, respectively; so that (7) becomes 
u = a(t)p + B(t)g, Gy a 
15 where == T(t). 
je Dm y(t)p + 8(#)9, v(t) a(t) PFO 


It is easily verified from (5), where E = 1, O =Q in the present case, that 
the necessary and sufficient condition (9) for a linear canonical transformation 
is satisfied by (15) if and only if det T(t) = const. But const. = m, by 
(9 bis) ; so that the criterion takes the form 
(16) a(t)8(t) —B(t)y(t) = p, where p == const. 0. 
It follows, therefore, by straightforward reductions that 

2 2Aya(t) Agy(t) oo) 

17) Aul T(t) T(t —( 

CD OO — ag, (t) — Baal?) 2Aug(#) 
if the elements of this two-rowed matrix denote the determinants defined by 


2 


(18) Ag (t) =x (t)A (t) — N'(t)x(t), where y = dv/dt. 


If, in particular, the constant (16) is 1, then, on denoting the new and old 
Hamiltonian functions, i. e., the quadratic forms $X*S(t)X and 4X*S(t)X, 
by K(u,v;t) and H(p,gq;t), one sees from (10), (7) and (17) that 
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(19) K(u,0;t) = H(p,q5t) + Haru? $ ess Aus + Aagtf}, | 
where H(p,q;¢) is thought of as expressed by means ‘of the inverse of the 


substitution (15) as a function of (uv; t), and the A are, in view lof (18), 
given functions of t. 


4. Let g A ; . 
(20) af — 2y = Qa(7T, Y, z), y” + Ra! = Q(z, Y, z), z” = Q, (x, Y z) 
be the equations of motion of the (non-planar) restricted problem ‘of three ' 
bodies in a synodical barycentric codrdinate system (x, y, z); so that|the axis 
of syzygies is the z-axis which rotates, with reference to a sidereal planar 
“coôrdinate system which coincides with the (x, y)-plane, with constant angular 
velocity. “Thus, (20) is ati irreversible, conservative, dynamical system with 
three degrees of freedom, admitting Jacobi’s integral of relative energy: 


(21) . (a? + y? + #1) —alz, y, z) = const. ` ` 


The classical mathematical literature of the restricted problem of three © 
bodies concerns the case z(t) == 0 of a planar solution 


(22) zæa(t), y= y(t). 


Starting with any given planar solution (22), consider the non-planar solu- 
tions in the infinitesimal neighborhood of (22) ; so that the third of the equa- 
tions (20) may be replaced by its Jacobi equation belonging to (22): Then 
2u=2(t) is determined by the equation (2) of Adams, in which the coefficient 
function f(t) is obviously given by 


R f(t) => Ral (t), y(t), 0). 


Furthermore, on denoting by # — (t) the longitude of the ascending node, 
and by «==.(¢) the (small) inclination, with reference to the synodical 
coérdinate system (x, y), one has in (2) | 


(24) z= — x sin « sin # + y sin « cos f, dd sin « sin Ÿ + y sin 1 cos Ÿ, - 
at least so long as | 


(26) z(t)y/ (t) — y(t) (t) 70. 


` In order to see this, it is sufficient to write down, within the degree of accuracy 

of (2), the projections of the vector product of (x, y,z) and (2’, y’,2’)|on the 
.coürdinate axes; cf. Levi-Civita [4], pp. 366-367 (where, however, the ascend- 
ing node is referred to the sidereal, instead of the synodical, coôrdinate system ; 
so that one has to replace (t) by #(¢) — t). : 
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5. In view of (24), where x,y are the given functions (22) of t, one 
can replace the differential equation (2) of the second order for the ordinate 
z by two differential equations of the first drder for the Eulerian angles 1, #. 
In order to legalize this, it is sufficient to observe that, barring the case of the 
trivial solution z(t) ==0 of (2) which belongs to the given generating solution 
(22) of (20), the angles : == 1(#)$ #—#(¢) do not become Do 
since 
(26) sin «>< 0 


for every t. For if (26) were violated at some t = t, it would follow from 
(24) that z == 0 and z = 0 at this particular t. But then (2) implies that 
z == 0 for every t. 


6. It turns out that the differential equations for «, #, mentioned at the 
beginning of 8 5, readily lead to the desired equations (3) in which u, v repre- 
sent certain linear combinations of z,z’ in such a way that the requirement 
(4) of the Introduction becomes satisfied. 

To this end, put 


(27) u= (zy — yz’) sine cos}, v = (sy — yr’) sinc sin 6, , 


if the determinant (25) is positive, and modify the factors (sy — yr’)? on 
the left of (27) in an obvious manner, if the continuous non-vanishing function 
(25) of ¢ is negative. According to (27), one can write (24) in the form 
(28) z= (zy — yr) (yu — sv), g= (ay — ya’) ¥(y'u — z'o) 


of a linear substitution of u, v into 2,2. The coefficient matrix of this linear 
substitution is, by (22), a known function of ¢ and has, in view of (28), the 
determinant + 1 for every t. Hence, on placing p = z, q =7, and writing 
(28) in the form (15), the condition (16) for a canonical transformation is 
satisfied by a= 1. On the other hand, (2) may be written in the form 


(29) p= — 0H/8q, g = ôH /ðp, 
if one puts l 


(30). H=H(p,q;t) = — #49 — łf(t)p’, where p =z, q =z. 


Consequently, the representation of (2) in terms of the variables (27) is the 
linear canonical system 


(31) u = — 0K /év, v = 0K /éu, 


where the Hamiltonian function K == K (u,v; t) is a quadratic form in (u,v) 
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and is apel given by (19) and (30), the determinants us) being (obtained 

by identifying (15) with (28). 

| Finally, on identifying (31) with (3), one sees from (27) that the 
requirement (4) of the Introduction is satisfied, and one has 


(32) ` gk tot (ay — y) sin? a. 


It should be mentioned for later application that the pair of conditions 
_ (25), (26) is, in view of (32), equivalent to the condition that u= u(t), 
v= v(t) do not vanish simultaneously; a condition which, in turn, is equiva- 
lent to the exclusion of the trivial solution u(t) =0, v(t) ==0 of (3), ie, 
of (31). 


7. Without assuming that the (real) ‘system (3) has the canonical form 
(31), suppose that its coefficient functions a(t),« > :,d(t) have a common 
period, say r. Suppose further that the characteristic exponents of (3) are of 

-the non-degenerate stable type, i:e., that the pair of the characteristic roots i 
of the monodromy group is of the form (p, 1/p), where | p | =1 but pé + 1. 
Then it is readily seen from the Fuchs-Floquet representation: of the| general 
splution of (3), that (3) admits a fundamental matrix which is the [product 
of two real matrices of the following type: One of these two matrix functions: 
of ¢ is periodic, with r as period, while the other matrix factor not|only is 
periodic but represents a uniform rotation, with a period which is determined 
by the characteristic exponent, i. e., by arg p. | 

Now, the existence of a funidamental matrix which possesses a tetas 
zation of this type clearly implies, not only that every solution u == u(t), 

. == u(t) of (3) is almost periodic, but also that the greatest lower bound of 
u? + v? for — œ <t< + œ is distinct from zero for all those (real) solu- 
tions u == u(t), ve=v(t) of (3) for wag u? + v? = 0 does not hold at 
some fixed t == to. 

Consequently, u == u(t) and v = v(t) cannot simultaneously come arbi- 

trarily close to 0 for —;œ < t < + œ, i one excludes the trivial solution 

u(t) = 0, v(t) = 0. . 


-8. Now consider the case in which the P planar solution (22) of (20) 
is periodic. Then so is the coefficient function (23) of (2) and, therefore, ` 
- the coefficient matrix of (29), or of (31). If, in particular, (22) is that 
` solution of the restricted problem of three bodies which corresponds to-Hill’s 
intermediary lunar orbit of his limiting case, then (25) is known to be satis- 
fied, and the characteristic exponent of (2), i. e., of (31), fulfils the stability 
condition required. at the beginning of $7 (as to the numerical situation, cf. 
Tisserand [6], p. 288). . , 
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It follows, therefore, from $ 7 that 


(i) the solutions u = u(t), v==v(t) of (31) are almost periodic and 
have frequencies contained in the integral modul of two numbers, say of A 
and v, where it is understood that À and v are or are not linearly dependent 
according as the period of (22) dogs or does not satisfy a commensurability 
condition with reference to the characteristic exponent; 


(ii) barring the trivial solution u(t) = 0, v(t) == 0, one has 
(u(t) )? + (w(t) Y > const. > 0 for — oo <t < +o, 


where the const. depends on the integration constants of the solution u = u(t), 
v = v(t) of (31). 


On comparing (ii) with (27), (32), and using from (i) only the fact 
that u == u(t) and v = v(t) are almost periodic, one sees that exp 10(t) is 
almost periodic. It follows, therefore, from the general theorem of Bohr, 
mentioned in the Introduction, that (1) holds for a certain constant w and 
for a certain almost periodie function y(t). . 

If, in addition, use is made of the description (i) of the moduli of u(t) 
and u(t), it also follows that w and the frequencies of y(t) are contained in 
the integral modul generated by the pair of numbers A,v (which may be 
commensurable) ; cf. Bohr [2]. 

The actual values of the integers j, k for which jA + kv becomes the mean 
motion w readily follow, for mere reasons of continuity, from an inspection 
of Newton’s approximation, i.e., of the problems of two bodies (cf. Levi- 
Civita [4], p. 876). 


9. In view of the significance of the lunar node, it is natural to ask, 
how are the values of the angle #(t) distributed asymptotically along the 
boundary of a circle 0 < 62. More precisely, the question concerns the 
existence (and then the determination) of a function o (8), the angular 
asymptotic distribution function, which is defined for 0 < 0S 2r as follows: 
If Lr(6) denotes the sum of the lengths of those t-intervals which, on the one 
hand, are contained in the range 0 S¢T and, on the other hand, are such 
that on their points ¢ the (continuous) angular function #(t), when reduced 
mod 2r, satisfies the inequalities 0 < #(¢) £ 8, then there exists on 0 < 0S 2r 
a monotone function o(@) which satisfies the relation o(2r) —o(+ 0) =1 
and is such that relative amount of time represented by the ratio Lr(6):T 
tends, as T — + œ, to the limit ¢(@) at every continuity point 6 of ø. 
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t 7 . 
It is known (cf. Haviland [3]) that a given #—#(t) has an angular 
asymptotic distribution function e =o(0) if and only if all the time averages 


(33) M {exp w(t) }, where n = 0,1,2,; -: 
| 7 | 
and M{g(t)} = lim f g(t)dt/T, exist, in which case 6(8) may be deter- 
0 


mined as the solution of the trigonometric momentum problem, 


(34) f edo (9) — M{exp ind(t)}; (n—0, 1,2): °°). 


9 


Now, since exp ##(¢), hence also exp ind (t), ie almost periodic, the time 
averages (33) exist, as does, therefore, o (8). 


10. In view of (i), §8, the actual determination of o is or is not an 
“elementary ” task according as À and v are or are not commensurable. 

In the first case, exp i(t) is a periodic function; so that the asymptotic 
averages (33) reduce to averages over a finite t-range, and so o(8) simply 
follows from (34) by the inversion process which ‘expresses the Lebesgue 
' . integrals as Stieltjes integrals. 

In the second case, the content of (i), 8(8) , may be expressed, with a suit-. 
able choice of notation, as follows: If © is the torus 0 <¢,1,0< ph S1 
which is obtained from a Euclidean (¢:, ¢,)-plane by reduction mod)1, then 
there exists on © a continuous function of the position, say F= Ae 2)» 
in such a way that either P(t) — F(ut,t) or — 





(35) b(t) = ot + F(ot, t), 


where.» is an irrational number. i 

It will be sufficient to consider the case (35). Then, by Weyl’s corollary 
to Kronecker’s approximation theorem, the time average (33) may be expressed 
as the space average of exp ini + F(¢:, $2) over the torus @: Hence, (34) 
becomes 


ar 1 1 , ; 
(36) f exp inddo(0) — f f exp info: + F(x $s) }ddiden; 
5 i 8 
| | (n == 0,1,1 -). 
Now, on writing the Lebesgue double integral on the left of (36) as a Stieltjes 





ON THE ALMOST PERIODIC BEHAVIOR OF THE LUNAR NODE. 59 


simple integral, one sees from the uniqueness theorem of the trigonometric 
momentum problem that the angular asymptotic distribution function o(@) 
is the area of the set of those points (¢:,¢2) of © on which the function 
Pi + F(u $2), when reduced mod 2r so as to lie between 0 and 27, attains 
values which do not exceed 8. 


+ 


11. It is seen by comparison of the cases mentioned at the beginning of 
$ 10, that the description of the angular asymptotic distribution function of 
an almost periodic function exp ##(t) is a problem of Diophantine intricacy. 
This situation is strikingly illustrated by the following consideration (which, 
however, cannot be applied to the problem (35) at hand). 

Let 
(37) v(t) = 3S am cos (Amt — om) 


be any real almost periodic function, and œ any real number which is not a 
linear combination (with integral coefficients) of the frequencies Am of y(t). 
Then the angular asymptotic distribution of #(¢) — œt + y(t) is the equi- 
distribution; so that o(#) is the linear functions 9: 27, no matter what is 
the remainder term (37) of (t) — ot. 

In order to prove this, it is, in view of (34), sufficient to show that 


0 = M {exp ind (t)} for n==1,2,---, 


2r 


since = f etido == 0 for n = 1,2,---. But M{exp inot} — 0 for 


9 
n= 1,2, >; while Ÿ({) of + y(t), where y(t) is given by (37). 
Consequently, it is sufficient to show that | 
M{exp (inot) }M {exp (in > Am COB (Ant — Am) ) } 
— M {exp in (ot + 2 Gm COB Am (t — Gm) ) }. 


Now, the truth of the last relation may readily be verified, for every n, 
from the assumption that w is linearly mudependont of the Am. 
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REMARKS ON A CONJECTURE OF MINKOWSKI* 
By D. DERRY. 


Let 
L, (2) = 1t ferret GinEn 


Lig (2) = amta F` + + ann 


be n linear forms with rational coefficients and determinant +1. A well 
known conjecture of Minkowski states that if no integral valued solution 
Ti, Tes” * +, M exists, other than the solution in which all the æs are zero, 
for which | L(x)| <1,:-+,]L,(x)| < 1 then the forms after a possible 
unimodular transformation of the z’s and a rearrangement of order have the 
form 

L(x) = z 

La(2) = la, + 22 


Lin (2) a amt > > ++ En 


Mordell has shown? that the conjecture may be stated in another form. 
Let Lı (z), Ls(x),- - -,La(z) be linear forms with rational coefficients and 
unit determinant which satisfy 


Condition 1. For any set of integral values z1, 22, ` * ,æ,, other than the 
set in which all the æs are zero, at least one of the forms takes a non-zero 
integral value. 

By Mordell’s result the Conjecture assumes 


Form 1. At least one of the forms has integral coefficients with no 
common factor. 
Let p be a prime which ‘is henceforth fixed. We assume the forms also 


satisfy 


* Received April 7, 1939. 

1L., J. Mordell, “ Minkowski’s theorems and hypotheses on linear forms,” Oslo 
Congress 1936. Form 1, communicated to me verbally by Mr. Davenport, differs slightly 
from the form given by Mordell. Mordell states the hypothesis in terms of the reciprocal 
matrix of the forms and the conjecture itself in terms of the original forms. This form 
states both the conjecture and the hypothesis in terms of the reciprocal matrix. 
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Condition 2. The coefficients of the forms are rational numbers whose 
denominators are some power of the prime p. 

This note considers further equivalent forms of the Conjecture when 
Condition 2 is satisfied. These are stated in terms of finite Abelian groups 
and their normal series. 


THEOREM 1. Without restriction in’ generality the forms Lr (2) used in 
Form 1 may be taken to have the form 


L, (2) — pas 
L(x) — QT, + pT 


L, (2x) ni + pT 


where ri, Te, © `, Tn are integers with rm Ste S> tO S Ta, pop eoe p™ == 1 
and p'am integral for j,k Z s. 


Proof. The above special form is easily derived? from any system of 
forms with unit determinant satisfying Condition 2 by rearranging the order 
of the forms and subjecting the «’s to unimodular transformations. ' 


r -THEOREM 2. Li(x),L:(x),- - -,L,(x) are a set of forms with unit 
determinant satisfying Conditions 1 and 2 and with the form of Theorem 1. 
Then the values the forms assume for a set of integral values z3, T2; ` +, 2n 
are. either all multiples of p™ or at least one of the forms assumes an integral 
value which is not a multiple of p™. | 


Proof. For a set of integral values 2, 23,: ` + ,æ, let Ly, La , Ln be 
the values taken by the forms L, (s), La(s), > +, Ln(£) respectively. Let 
Lx be the value with the least subscript k which is a non-zero multiple of pr. 
If no such value exists either all the forms vanish, which occurs if and only if 
Ti = 0, 2 = 0, - +, En = 0, or by Condition 1 at least one of the forms must 
take a non-zero integral value which we have assumed not to be a multiple 
of p™; thus if no such value Ly exists the truth of the theorem must be 
admitted. If we replace æ by Te— Lep" the forms Li(x), Lals), =>; 
Lx- (x) retain their original values while Zx(x) takes the value Zero. The 
remaining forms Lrs (T), Luoe(x),- - +, L,(x) take values which differ from 
the original values by multiples of pr for by Theorem 1 p ax is integral for 
j =k and we are assuming p'»*| Lx. By repeating this process we ultimately 
replace Tı, %2,: * *,%n by a new set of integral values which give the forms 





2 B. L, van der Waerden, Afoderne Algebra, § 106. 
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values which we may call L’;, L'a: © +, L'a. Each of this latter set of values 
differs by a multiple of p» from the corresponding ‘value of the set La, Lay > ©, Lu. 
None of the values Z’,, L'a, - +, L'n can be a non-zero multiple of prs. In 
case all these values are zero we deduce Jy, L:,- + +, En are all integral multi- 
ples of p™. If on the other hand one of L’,, L’2,- - +, L’, is different from 
zero then by Condition 1 at least one value say L’, is a non-zero integer and 
furthermore an integer which is not a multiple of p™. Consequently Lr is an 
integer which is not a multiple of pr. The theorem is then completely 
established. 


Form 2. 11,12," + +,» are integers not all- of which are zero for which 
ETES". Sry and ri+fe+: Hi —0. § is an Abelian group 
of rank n with type (ppt. - - ym); & a subgroup of & of type 
(prp. e e prta); Ba, Bay >, 84 a system of cyclic subgroups of % 
which together generate %. For every subgroup À of § containing § with Y/N 
cyclic it is known that a subgroup 8; exists with A = 8, Y  8,. 

Then a subgroup 8; exists with § = 8,7, § 30". 


Proof of equivalence to Conjecture. We shall first show how Form 1 of 
the Conjecture may be stated in terms of finite Abelian p-groups. Let 
I, (x), Lift), + +,Ln(%) be a set of linear forms satisfying Conditions 1 
and 2 and having the form of Theorem 1. We shall further assume that 
1, < 0. For otherwise from the conditions stated in Theorem 1 regarding the 
integers 71,1%," * *,1, we deduce that the coefficients of the forms are all 
integral in which case there is nothing to prove. ‘This last assumption im- 
plies n > 1. | | 

As %1,%,° ` *,@n each independent of the other take all integral values 
modulo prit" let @ be the group of vectors p'i(ti,%2,° - `, En); $ the sub- 
group of all vectors (L(x), Liz), * + +, En(æ)); Br, Br, 3r” the subgroups 
of vectors | 


pts,” +, Tras Q, Srey * *, En), 
DT, t, Cr- PO Lye, Trans è En), Lr a. 
p(t, ss, Tri ptr, Brie te Tu) . 


As the forms have the form of Theorem 1, § has type (prit para. ++ prrntre), 
From Theorem 2 follows that for every non-zero element £ of $ groups 8r, 3,” 
exist with A e8,”, 4 \ 8, and conversely if the groups have this latter property 
the forms satisfy the condition of Theorem 2, which implies Condition 1. If 
one of the forms L,(æ) has integral coefficients $ S 8,” and conversely the 
existence of such a group 8,” implies that the corresponding form has integral 
coefficients. If the integral coefficients of L,(x) have no common factor 
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$ & B. Again the converse is true for the relation § £ 8,’ implies that 
the integral coefficients of L(x) are not all divisible by p; because of the unit 
determinant and of the denominators of the forms none of them may, have any 
_ other common factor. 

From a system of linear forms we bays constructed an equivalent system 
of Abelian groups in terms of which the Conjecture was restated. It is possible 
to start with an abstract system of vector groups &, Š, D 8,,8., 1<r<n, 
with all the above relationships including the fact that § have type 
(prtsttap ratte Pere pom) where Ti + Te + whe + Ta = 0 and Ti z< Te < LT <= Tn 
and by considering an automorphism of @ on § to construct a system of 
linear forms with unit determinant satisfying Conditions 1 and 2. , In other 
words to every set of groups satisfying the conditions of the above group form 
of the Conjecture a set of-linear forms exists satisfying the original condi- 
` tions of the Conjecture. Thus the complete equivalence of the two: forms is 
established. | 

Let 3 be the multiplicatively written character group of the | group G. 
To every subgroup 8 of Œ we order the subgroup 8 of & of all characters x 
for which x(A) —1 for A eÑ. It is a classical result of Weber that o=@ 
and that 8 determines 8 while %/B = 8. Accordingly if $ be the ‘character 
subgroup associated with § it will have type (pipi. - -grire). The 
dual groups 8, of the system 8, form a system of cyclic groups 8;—= (Ar), 
1r Sn, which generate the group § while the dual groups of the system | 
$r, , Br’ are the cyclic groups (A,?"**), (Ae), lSrsn respectively. Now 
if A be an element of © the dual of the cyclic group (A) will be a subgroup 
N of % containing § for which §/% is cyclic. Conversely every subgroup 
A of % containing Q, for which F/Y is cyclic, is the dual of a cyclic subgroup 
(A) of $. Accordingly, translating the conditions of the above group form 
of the Conjecture into the character groups, we see for every such subgroup % 
a subgroup 8, exists with Y = 8,7», A Æ 8r The Conjecture itself becomes 
under similar translation: a subgroup 8, exists with 6 = 8,7», § # Bern, 
Thus the Conjecture in Form 1 is shown to be ARTE to Form 2 and the 
proof is complete. : 

ou For an Abelian group © of order p'” a series of subgroups 
G — Gy, G2," + +, O, Gus = (E) is said to form an r-series if Gs/Gers is 
cyclic and of iden p for 1=s<=n. 


Definition. A subgroup § of Œ is said to be reciprocally ad if: the 
factor group @/$ is cyclic. 


Form 3. B is an Abu group of order p™ with rank less ‘aia n. 
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€i, ©: + +, Ex are n cyclic subgroups of B. For every reciprocally cyclic sub- 
group D of B groups Ce, Es," > * , Ca, exist so that the factors of the series 


B— (Ce, Ce ° . s Css B),° a) (Cas D), D 


have order not greater than p". Then the groups @:, @2,- : :, €n after a 
possible rearrangement of order buitd an 1-series 


B = (Gy, Cr, + +, En), + +, (Gi, Ca); Ex (E) 
for the group &. 


Proof of equivalence to Conjecture. We first show the above would follow 
from a proof of the Conjecture expressed in Form 2. @,, G2, © -, €n together 
generate the group 8 for otherwise a group D containing Gi, G2,---,€n 
would exist for which 8/9 was cyclic and of order greater than 1. Then by 
hypothesis a group ©; would exist for which (@+, D): D would be greater. 
than 1, contradicting the fact that ©; S D. : 

Let 71, 12,° © ©, Ta be integers for which r, S fre S> > S ra and such that 
B has type (pp tats - ptr), Now ra =r because the rank of 8 is by 
hypothesis less than n. Furthermore the order of 8 being p™, ry + re +: : 
+ Ti 0. Now let, be a group of type (pts pv. - : pts) of rank n 
generated by elements Z:,22,: © -,Z24 If Ci, C2, *', COn be elements of B 
which generate the cyclic subgroups ©;, Ce, > - -, €n respectively, we define a 
homomorphism of %.on 8 by the correspondence Z, — C1, Ze —> Catt, Zn Cn. 
This is possible because Ci, C2,: © -,C, generate B and the order of each 
element of % is a multiple of the order of corresponding element of B. Let 
§ be the subgroup of % which is built into the unit element of B in the 
above homomorphism. As %/ = 8 we deduce from the type of & and 8 that 
$ has type (pipe. pat»), For a subgroup A of % containing § we 
have by the second ‘isomorphism theorem F/A = $/G/U/H. Hence if M is 
reciprocally cyclic, N is built by the homomorphism into a reciprocally cyclic 
subgroup © of B. Now by the hypothesis, as +, = +, a subgroup ©, exists with 
1< ((@,,D):D) S p from which we can conclude AZ (Z), A È (Za). 
Thus % and its subgroups (Z+), 1 Sr n, satisfy all the conditions of Form 2. 
Therefore if the Conjecture. be true a number s, exists with § = (Z,,°""), 
GB (Zar). This implies ©,,9 == (E), Cap 54 (E) i. e. Gs, has exact 
order p’, But as r, = 7, Gy, has order p". 

In the factor group B/@, let C1, - + +, Cor, Cons © +, Oh be the cyclic 
subgroups of restclasses defined by @,: - +, Cois Ger,’ © +, ©» respectively. 
By using the second isomorphism theorem, any reciprocally cyclic subgroup © 
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` 


`of B/ Œa, may be shown to have the form D/G,, where D is a a reciprocbliy cyclic ‘ 
subgroup of P. By hypothesis a series 


(Cip: g 3 Ets D), * +, (Gis D), D 


exists from B to D whose factors have order not greater than pr. DZ = Ci 
it follows from the second isomorphism theorem that the factors of this series 
are isomorphic to the factors of the series | 

| 


B/C, = (Cir Om, WD), °°; woes 


Therefore the factors of this series have order not. greater than pt. | We have 

proved the order of Œ., is p”. Hence the order of the factor group B/6,, is 
p”, We have thus proved that the factor group B/G., and its associated 
subgroups @’,,° + +, Cass Cou * * , Cn satisfy all the conditions of Form 3 
«with n replaced + n—1. We id therefore deduce from the truth of the 
Conjecture exactly as above that at least one element @’,, has order |p" which 
means (Cas Csa) : Ce, = p". | Un 
By repeating this process we deduce after n steps that Œi, G- - -, CA 
after a possible rearrangement of order build an r-series 


(G,;° i * Ca): “"; (Ci Gs), C1, (E) 


` for 8. This shows that the problem stated in Form 3 is a consomen of the 
Conjecture as stated in Form 2. 

To show the converse we need only sn the factor group ÿ/$ and 
its associated cyclic subgroups of restclasses @:, €a - -, €, defined by 
81, Ba + +; Ba respectively. #/$ has order pr". For any reciprocally cyclic 
D of S/$ let D be the subgroup of & of all elements which are built into - 
D by the homomorphism 3} ~%/G. D by the second isomorphism theorem 
is reciprocally cyclic, hence by the hypothesis of Form 2 a subgroup Bs exists 
with Y = 847", DV £ Br. Therefore (D, €:,)/D has order greater than 1 
but not greater than p™. Now the subgroup (D, @:,) is also reciprocally cyclic ` 
and so proceeding exactly as before we could find a subgroup Œs, with 
(Cis Cry D)/ (Ern D) of order greater than 1 but not greater than pr. Con- 
tinuing in this manner in a finite number of steps we could construct|a series 


3/9 = (Cis: 7 "Gr D),° 3 (C D), D 


with cyclic factors of order not greater p™. Thus &/$ and its subgroups 
Ga, Œa: - +, En are seen to satisfy all the conditions of Form 3 with r replaced 
by fa. Therefore if Form 3 of the Conjecture were true a subgroup €, would 
exist with exact order p* which would imply: $ = BoP", $ Æ B7 which is 
Form 2 of the Conjecture. Therefore Forms 2 and 3 are completely equivalent, 
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REMARKS ON MULTIGROUPS.* 


By J. E. EATON and OYSTRIN ORE. 


The present paper may be comsidered as a supplement to a recent paper 
on multigroups by Dresher and Ore.t It contains various contributions which 
lead to simplifications and improvements in certain parts of the previous 
theory of multigroups. The notations and terminology are the same and. need 
therefore not be explained here. 


1. Existence of cross-cut. In the following let Mt denote a multigroup 
and let X and B be submultigroups. The theory of submultigroups differs 
from ordinary group theory in that the cross-cut (W, 8) may be void. On the 
other hand certain theorems in the theory of normal submultigroups require 
that the cross-cut of particular submultigroups shall not be void; hence this 
must be stated as a separate condition, or it can be fulfilled by assuming that 
‘the multigroup contains units. We shall show however that in some of the 
most important cases these conditions are not necessary because the existence 
of a cross-cut follows from: 


THEOREM 1. Let U be a left reversible and B a left closed submultigroup 
of M. Then the cross-cut (U,B) is not void. 


Proof. Let a and b.be elements in & and # respectively and let us 
determine m such that | 
am Db. 


By the reversibility of & follows m C a,b or ` 
| Eu amb D ab Db 


and here a; must belong to 8 since $ is left closed. 
From Theorem 1 follows further: 


THEOREM 2. Leb A be normal and left reversible while B is left closed 
in M. Then every element in the union [A, B] is contained in a product ab. 


Proof. It is obvious from the definition of normality that any element 
not in Y or B must be contained in such a product and for the elements in 


* Received April 25, 1939. i 

+ Melvin Dresher and Oystein Ore, “ Theory of front American Journal of 
Mathematics, vol. 60 (1938), pp. 705-738. We shall quote this paper in the following 
as D. and O. 
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pi and 3 it follows from the existence of an element d belonging both to 
“W and 8. 

On thé basis of these two theorems one obtains the main properties of 
normal submultigroups: ? | 


Let A and B be normal reversible submultigroups. Then the union 
[M, B] is also normal and reversible and the cross-cut (A, A is normal and 
reversible in X and in $. 
… If Y ts normal and reversible and 5 closed in the union [X, H] then 
(X, B) ts normal and reversible in B and there exists an isomorphism | between 
the quotient systems. 
[1,8/4 = B/ (A, B). 


2. Homomorphisms. A multigroup S is said to be Tonene iie to 
another M* when there exists a e a m—>m* between their ele- 
ments such that 

abc 
implies . 
; a*b* D c*, 
Furthermore evéry element of M* shall be the image of some element of M. 
One proves (D. and O. Theorem 12, Chapter 2) that if M* contains a 
left scalar unit e* then all elements Y of Mt corresponding to e* form a right 
multigroup which is right closed and if e* is an absolute unit element! of m* 
then N is a closed submultigroup. 
In order to derive further properties of the homomorphism it is necessary 

to make assumptions on the inverse correspondence from M* to M. 
We shall say that Ÿ is left properly homomorphic to M* when: 
1. M* contains.a left scalar unit 6*. 

We denote by N the right closéd right multigroup consisting of the 


elements in Ÿ corresponding to e*. | 


2 If ms =m", then there exist elements a, and a, in Y such that 
am, D ma, -dig D M. 


This condition shows that Y is left reversible. Hence there exists a coset 
expansion of W? with respect to A (D. and 0. Theorem 9, ae 2) and = 
is easily shown: 


THEOREM 8. If a multigroup DE is left properly homomorphtc to another 
M* then M* is isomorphic to a quotient multigroup 


° These are somewhat simplified statements of the results in D. and O., chap. 3, § 2. 
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ME se M/A 
where U is a left reversible right submultigroup of M. 


In the paper by Dresher and Ore also the following type of homomorphism 
has been introduced: A multigroup W is strongly (left) homomorphic to M* 
when any relation : 
a*b* D o* 


implies that to any b and co corresponding to b* and c* respectively there 
exists some & corresponding to a* such that 


‘aby D Co. 
One can then show: 


THEOREM 4. Let Mt be (left and right) properly homomorphic to M*. 
Then W is also strongly homomorphic to M*. 


Proof. When W is both left and right properly homomorphic to M? 
there must exist an absolute unit e* in WM” and those elements in Nt which 
correspond to e* form a reversible submultigroup W. But in this case A must 
also be normal since all m with the same image m* must be contained both in 
a coset mA and a coset Um. From Theorem 3 it follows that 9* is isomorphic 
to the quotient multigroup M/A. To show that Wè is strongly homomorphic 
to M/A let us assume that a relation 


(1) mW: mW D mA 


holds for three cosets. Any m corresponding to m, may be taken as the 
multiplier of this coset and similarly for mM. Hence we shall only have to 
show that (1) implies the existence of some element + in mA such that 


NT D Ms 


and this follows from the normality of 2. 
This also implies D. and O. Theorem 1, Chapter 3. 


3. Strong normality.’ An important concept in the theory of multi. 
groups is that of strong normality. This concept has been defined (D. and O. 
$4; Chapter 3) under the assumption that right and left units and hence 
inverses exist in the multigroup WM. We shall show here that one can also 
give alternative definitions in which these assumptions are not necessary. 


DEFINITION. A closed submultigroup A of Nt is strongly normal if for 


any m, there exists an m’, such that 


3 This theorem is a corrected form of Theorem 13, chap. 2 in D. and O. In all state- 
ments on p. 721 strong homomorphism should be replaced by proper homomorphism. 
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(2) LD m Am, 

and to every Ma an M'a such that 

(3) i A D ma Ame., 

Let us derive some consequences of thig definition. We determine m,” 

such that 

(4) ; J5 AD m? Am,” 

and from (2) one obtains 

(5) m Am’ Am,” — Am,” — mA. 
- When this is substituted in (4) one finds further 

f YD m’, mA 

and since À is closed 
f mim CM. 

Similarly one has | 

Mae (ee N. 


We show next: i 
A strongly normal submultigroup is reversible. 
If namely | 
am, D m 
_then one can determine some z such that 
ZM D m 
or nn ; 4 3 | 
` ŒaMe D aM, D Ma. 


When this relation is multiplied by m: one finds a relation of the form - 


Zla > Oy 
showing that z belongs to W since W is closed. 
À strongly normal submultigroup ts normal. 


From the reversibility it follows that right and left coset expansions of Mt 


. with respect to Y exist and since each ‘coset contains its ae we 
from (5) ` 
Um, — ma. 


obtain 


Let us finally consider the quotient multigroup M/M: From the con- 
dition of strong normality it follows that for any m, and m, one can find an ms 


such that 
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Wms D Am Mma. 


This indicates however that the product of two cosets guu only a i | 
coset; hence the multigroup M/N is an ordinary group. 


Tusorem 5. The necessary and suficient condition that a mulitgroup | 
M be homomorphic to a group © is that M contain a strongly normal sub- 
multigroup A such that 
© = M/W. 


We have already proved the sufficiency of this condition and the necessity 
follows by the same argument as in the proof of Theorem 12, Chapter 3 in 
D.'and O. 
~ Let us prove finally: 


THEOREM 6. The strongly normal submuliigroups form a Dedekind 
structure. 


Proof. Since the Dedekind Ratio holds for normal submultigroups 
(D. and O. Theorem 5, Chapter 3) we shall only have to show that the 
submultigroups form a structure. 
The union of two strongly normal submultigroups W and % is closed 
(D. and O. Theorem 6, Chapter 2) and [M,8] — NV (Theorem 2). To any 
`m let m’ be determined such that mm’ C A. Then _ 


MABm = mm AH — WB. 


The cross-cut D =- (N, B) is also closed (D. and O. Theorem 4, Chapter 
2). Let m and m be arbitrary. Then 


| As D mDm’, | Br D MDM 


for any s in mm’. If m is determined such that mm’ d, where D contains 
d, then 
XD mDm/, BD mDm 
and hence a 
D D mw. 

Theorem 6. again implies the existence of a unique minimal strongly 
normal submultigroup À, such that M/M, is a group. 

To conclude let us remark that a submultigroup Ÿ can A be said to be 


strongly normal if it is left reversible and the relation (2) holds. This 
definition can be shown to be equivalent to the preceding. | 
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ON THE IMBEDDING OF ONE SEMI-GROUP IN ‘ANOTHER, WITH 
APPLICATION TO SEMI-RINGS. » 


By H. S. VANDIVER. 


In other papers * a semi-group was defined as a set of elements closed 
ander an associative operation and for which the equivalence and the substi- 
tution postulates hold. In the present paper we shall employ instead of the 
substitution. postulate, the postulate that if A= B, then.CA—OB and | 
AC = BC for any 4, B or C in the set, which we shall call the composition 
postulate.? 

A gruppoid * * is a semi-group with an à identity element, that is, an # such 
‘that, AR = EA = À for any À ‘in the set. À quasi-group is a semi-group 
. such that from either of the relations | i 


and ooo AB = AC 
i | ‘BD— CD. 
we infer | ; i -B = oO 


where each letter denotes an element of the semi-group. Cancellable elements 
in a semi-group S are elements C such that if CM == CN then M =N or if 
HC —KC then H = K, where each letter denotes an element of 9. It is 
easy to see that the product of two cancellable elements in § is also canvellable $ 
hence these elements form a sub-set of S which is a quasi-group. Isomorph- 
. ism between two semi-groups is defined in the same way as for the isomorphism 
between two groups, and the central of S will be the set of elements in S 
which are permutable with each element of S, as in group-theory: ‘|A semi- 
group § will be said to be imbedded (or immersed) in another semi-group 
& if 8’ contains a sub-semi-group which is isomorphic to 9. 

Graves‘ in a recent paper showed how to immerse a commutative quasi: | 


` * Received July 3, 1939. 
1 Vandiver, Proceedings of the National Acudesiy of Sciences, vol. 20 (1934), p. 579; 

` Bulletin of the American Mathematical’ ‘Sooiety; vol. 10 (1934), p. 916; America Mathe: — 
matical Monthly, vol. 46 (1839), p: 24. i: -> 

* The relations between -these two, sets of enira and: to other sets of ppstulates 

for a semi-group I hope to discuss elsewhere. . . 
2 Here we follow the terminology used by Specht and Garrett Birkhoff. | Cf. the 
latter, Annals of Mathematics, vol. 35 (1934), p. 351 and note references there given. 
t American Mathematical Monthly, vol. 45 (1938), pp. 664-68. Graves -ealls the 
system I have called a quaëi-group a semi-group! | 
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group in a group. In the present paper we prove a generalization (Th. 1) 
of this result and apply it to the theory of semi-rings. 


THEOREM 1. If a semi-group K contains a cancellable element, and all 
the cancellable elements of S belong to its central, then S may be imbedded 
in a gruppoid S’ whose cancellable elements form an Abelian group G, and 
the identity element of G is the identity element of 9’. 


For proof, let the distinct elements of S be denoted by 
ta, Qa > 
and in particular the distinct elements of this set which are cancellable by 
C Cg 


since the latter, by hypothesis is not a null-set. 
Then consider the set S’ formed by the pairs 


(Gi, cs) 

and write 
(3) (ai, Ca) (ax, Ce) = (idx, Cece). 
Also we shall agree that 
(4) >. (Gn, Gr) = (ai, Ce) 
if and only if 
r Co == Q1Cr. 
By (3) the closure law holds for the pairs since the closure holds for the a’s, 
and also for the c’s. We now examine the equivalence postulates. 

(a, Cs) Lu (tis Ce) 


obviously holds from (4). Symmetry obviously holds since it holds for the 
a’s, As for transitivity, if 


(as, Ce) = (as, ct) 


and 

(as, ct) a (4x, Cr) 
then 
(3) GyCt == jC, Ayer = ACE 


whence, since composition holds in 8, 
GiCeCr == Ajl alr 


and if the c’s belong to the central of 8 then 
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Bille =m QjCrCs 

or from (5), using composition and transitivity in 8, 
| Cret = AjCrl, = dxCtCs — OKCE E 
and since c+ is a cancellable element in § then 


` Mtr Akts . 
‘and by (4) 
(aus Cs) = (ae, Cr). 


Transitivity then holds. We also have that if. 


(6) en (a, Cs) x (ax, Cr) 
then 
(7) (ai, Ca) (ay, Ct) == (Gx, Cr) (as, ct), 
for we have from (6) 
4 QiCr = AUCs 
and 2 
AqCrh jt = Ugh jCt 
or i 


(ilj, CaCt) = (axaÿ, Cree) ; 


whence we obtain (7), using (8) and (4). Hence composition holds since 


' (7) holds for the reverse order of the factors. It is easy to verify 
associative law holds for the pairs, hence, with the above conclusions, 


that the 
we have 


proved, that 8’ is a semi-group.. We shall now show that S’ is a gruppoid. 
Since 8 by hypothesis contains a cancellable element, say c, we note that 


(8) ; (u, Cs) (c, c) _ (ae, Cac) 
and also 
ACC == Ayla = UC, 
. whence 
(ac, Ceo) = (Gi, Ca) 
and (8) gives 


| (Gi, Ca) (6, 6) = (a4, cs). 
We find similarly that 
(c, c) (a, Cs) a (a, Ca) 


so that (c, c) is an identity element of S’, and S” is then a gruppoid. 
We shall now show that all elements of the form 
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(ci, ct) 
are cancellable in &’. For suppose that 
(6,64) (au 0r) = (csc) (ap 0) ; 
then . 
(Calli, Ctr) == (cas, CtCe) 
‘by (3), and (4) gives 
Celi Ct Cy = CeljCiCr 


and since the c’s belong to the central of S and are also cancellable in $ we have 
GC = ayCr, | 
or | 
(a, cr) — (45, Cy). 


We obtain a similar result for the other order of the factors. 
Hence (ca, c+) is a cancellable element in S’. 
Tt will now be proved that any element of the form 


(a, Ce) 
is non-cancellable in & if a; is non-cancéllable in S. For in that case there: 
exist elements a; and a with a; 7 dy and 
( 9) ajas = lihi, 
or there exist elements a;, and ar, with dy, £ Qj, and such that 
(10) | 010), = 1dr, | 


Now if (9) holds we have 
(4504, CC) — (Axli, Ce) 
or 
(ai, c) (a, =e = (ax, c) (ace) 
with 
(arc) Æ (mc). 

We obtain the same result after treating (10) in a similar way. 

Hence all the cancellable elements of 8 are of the form (cs, ct). These 
form a group @ in & since for given elements (cs, ci) and (¢a,c1) we may 


verify that 


_ (Cas Ct) (Cate, Cats) = (on, C4) 


‘and similarly for the other order of the factors on the left, and the result 
follows if we note the product of two cancellable elements in S is a cancellable 
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element in S.- Also the group is Abelian since the c’s are commutative in 


8. We now show that 
' Gy dagt 
is ‘isomorphic with A ‘ 
(asc, c), (a26, pre te 


To show this we note first that the elements of the last set are distinet since 


the a’s are, for 
(ac, c) = (aye, c) 


gives 
i i uc? = ajc?, 
or i 
di = Qj. 
Now if A 
| (arc, c) <> an; h = 1, 2, 3,-- 
and 
| (asc, 6) (asc, ¢) == (axe, c) 
then 


ilj = a% 


and conversely ; hence the sets are isomorphic. ` Also the identity el 


ement of G 


is (c, c), so that the identity element of G is the identity an of 8 and 


our theorem is proved. 


We note the peculiarity that in order to prove the transitivity law in 8’ 
~ we make use of the fact that the c’s are cancellable in S; We may note that 
if we use a non-cancellable element n selected from the a’s in connection with 


the pairs of the type 
5 (a, n) 


then it'is possible to select a particular set such that the law of 


transitivity 


does not hold within it. For let $ contain an annulator, say k, then if we use 


the definition of equality as in (3) we have 
(k, k) = (ai, cy) 


for any ¢ and ĵ, ‘but transitivity does not hold since (ai, cy) (as, 4) forts 


and c; a cancellable element. Since the pairs obviously obey law 


similar to 


those which fractions follow under multiplication in- ordinary arithmetic, we 
see that the above situation is similar to what we have in arithmetic when 


we attempt to employ the fraction 0/0. 
Application to semi-rings. Following closely soie paper 


5 Proceedings of the National Academy of Sciences, vol. 21 (1935), p. 


5 we define 


162. 
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a semi-ring as a system of elements which form a semi-group under addition, 
a semi-group under multiplication and the right and left distributive laws 
hold. Let S now form a semi-ring and employ the notation 


Ke 
c 
for (a,c) and define addition of these symbols by means of 


(11) ny Eten 


and call 
ay Ba =. UEU 
C1 ` Ca C1C2 ” 


multiplication. It is easily seen that addition is associative since 8 is a semi- 
ring. The distributive law holds since 


Ca Cs G ` Cacs 


ay (2 a) das + Uso 
A 


5 (4142) (C163) + (4145) (C162) 


(C102) (¢1¢s) ? 


since c is a cancellable element under multiplication in S. But the last 
expression on the right equals 
C1Ce C1Cs ” 
and similarly we find 
da 2) My _ 2h | Got 
C2 Cs / Ci CoC C361 : 
It is easily seen then, that S’, consisting of the elements a/c, is a RUE 


“Hence we have the 


Taxorex 2. A semi-ring R whose multiplicative semi-group S contains a 
cancellable element, and the cancellable elements of S belong to the central 
of S, may be imbedded in a semi-ring R’ whose multiplicative semi-group ts a 
gruppoid. The cancellable elements of this gruppoid form a group whose 
identily element ts the identity of the gruppotd. | 


We shall call R’ the quotient semi-ring of R. 
Suppose now that R is a ring, then since its additive semi-group is an 
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‘Abelian group using (11) we see that R’ forms an additive Abelian semi- 
group since À does and it is a group since 


C1 Ca À 


has the solution - i 


C1C2 ' 


‘and the word semi-rinig may be replaced by ring in the statement of Theorem 
2. Also we see that any non-cancellable element n of the multiplicative semi- 
group of a ring is zero or a zero divisor in the ring, since 


na = nb 
with a&b gives | | 
n(a—b)=0; 


and n is zero or a zero divisor since a— b40. A ring where division is 
always uniquely possible when the divisor is not zero or a zero divisor is 
called a quasi-field by the writer in another paper.® This term however is 
employed in a different sense by other writers.’ If R is a realm or domain 
of integrity then K’ is a field called the quotient field of R. This special case 
of Theorem 2 is me known. 


University or TEXAS. 


° Proceedings of the National Academy of Sciences, vol. 21 (1935), p. 162. 
"Cf, for example, Albert, Modern Higher Algebra, p. 23. 
# Van der mee Moderne Algebra, lat ed., Bd. 1, p. 7; Albert, loo. oit, , pp. 27-29. 


NOTE ON EULER NUMBER CRITERIA FOR THE FIRST CASE OF 
FERMATS LAST THEOREM.* 


By H. S. VANDIVER. 
For the solution of 
(1) "att ytta—0 


ryz 540 (mod 1); z, y and z rational integers, la given odd prime, we have 
the Kummer criteria 


t 


Beft-an(t) 2 0 (mod 2), 


2 : 
(2) fra(#) =0 (mod 1) 
where 
: (2a) —tesa/y, Y/2, y/2, 2/y, 1/2, 2/3, modulo l, 


(n = 1, 2,- my (1—3)/2), 
fa(w) = =S iwi, 


Further, the B’s are the Bernoulli numbers, B, = 1/6, Bz = 1/80, ete. Now 
all the known criteria which have been derived from (2) for the solution of (1) 
and which are independent of z, y, and z may be shown to have a certain 
` relation to each other. All may be derived from a set of criteria of the form : 
| + [ri/m] .- 3 
(3) O(m, i,r) == 5 st = 0 (mod 1) 
s a=[lr-1) l/m] 
for certain small values of m and where i =—= 1 — 2, 1— 3, 1— 4, I — 6, i— 8, 
1— 10, 1—12; with r in the set 1,2,: - -,m— 1, and criteria consisting 
of linear functions of the type (3) with rational coefficients. 
It is known that + 


(4) bza = S (n— 2i + 1)0 (n, 2a—1, 1) 


modulo l, for 1>8;n5£0 (mod 1) ; P] is the greatest integer in h, and the 
b’s are defined by 


OHI n>1. 
where the left-hand member is expanded by the binomial theorem and by 
substituted for bt. As is known (—1)**Ba=bes. For n=1—1, (4) gives . 
I— nH 


[n/2] s . 
ba TDR = > GARROS REX 


* Received July 3, .1939. : 
1 Vandiver, Duke Mathematical Journal, vol. 3 (1937), p. 572, relation 10. 


79 


80 H. 8. VANDIVER. 


modulo À, and using 


— Lee 4+24- eet (7—1) 1 == bıl (mod 1) 
we have | 
j 1 — ni [n/2] 


(5) a= À Go 1—2, i) | 


modulo l. Frobenius? found criteria of the type (8) for i=l — 2, and various 
values of m = 26. Morishima,’ showed that if (1) is satisfied in Case 1 then 


m= =] (mod l) 


for each m such that 0 < m = 81. apres (5) we din ils of the 


_ type mentioned above. 


\ 


It has been shown, using (2), that for on = 1—3, L—5, L-7, 1— 9, 


1— 11, 1-18, we have the criteria B,==0 (mod l) in Case 1. Using (4) 
we obtain more criteria of the type mentioned in connection with (3). Also, 
employing various formulas due to the writer (1. c., 572-4) we obtain a number 


of congruences each involving only one of the C(m,1,r). 


‘In another paper of the writer’s“ it was shown that if (1) js satisfied 


in Case 1 then 


[1/3] | 
(6) Hu > ri = 0 (mod 1), 


r=1 


and it was shown by Schwindt® that this yields the relation 


: ' [2/8] 
(6a) > es. God 1). 


Te 


From the writer’s article last cited (p. 91, and see also last patagraph in 


article) we also have.the criteria 
(7) -I Fato) Fateh = 0 (haat) 


where:, is an m-th root of unity and ¥ indicates summation, over De distinct 


” values £1, of p;(m,l) —1. Further 
F,(w) =w + Qu +: + (ml—1) 0", 
The value m = 3 gives (6). Set m = 4, then we note that 


3 Berlin Siteungsberichte (1914), pp. 653-81. Cf. also Emma Lehmer, 


Mathematics (vol. 39 (1938), pp. 358-9. 
8 Japanese Journal of Mathematics, vol VIII (1931), pp. 169-173. 
i Annals of Mathematios, vol. 26 (1024), pp. 88-94. 


s Jahresberichte Deutscher Mathematische Verein, vol. 43 (1933-4), pp: 


Annals of 


229-31. 
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sii Sipe pap an, 
ME À Ae E pe ne, 
417 ia 
ee eens +++ (11) wt (mod 1) 


where we regard a fraction of the form 61, where the denominator is a poly- 
nomial in w not all of whose coefficients are divisible by 7, as = 0 (mod l). 
Set w—t/p; we find since p* == 1, {'==t (mod J), 





— (We) py = P(t); — (t/p) ls re (t — 1) P(t) 
modulo l, and 

Z(t) D eal) os (HDA jo) Lee) hale) 
modulo. Now : 


= SE — (t/p— p) (t/p — P) (t/p —p°) 


(t—1) (tp?) (t—p), 
= F 5 
so that (8) becomes, using (7), and noting that t£ 0 (mod?) and t— 1560 
(mod 4), | 
© p(t? — (p+ ot) p) Liste} 0 (modi) 
‘or since eee + p+1—0, we have 


(9) Dol + (P +1) + p)? BAE) =o (mod). 
This congruence is of degree four and since it is satisfied by all the values (2a) : 
then either #— t -+ 1==0 (mod!) or t——1,2 and 1/2. The first con- 


gruence è is inconsistent with (2) and #——1 satisfies (9) identically, but 
t = 2 and 1/2 give in turn’ | 


3 (det + dpt + p) E) 0 (mod 2), 
St t+ ap) Helo cost, 
and subtraction gives 


(9a) + EG" —p) , ale) = 0 (mod 1). 





e Pollaczek, Wiener Bericht, vol. 126 (1917), pp. 1-15. 
6 . $ 
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Now we have? i i 
(46 +3) = — bre 

l — 2 
“where (mb + k)” is expanded by the PRE on and b4 substituted for 
bt in the result. Also 


=(—1)" epale) (mod l), 








t2 _ 
CL 
and subtraction of the last two congruences gives | 
(4b +3) — (4b +1) So (P — e)l | 
(10) 1— 2 = pt ee S 


The left-hand member® is (—1)(4#-3)/2#43,,, where E, = 1, E, == 5, 
E, = 61, etc. are the Euler numbers, and (10) gives with (9a), 


Gay. S E (mod). 
Emma Lehmer?’ gave the relation 


(2/41 1 4] ; L 
et as = (—1) G-/24F ya i 
ra 7. | 


modulo J, and remarked that if we could show the left-hand member ==0 (mod 2), 
provided (1) holds in Case 1, that (11) follows. Here we may reverse this 
process, as using (11) we have | Ho 


[1/4] 
= r= =0 (mod?) l 


as criteria for (1) in Case 1, and this is evidently also included in me class 
of relations (3). Hence we have, using (6a) also, 


THEOREM. If 
at yt + zt 0 


with z, y and z rational integers and tyz 5&0 (mod l); l a given odd prime, 
then 


FE a-ayj2==0 (mod 1) | 


there E, = 1, E: = 5, Es = 61,- : -, are the Euler numbers. Also 


[3/4] 1 | 
— =0 (mod 1). 
ETUDES 


UNIVERSITY oF TEXAS. 


x à i 

T Frobenius, Sitzungsberichte, Berlin (1914), p. 655, formula (2) for n =1—2. 
8 Cf. for example, Frobenius, Stteungsberichte, Berlin (1914), p. 848. ! 
° Loo. oit., p. 359. 


t 





ON EXPANSIONS IN SERIES OF EXPONENTIAL FUNCTIONS.* 


By Marvin G. Moone. 


Introduction. Carmichael has expanded functions of exponential type 
in a series of exponential functions (3) associated with the exponential sum 
h(t) in (1). He has pointed out that, for the special case h(t) = ef — 1, 
(3) becomes the Fourier expansion, the natural polygonal region of con- 
vergence reducing to the line-segment (0,1). We are led, then, to investi- 
gate the possibility of generalizing the properties of biorthogonality and the 
convergence theory of the Fourier series to expansions associated with the 
more general functions h(t). 


I. PRELIMINARY CONSIDERATIONS. 
Let 
(1) h(t) = cent + ce? + - + - +4 ouest, 


where cs 40 and a; 54 a for 74k, and where N = 2. 

Let P be the smallest closed convex polygon in the complex plane con- 
taining the points a, a,---, ay. In special cases, this polygon may reduce 
to a line-segment. Then Carmichael * has demonstrated the existence of con- 
tours Ci, Ca, > > about the origin having the following properties: first, 
there exists a positive e for which 


(2) [Aiet] De 


for every z in P and for every ¢ on every C4; second, if the sectors Sy are 

defined to be those regions in which F (aut) > R(axt) for all Ap (R being 

the real part), C, lies along the circle having radius s and center at the origin, 

except for portions of bounded length lying within a bounded distance of the 

rays which separate the sectors Sy; and third, no point of C, lies outside 
Ste 


The series with which we are concerned are to be of the form 


(3) Š S Palzem, 


=l ki 


* Presented to the Society, Dec. 30, 1937. Received June 19, 1938; Revised July 1, 
1939. 
1R. D. Carmichael, Transactions of the American Mathematical Society, vol. 35, 
No. 1 (1933), pp. 1-28. 
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where the degree of the polynomial Px,(x) is at least one less than the order 
of the zero tx, of A(t), and where tis, dos,” * `, to, are those zeroes of h(t) 
lying between C:4. and Ca. 


II. PROPERTIES OF BIORTHOGONALITY. 


Turores 1. Let w(k,s) be the order of the zero trs of h(t). Let 
Crs be a small circle passing through no zero of h(t) and containing only 
the zero tra on its interior. Then, for q —0, 1, 2,---, w(k,s) — 1, and 
for a any point of the closed region P, | 


1 > a f etoute-æi)t dtd 
== č gigt ——_ Ti 
Pet Í i Cm h(E) 


(4) = 0 for tees oe tim i 
— letra for tks = tjm. 


Upon integration with respect to zı, the left member of (4) reduces to 


the form 
at*q | | g(ias-Haret 
me EENEN A T. b 
z na re @—)!—ye 


which, by Cauchy’s Integral Theorem, vanishes if trs 5& tym. 
Tf tze = tym, it reduces to 


> (x —a)*at’ o) gti; 


where (‘) is the binomial coefficient, and the expression, by the binomial 


theorem, equals 
zigtue, 


We have, then, conditions generalizing the biorthogonality conditions 
pertaining to the Fourier series, here arising when A(t) —eî—1, our 
results in that case taking the form, after the contour integrals are evalu- 
ated: If a is any fixed point of the do (0,1), then for m and i integers, 
positive, negative, or zero, : ] 


1 0 
gtlrt (att) f gilm-l rings, — etats ettm-Dringz, 


i § =0 for ml 
| —emria for m =l. 


If a== 0, this reduces essentially to the customary form of the statment 
of the Fourier biorthogonality conditions. 
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Theorem 1, viewed in the light of the theory of eee: functions, 
suggests the examination of the series (see (3)) 


6) È PE Sa f KORN elonte-a)t (h(t) }dida,, 


which we shall call the F-series. Series (5) is of the form (3). 


Ill. LEMMAS ON CONTOUR INTEGRALS. 


We shall find it convenient to define Py as the region P exclusive of the 
portions | £ — a, | < 7 about the vertices. 


LEMMA 1. For every positive n, there exists a positive K for which 


if, etth(t) dt | < K 
for s=-1,2,---, and for x in Py. 


Since, by (2), the integrand is dominated in absolute value by et at 
every point ¢ of C., for all s, and for all z in P, it follows that the portion 
of C, which does not lie on the circle of radius s, having for its length a. 
bounded function of s, contributes a bounded quantity to the value of the 
above integral. 

To show that a bounded quantity is also contributed by each of the 
circular arcs of C, which remain; for any sector Sy, let c—a,—ret¥ 
t = pe'?; where y, œ are real and r, p are positive. It follows almost imme- 
diately from the definition of Sx that, for s in P and for ¢ in #8, 
R(at) = R(at), so that 

cos(y + $) SO. 


On the are of C, in Sy, as well as at all other points of C, and for all s, 
eM {h(t)} | < e+, so that we shall, along this arc, dominate the integral by 


{er - 
eo) Vero Se fo eine (cy +4). 


Since, for þr S s S 7, cos r S — Pr 1(r— $r), we may dominate this 
expression by . ' . 





r 
2e f etre lode) pdg = artes (1 — eT?) < my te 
{ 


1/2)# 


for r =n. The lemma has then been proved. 
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Lemma 2. Let 0, be the supplement of the angle of P at ap tf ap is a 
vertex of P, and let it otherwise be zero. Then 


ent dt iby | 
oe, h(t) E Gh’ | 


_We shall consider only the case for which ap is a vertex, for: otherwise 
the conclusion is a special case of Carmichael’s result that | 


(6) lim f et (h(t) jdt = 0 
8=0 e Cr 
for z in P and not a vertex.? j 


Breaking C up into C’ and ©”, where C’ is that portion of C's in Sy, 


em di. i nest dt Cuesat dt 
«Sama LES - ilar, MD € 
Take parabolas è with vertices at the origin and with the bounding rays 
of Sp as principal diameters. Writing t = pet”, we then see that the result- 
ing integrands approach zero for ¢ outside the parabolas, for s: becoming 
infinite, and are bounded for ¢ inside the parabolas. The ranges of integra- 
tion with respect to ¢ approach zero inside the parabolas, so that the integrals 
with respect to D approach zero. The integrals with respect to ip likewise 
approach zero, the integrands approaching zero in that case. 








IV. CONVERGENCE OF THE F-SERIES. | 


Let P’ be a convex polygon contained in P and let it have the property 
that for every point qu(u = 1,2, > -, N), there exists a point ’,' in P’ for 
which qu 4- æ—c, lies in P and not at a vertex of P, for all v in P and 
not vertices of P’. In particular, P’ may coincide with P, in ch case 
d'u = Op. 

Let a curve Hp from each point a’, to the corresponding point au be 
made up of a finite number of straight-line segments and let it have the 
property that, for every point x, on Hp and for every x in P’ and not a 
vertex of P’, a, + z— z lies in P and not at a vertex of P. In particular, 
the curves Ha may be taken to be straight lines, although, if straight lines 
are not suitable, we are not, in general, restricted to them. i 

3 Carmichael, loc. oit., p. 24. : 


“Carmichael uses such parabolas for similar purposes, loc. cit., p. 24. ; 
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We shall find it convenient to use the notation 
flz + 0(a@,—2)] = lim f[z + o(p —2)](o 9 030 > 0), 


when the limit exists, in the following theorem. 

Let f(21) be so defined that both its real and imaginary parts are summable 
(L) along each of the line segments of which the curves Hy» are formed and 
also along every straight line segment in closed P’. Further, if P’ does not 
reduce to a straight line segment, let f(z.) be analytic in open P’ and let its 
integral between any two points of closed P’ (taken along any finite num- 
ber of straight-line segments) be independent of the path of integration 
in P. Let a be any point of P’ and let the paths of integration Ly from 
a to @(u—1,2,---,N) be made up of the straight lines from a to op 
combined with the curves Hp. 


THEOREM 2. Let the above hypotheses be satisfied. 


Then, first, for every point x on the interior of P’ (if there be such 
points) ; and second, for every x on the boundary, and not at a vertex of P’, 
for which there exists a positive number y, such that both the real and 
imaginary parts of f(x) are of bounded variation in the linear interval 
[z, 2+ m(e’p»—2)] (=1,2, : -,N); the F-series for f(x) associated 
with h(t) and Lu(u= 1,8," >, N) converges to 


N 
3 È Our” f [e + 0(@,—2)]. 
H= 
For x on the interior of P’, this expression is equal to f(x). 


On breaking up the functions f(z) and 


(7) | f etcure-)t{h (t) }dt 


(which we shall writé as Q) into their real and imaginary parts for s, on any 
straight-line segment (8,y) on Lu, since both the real and imaginary parts 
of Q have continuous derivatives with respect to 2, 


(8) J, ” Fa) Qda; 


(which we shall write as I(8,y) ) may be written as the sum of real and 
imaginary Lebesgue integrals, each of which may be integrated by parts, 
so that 


+E. W. Hobson, The Theory of Funotions of a Real Variable, vol. I (1927), p. 616. 
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Ten fran (an fo f Har) donde 


If (8, y) lies in P’, both factors of the first term of (9) are independent of 
the path of integration in P’, while both factors of the integrand in the second 
term are continuous throughout closed P” and analytic on the interior, so that 
the second term of (9) is independent of the path of integration in P’, and 
(8) must be also. 

, Now let a, and az be any two choices of the point a. Then, under our 
hypotheses, | Ù 


D eu(l(@u 4) —1 (am a2) = f Fle) f eesidiaes 


which, by the Cauchy Integral Theorem, vanishes, so that every term of (5) 
is independent of a, for a in P’. 

We may then (and shall) take a at the point x, writing, by the theory < of 
residues, the sum of the first s terms of (5) in the form 


1 
a 2 Cd (T, au), 


where U and J replace Q and J respectively for Ors replaced by C.. We shall 
then consider the separate terms (for convenience dropping the subscript x), 
which may be written as 


(10) 5S; [fle + o —2) Jude: | 
+55 J" GG) fle +0 —2)]}Ude. 


Upon evaluation of the first integral in (10), its limit as s becomes 
infinite is seen, by (6) and lemma 2, to be equal to 


a ofle + O(au—2)]. 


It will, then, be sufficient for our proof to show that the limit of the second 
term of (10) vanishes. 
We now set 


R{f (2) — fls + 0 (8 — z) ]} = Ai (2, 21) — 4a (7, mi) 


for z, on L (where L now joins x to a), A, and A, being monotonic functions 
of z, for z, in the open interval [x,æ+m(a —x)], approaching zero as 
tı — v and being summable on L. Then for every positive £ there exists a 
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positive n < yı for which | A:(x,ær)| < ¢ for z, in the linear interval 
[z, £ + n(a’ — z) ], so that, upon using the second mean value theorem,” we ' 
see that | o 
w=2+7(a’-2) g $ - 
fe Aa (2, 4%) R(U) aR(m) 


may be dominated in absolute value by 
motn (a-a) f 
(11) © uf R(U)4R(2:)|, 
z$: 


where & lies in the interval [z, z + (a —2)]. Since dR(2,)/dz, is to be 
constant, we may actually carry out the integration, and then, by (2), we 
find that (11) is dominated by 4r£et. 

Take now any straight line portion (8,7) of the part of L not involved 
in (11) and let us consider 


(12) [ITA 8)R(D)AR(a). 


As we prepare to apply Hobson’s General Convergence Theorem,® we 
note first that, by our hypotheses, a + «— a, lies in P and not at a vertex for 
every 7, on closed (8, y). It must, then, be bounded away from the vertices, 
80 that for every positive y there exists an 7 for which a + æ— z, lies in Pi. 
By lemma 1, there then exists a positive K for which | U | < K for all s, so ` 
that | R(U)| is also bounded, and Hobson’s first condition is satisfied. 

Noting that 





<| [rare one) fr ve 


we carry out the integration of U with respect to z, and then apply (6) to 
show that the second condition is also satisfied. 
Then, for every positive 7, (12) approaches zero as s becomes infinite. 
We may then combine the finite number of line-segments which form L to 
obtain the result: For every positive » and for every Positive é, there exists a 
positive integer 5 for which 


| ST RaR) 


(13) | [7 "As(x, 2) DR (a) | < E+ Anke? 
| for s > &.. : 


#E. W. Hobson, loc. oit., p. 618. 

® Reference will be made to E. W. Hobson, The Theory of Funotions of a Real 
Variable, vol. IT (1926), p. 422; see also Proceedings of the London Mathematical 
Society (2), vol. VI (1908), p. 349, and (2), vol. XII (1912), p. 166. 
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We now note that the second term in (10) may, by the separatión of the 
real and i imaginary parts of the factors of the integrand, be divided| up into a 
finite number of parts each of which may be given essentially the same treat- 
_ ment as we have given the integral in (13), so that the second term in (10) 
approaches zero as s becomes infinite, and the F-series converges to 


4E bafle + O(du—x)]. 


_ In particular, if h(t) —e'—1 so that we are dealing with the Fourier 
series on the interval (0.1), we find the series to converge to 


a{f(x+ 9) + f(e—0)}. 


It may be shown by similar methods, for P’ coinciding with F, that, at 
‘the vertex a, the F-series converges to 


3 3 flan + O(a] —4 3 Brea eal tn + 0(a,— a))] 
In particular, the Fourier series converges to 


4{f(L—0) + f(+)} 


at either end-point-of the interval. 


- INDIANA UNIVERSITY, 
BLOOMINGTON, INDIANA. 





EXTREMAL PROBLEMS FOR FUNCTIONS ANALYTIC AND 
SINGLE-VALUED IN A DOUBLY-CONNECTED REGION.* 


By Maurice H. HEINS. 


1. Introduction. It is well-known that certain fundamental inequalities - 
of analysis such as Julia’s Principle of the Harmonic Majorant,’ the Two 
Constant Theorem,? Lindeléf’s Principle the Principle of Hyperbolic Measure * 
are the “best possible” when the domain of definition G, for the functions 
. w(z) involved is simply-connected. When one considers, however, functions 
which are analytic and uniform in a multiply-connected region, these in- 
equalities are, in general, no longer the “ best possible”; it is then a question 
of interest to determine effectively exact bounds and the associated extremal 
functions for these inequalities when we restrict our attention to functions 
which are analytic and single-valued in a given multiply-connected region Gs. 

By application of the Poincaré Uniformisation Theorem ë and the Pick- 
Nevanlinna theory of interpolation ê one can determine effectively the exact 
bounds and the associated extremal functions for the inequalities cited above 
for the case where G, is doubly-connected and has as its boundary two disjoint 
continua. To this end we shall study the problem of interpolation for bounded 
functions which are analytic in the unit circle and satisfy a given functional 
relation. By the results of this study we shall give a method for determining 
effectively the exact bounds at a given point z of Gz and the associated 
extremal functions for the following inequalities: 1) Julia’s Principle of the 
Harmonic Majorant of which the Nevanlinna-Ostrowski Two Constant 
Theorem and Hadamard’s Three Circle’ Theorem are special cases, 2) the 
Principle of Hyperbolic Measure of which the Aumann-Carathéodory “ Starr- 


* Received March 6, 1939. 

1G. Julia, Prinoipes géométriques d'analyse, 2itme partie (Paris, 1932), pp. 26-27. 

`R. Nevanlinna, Hindeutige Analytische Funktionen (Berlin, 1936), pp. 41-42. 

*E. Lindelöf, “Mémoire sur certaines inégalités dans la théorie des fonctions 
monogènes et sur quelques propriétés nouvelles de ces fonctions dans le voisinage d’un 
point singulier essentiel,” Acta. Soo. Soi. Fenn., 35 Nr. 7 (1908). 

t R. Nevanlinna, l. o., pp. 45-61. 

5H. Poincaré, “ Sur l’uniformisation des fonctions analytiques,” Acta Mathematica, 
vol. 31 (1907). 

eR. Nevanlinna, “ Ueber beschränkte analytische Funktionen,” Ann. Aoad. Sot. 
Fenn., vol. 32, No. 7. 
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heitssatz ” 7 is a special case. In addition, our preliminary study permits the 
complete treatment of the analogue of the Pick-Nevanlinna problem for the 
case where the interpolating functions are analytic and single-valued in a 
doubly-connected region. Lammel has considered related interpolation 
problems. 

Recently Carlson and Teichmüller” have considered the problem of 
improving the Hadamard Three Circle Theorem for functions which are 
uniform. Their methods are quite distinct from ours, which admit application 
to other problems as well—the extremal problem for the Principle of Hyper- 
bolic Measure and the Pick-Nevanlinna interpolation problem for doubly- 
connected regions. | 

The author wishes to express his thanks to Professor Walsh for his 
helpful discussions during the preparation of this paper. 


2. The Pick-Nevanlinna theory of interpolation. In this section we 
shall state briefly the principal results of the Pick-Nevanlinna theory of inter- 
polation important for the sequel. For a detailed account of this theory the 
reader is referred to the treatise of Walsh.1° 

Let € denote the class of functions w(z) analytic for |z| <1 and 
satisfying there the inequality | w(z)| <1. Let æ be any complex number 
for which |a| <1. We denote by Z(z,a) the linear fractional function 





a—z 
1— az" 
Further let the points z1,: - >, Zn be given interior to the unit circle | z | = 1 


and let there be associated with each z a complex number wz), | wz) | <1 
(K—=1,2,---+,n). “Define w™,: > -,w, by 


(2.1) Lux), wy?) allie, 2) L (we, |z, | wy) 
(b= 2," + +, 0) 
(where caine 2,) is to be replaced by z, if zı = 0), and, in paca 
tx) (k —v+1l,;:::,n) by the recursive formula 


TQ, Anmain and C. Carathéodory, “ Ein Satz über die konforme Abbildung mehr- 
fach-zusammenhängende ebene Gebiete,” Mathematisohe Annalen, vol. 109, pp. 756-763. 
: ? F. Carlson, “ Sur le module maximum d’une fonction analytique uniforme,” Ark. 
för Mat. Astron. ooh Fys. Bd. 26, 2A9, pp. 1-13. 
? O. Teichmüller, “ Eine Verschärfung des Dreikreisesatzes,” Deutsche Mathematik 
vol. 1 (1939), pp. 16-22. 
#9, L.-Walsh, Interpolation and Approsimation by Rational Finis in the 
Complex Domain, New York, pp. 286-304. 
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(2. 2) L(g, wD) —Læl D(a, ay) L (wy), |: Zv | we) 
(kært i cn) 
L(z, zv) ia to be replaced by z, if zv = 0). Then we bave 





(where læ] 
Zy 
THEOREM 2.1. A necessary and sufficient condition that there exist a 
function w(z) C E for which w (zy) — a where the % aren distinct points ` 
interior to | z | = 1 is that either 


1) fam | <4, [ame [<a +, mp | <1, vR |m, 
nl 5 
or 2) | wi | <1, w| <1,---,| wy") | <1. 


If 1) occurs, w(z) which satisfies the interpolation requirements is unique 
and is given by the formulas (2.1) and (2.2) in conjunction with 


(3) Lim) m) =H aLe) a l) 


(2. 4) L(wy-1(2), toy) ) =l re, zv) L(wv(2), | Zy | wy)) 


(2.5) wv (ze) = we Ss (kev +1, yn), 


where wo(z) = w(z). 

If 2) occurs, w(z) ts not unique. All such functions and only such 
functions are given by the formulas (2.2), (2.4), and (2. 5), where uate) 
ts any function of class €. 


Farther, if 2,%,:-°- (| Y < 1) are infinite in number, Theorem 2.1 
` admits the extension \ l 


THEOREM 2.2. A necessary and sufficient condition that there exist a 
function w(z) C E for which w (2r) — wy (k= 1,2,- - +) is that either 
1) | 20,10) | < 1, | wa?) | <1,---, | wpe) | <1, |w® | = 1 


Atl 
w) = p) oo 
etl Bt 


or 2) | Jw, | <1, |w | <1,--:. 


«If 1) occurs, w(z) with the required properties is unique and is given 
by the recursive formulas (2.2), (2.4), and (2.5) where wy(z) =w %, 


In the situation of Theorem 2.2 we have 
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' Tuxorem 2.3. If there exists a function w(z) C E satisfying the inter- 
polation requirements of Theorem 2.2, then all w(z) CE satisfying these 
requirements can be expressed in the form | 

P(z) —Q(z)woo(z) | 
cee) vl) 1— 8(z)wa(z) | 
where P, Q, and § are specific functions of class € defined by the interpolation 
requirements and woo(z) is an arbitrary function of class €. And conversely, 
every function w(z) defined by (2.6) where wo(z) ts an arbitrary function 


of class € belongs to class € and satisfies the interpolation requirements of 
- Theorem 2. 2. | 


A necessary and sufficient condition that w C € satisfying the inter- 


polation requirements of Theorem 2.2 be unique is that 
P8 —Q=0. 


Remark. If PS -— Q £0, the function PS — Q vanishes at the points 
a and at no other points for |z| < 1. : 





8. À particular interpolation problem. We now turn our attention to 
a particular interpolation problem which is fundamental for the study that 
we shall make. Let Tz(5£z) denote any linear fractional transformation 
mapping | z | 1 onto itself (in the sequel we shall consider exclusively the 
case where T is hyperbolic), and let Uz denote a second such transformation, 
but here we do not require that Uz 5<z; in-fact, the case where Uz =z is of 
prime importance. We wish to study those functions w(z) C È which satisfy 
certain interpolation requirements at assigned points æ (k= 1,2,: >+, n or 
k = 1,2,---) and which satisfy for |z | <1 the functional ‘relation 





(3.1) Cor) = Uwe). | + à 


We shall demonstrate the following 


THEOREM 8.1. À necessary and suficient condition that there exist a 
function w(z) C € for which 


w (2) = we), | ze | <1, w( Te) = Uw, 
(k= 1,2, n or k= tr ;mm0, +1, Æ) 


(it is assumed that these interpolation requirements are consistent) and which 
satisfies the functional relation (3.1) is that there exist a function w*(z) CE 
which satisfies the interpolation requirements for w(z). 


1 
i 
i 
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It is clear that this condition is necessary. 
To prove that it is sufficient we note that if w*(z), the existence of which 
‘is posited, is unique, then i i 
w*(T) = U[w*(2)], 


for U+{w*(T)] CE and satisfies the same interpolation requirements as 
w*(z). Therefore 
U*[w*(T')} = w* (z) 


or w*(T) == U[w*(2)]. 


I£ w*(z) is not unique, let {w*(z)} denote the totality of functions 
w*(z) which satisfy the interpolation requirements and let z be any point 
distinct from all the points T2 


(k = 1,2,- | on or k= 1,2,- . “3 m=—=0,+1,+2,: saji 


Then {w*(2)} is the totality of values which the functions w*(z) C {w*(z)} 
take on at z. Consider the set {U-[w*(T'z.)]}. We assert that 


(3.2) | {w* (zo) } = {UT [w* (To) 1}. 


Let w*i(z) C {w*(z)}, then it follows that U-'[w*,(T)] C E and satisfies 
the interpolation requirements; therefore U~[w*,(T2%.)]C {w*(z)} and 
therefore | 

{U~[w* (Tao) ]} € {w* (20) }. 


Also if w*,(z) C {w*(z)}, then w*,(z) —U7(U[w*.(T°Tz)]]. But 
U{w*.(T*)] CE and satisfies the interpolation requirements. Therefore 
w*, (zo) C {U0-[w*(T2,)]} and it follows that 


{wè (20)} C {U+ [w* (P20) ]}- 


Therefore the relation (3.2) is verified. From this fact we shall deduce 
certain functional relations between P(z), Q(z), S(z) and P(T), Q(T), 
S(T). Let us note that since the solution of the interpolation problem is not 
unique and since zo is distinct from Ta (k = 1,2,---,n or k—1,28,--:; 
m= 0, +1, + 2,-- -), P(2)Q(20) — 8 (20) #0 (cf. remark Theorem 2.3), 
and therefore the set {w*(z.)} fills a proper circle which we shall denote by 
Kz» This follows from the formula (2.6) where w(z) is replaced by w*(z) 
and the statement of the interpolation requirements of Theorem 2.3 is 
replaced by the statement of the interpolation requirements of the theorem 
which we are to prove. The transformation 
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P (2) — Q (20) w* oœ 
1 — 8 (25)10*o0 
maps | #%o a <1 onto Kn; ane the transformation 
P(Tzo) — oCa)ore| 
1— 8(T 29) w* co 


-maps | w*co | € 1 onto Ks, by virtue of the relation (3.2). Therefore the 
transformation ‘1 


(3.5) P(t) — Q (20)t _ v- [Pots Ca] 


1— B(z)t 1—S(Ta)r 


(3. 8) ; wt = 





(3.4) et 


is non-degenerate and maps | # | 51 onto |r| 51. Let us write U(z) in. 
the form | 
e” (g — a) /(az — 1) ((a] <1,0<8< 2r). 


Then (3.5) takes the equivalent form 


wl Ez) — a __ Q (to) — 28 (20) — a8 (zo) , 
(3.6) P(T) —Q(Tzo)r__ ° L&P()—1  _a@P(%)—1_ 
1— S(T2)r 1 — 29 (20) —8 (z0) , 
aP (z) —1 


It follows that (3.6) can be written in the form : 
r= [A (20) + e(20)t]/[L +À (zo)e(zo)t] 


where A(z) and e(%) are suitably chosen, |A(zo)| < 1, |e(z)]=1. A 
necessary and sufficient condition that the transformation (3.6) can be 
written in this form is that the following equations be satisfied : 


. (P (20) —a) _ P (Tz) —X(¢0) Q(T 20) 


“@P(g)—1  1—X(2)S(T2) R 
(3.7) e a aed) has Q(T2o) —A(%)P(T20) 


1—A(%)8 (Ta) ” 

&Q (Z0) — S{20) Lite) 8 (To) — À (20) 

aP (z) —1 °? 1— (20) S (Tee) ` i 
Now the equations (3.7) with the subscript dropped determine e(z), A(z), 
A(z) as functions of z, single-valued, analytic and defined for all values of z 
interior to the unit circle |z | 1 other than the {7'™2,}. The conditions 
le(z)| == 1 and A(z)X(z) real for every such z imply that e and À are con- 


oid. In particular, pp. 296-304. 


< 


= 
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stant. Therefore the relations (3.7) are valid for all z, |z| <1 where «, À 
are constant. Me Re ; i 

Suppose now that w(z) satisfies the required interpolation conditions 
and further the functional relation (3.1). -Then w(z) can be written in the 
form | 


' P (2) — Q(z) we: (2) 
(8.8) w(z) = 1—S(z)wo(z) ? 





and w(z) satisfies the functional relation 


P(T) — Q(T)wo(T) P(z) — Q(z)#%00 (z) 
PER ame L -aeea |” 


and from our discussion we infer that 


(3. 10) wo(T) = U*[weo(z) ], 
where 


U*(r) = (A+ er)/(1 + er). 


Conversely, if woo(z) C € and satisfies the functional relation (3.10), 
w(z) as given by (8.8) satisfies the required interpolation conditions and 
further the functional relation (3.1). Now there always exists a function 
wWoo(z) C E which satisfies the functional relation (8.10). For since U* 
maps the closed interior of the unit circle onto itself, it is either the identical 
transformation or a transformation of one of the following types: elliptic, 
hyperbolic, parabolic. If U* is the identical transformation, the existence of 
a function wo(z) C € satisfying (3.10) is evident; any constant k,|k|=1 
satisfies (3.10). If U* is not the identical transformation, then it is well- 
known from the theory of linear fractional transformations that U*(r) has 
at least one fixed.point in the closed interior of the unit circle |r| 1. Let 
T* be a fixed point of U*(r) ; then it is evident that r* satisfies the relation 
(3.10). (We shall return to the study of the functional equation (8.10) 
and consider the possibility of non-constant solutions.) Thus Theorem 3.1 
is established. a 

Let us remark that when w*(z) satisfying the interpolation requirements 
is not unique, there is a one-to-one correspondence between the functions 
Wo(z) satisfying the functional relation (3.10) and the functions w(z) 
which satisfy the interpolation conditions of Theorem 3.1 and the functional 
equation (3.1). 

Denjoy has given the following criterion for the uniqueness of w(z) of 


vi 
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Theorem ?.2:1? A necessary and sufficient condition that the function w (z) 
of Theorem 2.2 2) be unique is the divergence of the series 


2 1—| 2 | 


3.11 nT. 
( ys ZT Tue] 


Let us remark that in the applications which we’shall make of Theorem 
3.1, we shall consider exclusively the case where T'is hyperbolic. We shall 
demonstrate that if T is hyperbolic, a necessary and sufficient condition that . 
w(z) of Theorem 3.1 be unique is that the function w*(z} of Theorem 3.1 
be unique. But a necessary and sufficient condition for the uniqueness of 
w*(z) and therefore for the uniqueness of w(z) is the criterion of Denjoy 
(3.11) where the notation is suitably modified. 

As we have shown, if w*(z) of Theorem 3.1 is unique, then w(z) is also. 
unique. Suppose now that w*(z) ‘is not unique. We shall show that w(z) 
is not unique. ox 

Let us recall that there is a one-to-one correspondence between the w(2) 
of Theorem 8.1 and the functions wo(z) C € which satisfy the functional 
relation (8.10). If U* of the rélation (3.10) is the identical transformation 
or hyperbolic, there is more than one solution of (3.10) which belongs to 
class €. Two cases remain. U* may be parabolic or elliptic. But in these 
cases the relation (3.10) may be reduced to the following canonical forms: 

A) U* elliptic | | 


fas) =ef), BG) >0, [fIS1 
6 real, À positive (#1), 
B) U* parabolic i r` 


fa) =f) +i Be) >O  R(f) 20 
x real, À positive (5 1). 


Let us consider Case A). It is clear that the function 
| : Ke log #/1og À 
satisfies the equation í 
f(z) = etf (2) 
` where K is a constant. We seek solutions f such that |f| <1 for R(z) > 0. 
| Keit log s/log À | ni | K | e2 larg #/log À). | 


4 A. Denjoy, “Sur une classe des fonctions analytiques,” C. R. de Vacad, des set. 
de Paris (7 janvier 1828), pp. 140-142. 
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But since R(z) > 0, | argz | < 7/2, and therefore e-?(are s/N jg bounded for , 

` R(z) > 0. Thus if we choose | K | sufficiently small, | Ket log s/08à | € 1 

and therefore there is an infinity of functions satisfying 1) f(Az) == ef(z) 

and 2) |f| 1 for R(z) > 0 where 8 is real and A positive (41). 
Similarly B)- may be discussed. The functions 


K+ fF log 2/log À 
where K is constant and R(K) is sufficiently large satisfy all the require- 
ments for f of B). Therefore returning to the equation (3.10), we find that, 
if U* is elliptic or parabolic, there is always an infinity of solutions, and 
therefore, if w* is not unique, w is not unique. Thus we have | 


THEOREM 3.2. Let T of Theorem 3.1 be hyperbolic. Then the criterion 
of Denjoy is a necessary and suficient condition for the uniqueness of w of 
Theorem 3.1. 


4.’ The principle of the harmonic majorant.® Julia has stated and 
proved in his “ Principes géomètriques. d’analyse ” the following principle 
which he terms the “ Principle of the Harmonic Majorant”: 

“Let f(z) be a function of the complex variable z which satisfies the 
following conditions: 


1) The function f(z) is analytic and regular at every point of a region 
G,. The modulus | f(z)| is single-valued for z C Ge. 


2) There exists a function u(z) harmonic and single-valued in G such 
that in the neighborhood of the boundary log | f(z)|—w(z) is less than 
every positive number; that is, for each point ¢ of the boundary and for every 
positive e, there exists a circle with center £ such that at every point of Gs 
interior to this circle the inequality a 

log | f(2)| —u(2) < € 
is satisfied. . : 

If these conditions are satisfied, log | f(z) |S u(z) at every point of Gs. 

If the equality log | f(z) | = u(z) takes place at an interior point of Ge, 
f(z) is of the form ' 


e¥ (a) +to(e) 


where v(z) is a conjugate function of u(z).” 
Let G, be a doubly-connected region whose boundary consists of two 
disjoint continua. (We recall that a continuum is a closed set, not a single 


1 Q, Julia, L c. 
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point, which is well-chained. The degenerate cases may be dismissed as 
trivial.) Further let us require that f(z) be uniform for z C G, as well as 
that it satisfy the hypotheses of the Principle of the Harmonic Majorant. 
If ewtiv ig single-valued for z C Gs, then the inequality 


(4.1) | log | f(2)| = u(2) 


is a “best possible” inequality and e**# is an extremal function for the 
class of functions f(z) which we are considering. 
| In general, e“#, which we shall denote by ẹ(z), is not single-valued and 
the inequality (4.1) is strong. If we continue an element of ¢(z) from a 
given point of Gs along a closed path in Gs containing in its interior one and 
only one of the continua which constitute the boundary of Gs back to the 
same given point, (+) changes to e#¢(z), which transformation we shall 
denote symbolically by 


p(z) — eh (2) (0 < 8 < Rx). 


Tf f(z) is analytic and single-valued for z C Gs and satisfies (4.1), it follows 
that f/ is analytic and has a single-valued modulus for z C Gs. Furthermore — 
| f/p|S1, and f/p— e*f/d. If w(z) is analytic and has a single-valued ` 
modulus for z C G, and in addition 1) | y | <1, 2) prey, then f=y¢ 
is analytic and single-valued for z C G, and |f|<|¢]. Thus if we wish 
to calculate 1. u. b. | f(z)/p(2)| for zo C Gs we may consider the equivalent 
problem of calculating 1. u. b. | (20)| for zo C Gs and for the class of func- 
tions {y(z)} as defined above. It is clear from the definition of w that © 
Lub. | y(%)| < 1 (for 6540 (mod 27), which we shall denote by y, is attained 
by some function y*(z) which belongs to the family {y(z)}.- This is an 
immediate consequence of the theory of normal families.* However we can 
go further. We shall show that x is the limit of a monotonic non-increasing 
` sequence which will be defined below and shall exhibit the totality of extremal 
` functions which correspond to the bound p. 
To this end we shall introduce Poincaré’s uniformisation fonction: = 
Let G,® denote the universal covering surface of Ge and let g= z(x), |s| <1 
denote the mapping function which maps the interior of the unit circle | z | = 1 
one to one and conformally onto G,© such that z(0) —0 and 7(0) >0. | 
It is well-known that z(z) is automorphic under a cyclic group of trans- 


1 P, Montel, Legons sur les familles normales (Paris, 1927), p. 21. 

18 H. Poincaré, l. o.. For the properties of the mapping function s(æ) see G. Julia, 
Legons sur la représentation conforme des aires multiplement connexes es 1934), 
Chap. 2 and 3. j 
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formations {7*} where T is a hyperbolic transformation which corresponds 
to the passing’ from a given point ¢ on a given sheet of G,® to that point of 
G® which has the same geometric ‘position as £ and lies on the sheet. which 
follows (or precedes) the sheet containing &. 

Now consider the class of functions w(x) defined for |x| <1 by 
w(z) =y(2(xz)). It is clear that w(x) is analytic and single-valued for 
[2] <1 and 1) |w(z)|S1 for |x] <1, 2) w(T) —e#w(x), and 
3) w(0) —y(z). Conversely, if z(z) denotes the determination of the 
inverse function of z(x), such that x(2) = 0, and if w(x) is analytic for 
'@| < 1 and satisfies 1) | w| < 1 and 2) w(T) =¢~w(z), then it follows 
that y(z) —w(2(z)) is defined and analytic for z C Ge, has a single valued 
modulus there, and satisfies 1) |y|S1, 2) y— ety, and 3) | #(z)| 
= |w(0)|. Thus there is a complete one-to-one correspondence between the 
two classes of functions {y(z)} and {w(2x) } and l.u.b. | 4(zo)| =1.u.b. | w(0)| 
by virtue of 3) and the extremal functions of one class corresponds to 
the extremal functions of the other class under the conformal representation 
of @,® on |x| <1. Therefore we shall consider in place of our original 

- problem for y the equivalent problem of determining 1. u. b. | w(0)| where 
w(x) is analytic for | z| < 1 and satisfies the two characteristic conditions 
1) |w| 51 and 2) w(T) = w(x). 

Recalling thatu—lu.b.|w(0)|, we see that 0 < a < 1, if 05€ 0 (mod 27). 
For one can construct simple examples of functions satisfying all the require- 
ments of w(x) which do not vanish for 0. Without loss of generality we 
may assume that the extremal value x is attained by a function w(z) for 
which w(0) > 0, since the relation w(T) —e"t#w(#) is linear and homo- 
geneous. Let x, denote the largest positive number such that there exists a 
function w,(z) CE for which w:(0) = m, wi(To) =e Vus, pa (T70) = eH pm. 
For this value of ju, 1) of Theorem 2.1 occurs when the notation is appro- 
priately modified. For if 2) of Theorem 2.1 occurred, x, could be replaced 
by a larger number since all the inequalities in 2) of Theorem 2.1 are strong. 
Thus y, can be calculated directly from the relations 1) of Theorem 2.1. 
Similarly, let x denote the largest positive number such that there exists a 
function .(z) CE for which w(T*0) == e*tfus(k = 0, + 1, +2). Here, 
too, 1) of Theorem 2.1 occurs, and therefore, y. can be calculated algebraically 

_ from 1) of Theorem 2.1 and & = pe. In general, let ua denote the largest 
positive number such. that there exists a function w,(x) CE for which 
w(T*0) = eus (k = 0, +1; +2, :,+n). Once again 1) of Theorem 
2, 1 occurs and ya can be calculated algebraically from the relations 1) of : 
Theorem 2.1. The sequence {pn} is monotonic non-increasing and converges 
to a positive lower bound p*: ` Since x € y, for all n, it follows that p S p*. 
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1 


We shall prove that »* << and therefore conclude that a= lim an. Let 
‘Wa (£) denote the unique function of class € for which ae 


wy (TO) = etan (k=0, +1, +2, En). 

It is clear that the sequence of functions {wa} forms a normal family 
and that any limit function w*(z) of the family belongs to class € and 
satisfies the interpolation conditions w(T*0) — e*4#u* (k=—0,+1,+2,--:). 
Let us recall that, as a consequence of Theorem 3.1, the existence of 
w*  € which satisfies the interpolation conditions w*(T*0 = e*#y*, 
(k = 0, +1, + 2,-- -) implies the existence of a function w C € which not 
only satisfies the interpolation conditions of w* but also the relation 


“w(T) —e#w(z). Therefore p* <p and our assertion that p= lim pn 
n+00 


is established. It is now easy to determine the totality of extremal functions 
_for which | w(0)| =x. They are given by the formulas (3.8) and (3.10) 

where the notation is suitably modified. Thus returning’ to the: class of 
` functions {y} =“ we have considered, we conclude 


THEOREM 4.1. For the function w(z) defined aie we have 
l.u. b. | Y(zo)| == lim un. The totality of extremal functions y*(z) for 
7-00 


which | Y*(20)| — lim un =p ts given by the totality of functions w(x) 


for which | w(0)| =æ |f(æ)| Sa] $ (2o) where equality is attang ter 
the functions y* and only such functions. 


5. The principle of hyperbolic measure. We shall now enunciate the 
so-called “ Principle of Hyperbolic Measure” and show how the methods 
. which we have developed can be applied to study the extremal problems 
‘associated with this principle. 

Let G, and Go be two regions which have each at least three boundary points 
and let f(z) be an analytic function which can be continued throughout Gs such 
that its functional values lie in Gy. Let t(w) map conformally the universal 
covering surface Gu of Gw on | t| < 1 and 2(z) map the universal covering 
surface G® of Ge on |z| <1. We form the function t[f(2(x))] = (x) 
which can be extended throughout | z| < 1 and which takes on values interior 

` to the circle | t| ==1. We denote the hyperbolic lengths * of the four linear 
elements dz, dz, dw, dt by dos, dor, dow, do: respectively so that in accordance 
with the invariance of these lengths under the transformations 2->z and 


16 Cf, note 4. 
1TR. Nevanlinna, Hindeutige Analytische Funktionen (Berlin, 1936), Chap. 1. 
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w—>t, we have dos = dos and dow == dot. We conclude from Pick’s Theorem *8 
that do: = dos and therefore that dow © dos. This is the Principle of 
Hyperbolic Measure. , 

If G, is not simply-connected, and if we consider functions f(z) which 
are single-valued for z C G,, then, in general, the inequality dow <= doz is to 
be replaced by the strong inequality dow < dos and 1. u. b. dow,/dor, < 1. 

Let us suppose henceforth that G@, is a doubly-connected region the 
boundary of which consists of two disjoint continua. We shall consider 
functions which are analytic (save for possible poles) and single-valued for 
2 C G, such that w= f(z) C Ge where Gw is any region the boundary of 
which! contains at least three points and f (Zo) — wo where z is a given point 
of G, and w is a given point of Ge; Our problem is to determine effectively . 
L u. b. dow,/dor. 

As in the general statement of the Principle of Hyperbolic Measure, let 
(x) map |z| < 1 onto G,™ one to one and conformally such that z(0) = 2 
and z’ (0) > 0, and let w(t) map |t] < 1 onto Gy if Ge is simply-connected, 
or if G is multiply-connected, onto G® one to one and conformally such 
that w(0) =w, and w’(0) >0. Then the function (zx) =#[f(z(2))], . 
|z| <1, where that determination of ¢(w) is chosen for which {(w5) == 0, 
as it has been defined above, has the properties $(0) 0 and |¢|<|2| 
for |z| < 1 by Schwarz’s Lemma. Furthermore we know that z(x) is auto- 
morphic under a cyclic group of hyperbolic transformations {7} generated 
from the hyperbolic-transformation T. If G, is simply-connected, the inverse 
function of w(t) is single-valued; on the other hand, if Gy» is multiply- 
connected, w(t) is automorphic under a denumerable group of transformations 
Gw[U1, Uz, +] which are either hyperbolic or parabolic, and therefore t(w), 
any determination of the inverse of w(t), is a linear polymorphic function 
which has the law of transformation - 


#(w) > Uslt(w)] 


when we continue ¢(w) along a path in G,® from a given point on Gy” to 
any point which has the same geometric position as the given point. With this 
fact in mind, let us study the possible functional relations which $(x) may’ 
satisfy when x is replaced by Te. It is evident that 2(T) = 2(x) ; therefore 
f(z(T)) and f(z(z)) have the same geometric position on Gw and therefore 


tLf(2(t))] = Uale(F(2(2)))] 


18 G, Pick, “ Ueber eine Eigenschaft der konforme Abbildung kreisformiger Bereiche,” 
Authemátteohe Annalen, vol. 77 (1916), pp. 1-6. 
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. where Ux is some substitution of the group Gw. Hence (x) satisfies a func- 
tional relation of the form . - 
CT) — Uxlé(z)] 


where Us C Gy. But not all substitutions Ur C Gy are candidates. For if Us 
is to be a candidate, we must have (x) = Ux1[$(Tzx)] and we know from | 
Schwarz’s Lemma that |$|<]zx| for |x| <1; therefore | Ur7[4(T)] | 

SS [a| for |e] <1. Setting z= T10 we find ~ 


| U0 | S| T0 | < L. 


Now there are only a finite number of substitutions Ux of the group Gy for 
‘which this is true.!® Furthermore it is conceivable that there need not exist 
a function ¢ analytic for | s| < 1 which vanishes at c—0 and satisfies the. 
functional relation ¢(T) = Ux[¢(x)]. (Let us remark that apart from the 
` identical transformation the Ux are all hyperbolic or parabolic and therefore 
have fixed points on the unit circle.) By means of Theorem 3.1 we may 
eliminate those substitutions U; which must be excluded, a fortiori, from our 
discussion. Let U,’ '',Um denote those substitutions of Gw, finite in , 
number, such that ¢ as it has been constructed, satisfies one (and only one) 
of the relations © 
$(L) = Uel$(2)] (b= 1,2," +, m). 


Conversely, let ¢ C € and furthermore satisfy 1) $(0) =0, 2) o(T) 

— Ur[p(z)] where Ux is one of the allowed substitutions. Then the function 

f(z) ==w[¢(x(z2))] where the determination of (+) is so made that æ(20) 0, . 

-is defined and analytic throughout Gs. Furthermore f(z) is single-valued. 
For, as æ(z) is continued along a path of G,~ from a given point of G, to 

a point which has the same geometric position as the given point, x(z) is 

transformed into Tz"(z) and ¢(z(z)) is transformed into Us"[¢(a(z))]. 

But w(t) is automorphic under the group of substitutions Gw and therefore 


w[Uu"($(2(2)))] = wlb(x(s))1. 


Therefore f(z) is single-valued. It is evident that w—f(s) C Go and 
f(z) == Wo. Thus the study of 1. u. b. dow,/dos, and the associated extremal ` 


functions is equivalent to the study of Lu.b. “|. and the associated 


a | 20 
extremal functions where t == (s), |s| <1 satisfles 1)¢(0)=0 and 
2) (T) — Uxfb(z)] where Ux is one of the substitutions U, ` +, Um 


1° This is a consequence of the nature of the transformation U,. See G. Julia, l o. 
in note 16. | , 
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since dot == dow and dos — dos. But let ns.note that since | 40) 0, 
dos == | dx | and. do: = | dt | and therefore 


re) se |$ (0). 


From this it follows that 1. u. b. do de: = max p® where p® =l. u. b. | $°(0)| 
eo 


and œ satisfies the following conditions 1) CE, 2) (0) =0, 3) PT) 


= Us[$(2)]. 
` Let us determine »™). We have shown in Section 3 that, if œ satisfies 
the conditions 1), 2), 3), it may be expressed in the form 


(5. 1) OE OS UE 


1— Fa) do (g) 
where bo C € and satisfies a functional relation of the form 
(5.2) go(T) = U*, xl poo(z)] '(U*; linear) ` 


and where P,Q, S have the significance attributed to them in Section 8 
with an appropriate modification of the notation. It is established in the 
Pick-Nevanlinna theory” that S(0) —0 and since (0) ‘= 0, we have 
P(0)=Q(0) =0. Therefore #(0) = P’(0) —Q’(0)¢00(0) where po CE 
satisfies the relation (5.2). We have 


(5. 8) Lu b | #(0)| = Lu. b. | P/(0) — Q(0)e0(0)]. 


We are now in a position to apply the methods which we have employed to 
discuss the extremal problem associated with the Principle of the Harmonic 
Majorant. Let wu,” = max | P’(0) —Q’(0)d0™ (0)| where do (z) CE; 
do can be determined directly. Let pa — max | P/(0) — Q'(0)r4® | 
such that there exists a function go (x) C € for which 


Go! (0) —n®, go (To) = DD, boo (T70) — Dr. 


By 1) of Theorem 2. 1 we are assured of the existence of got (x) since there 
always exists a function doo C € satisfying’ the relation (5.2). It is clear 
that m® 2 ps®. In general, let pa = max | P’(0) — Q!(0)m® | such 
that there exists a function theo"? C € for which 


boot (T10) = Dai ® = (æ0,+1, +42, v&n). 
Tt is clear that {w®} is a monotonic non-increasing sequence, such that, 


Un D = pP, As a consequence of exactly the same reasoning that we have 


5° J. L. Walsh, L o., p. 304. _ 
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employed in studying the extremal questions associated with the Principle of 
the Harmonic Majorant, we conclude that lim pp® = 14 

nwo 


Let u—maxu®. It is now simple to determine the associated extremal 
functions. Let p=- | P(0)— Q'(0)»| with the restriction that | v| <1. 


Now, for those values of v so restricted that there exists a function dao C € 


for which 
po(T0) = U*atr (p = max u®), (i—0,+1,42,-°-) °- 


and only those values there correspond in accordance with Theorem 3.1 the 


associated extremal functions and Theorem 3.1 gives us the totality of such 
functions. By conformal transformation of the independent and dependent 
variables, we find the corresponding extremal functions i in our original problem 
. which is thus solved. 


Let us remark that this result gives the exact value of the “ Starrheits- | 
konstant” Q, of Aumann and Carathéodory’ as well as the associated ' 


extremal fanctions for doubly-connected regions. - 


6. The analogue of the Pick-Nevanlinna interpolation problem for 
doubly-connected regions. Theorems 3.1 and 8.2 virtually contain the | 


solution to the following interpolation problem: 


Let Ge be a doubly-connected region in thé z-plane, the boundary of | 
which consists of two disjoint continua. What is a necessary and sufficient - 


condition that there exist a function w(z) analytic for 2 C G, which satisfies 
the following requirements: 1) | w(z2)| = 1 for z C G,, 2) w(z) is single- 
valued for zC Ge, 3) w(z) = tw (k—1,2,---,n or k—=1,2,---) where 
uy, are assigned complex numbers the moduli of which are not greater than 
unity and the æ are distinct given points of G,? 

If such a w exists, when is it unique? 

If w is not unique, what is the totality of functions which satisfy the 
requirements 1), 2), 8)? 


It is clear from our discussion that this problem is equivalent to the. 
problem of Theorem 3.1 where U is the identical transformation and T is - 
hyperbolic. Theorems 3.1 and 3.2 furnish the solution of this equivalent : 


` problem and therefore of the problem which we have just posed. 
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RAMANUJAN SUMS AND ALMOST PERIODIC FUNCTIONS.* 
By M. Kao; E. R. van KAMPEN and AURRL WINTNER. 


Introduction. Several classical formal trigonometrical expansions of the 
analytic theory of numbers have recently been shown? to be periodic or almost 
periodic Fourier series of the functions which they represent. The object of 
the present paper is tó prove a corresponding result for á class of multiplicative 
arithmetical sequences. 

In particular, it will be shown.that, for the functions to be considered, 
the celebrated formal trigonometric sums of Ramanujan ® are almost periodic 
Fourier expansions in the sense of Besicovitch. Hence, the Ramanujan coeffi- | 
cients will turn out to be Fourier averages which vanish for incommensurable 
values of the frequency parameter, the almost periodic function in question 

` being always limit periodic (grenzperiodisch). It should be emphasized that 
the fact that the Ramanujan trigonometrical expansions turn out to be Fourier 
expansions leads without any further device to his explicit formulae, if one 
writes‘ down the Fourier average representations of the coefficients. 

Although the arithmetical functions f(n) will be. considered only for 
nm—1,2,--*, one can realize the usual assumption of the Besicovitch theory. 
by placing f(—n) =f(n) for n = 1,2,- - - and f(0) =0 (the multiplica- 
tive character of f then remains preserved). It is understood that the class 
(B) of functions f(n) which are defined for integers may be introduced either 
directly or by considering the step function ft) which has the value f(n) for 
n = Sicn+l. 


1. By a multiplicative function f is meant a sequence p (n); n = 1,2,3, 
for which f (nın) = f(m )f (n2) whenever (m, n:) = 1 and f(n) 0 for at: 
least one n (so that f(1) == 1). Only those multiplicative f(n) will be con- 
sidered for which | 


(1) T = IK) ie, f(p) =f) =f) =>, (FD = 1), . 


* Received March 31, 1939. 

1 Fellow of the Parnas Foundation, Lwów, Poland. 

2 A, Wintner, American Journal of Mathematics, vol. 57 (1935), pp. 534-538; Duke 
Mathematical Journal, vol. 2 (1936), pp. 443-446; American Journal of Mathematics, 
vol. 59 (1937), pp. 629-634; P. Hartman and A. Wintner, Travaux de l’Institut Math. 
de Tbilissi, vol. 3 (1938), pp. 113-119; P. Hartman, American Journal of Mathematios, 
vol, 60 (1938), pp. 60-74; A. Wintner, Revista de Ciencias (Lima, 1939) (in press). 

3 S. Ramanujan, Collected Papers, Cambridge University Press (1927), pp. 179-199. 
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where the p denote prime numbers. An f(n) which satisfies (1) will be called 
strongly multiplicative. A elassical instance of (1) is 


$C) onigi, : ($ — Euler's function). 


@) ps rn P pn P 


For any f(n) and for any positive integer k, put 


(8) fP (n) =1 or f® (n) =f (px) according as n560 or n= 0 (mod px), 
where fx is the k-th prime; and put 


(4) fa(n) = IFO (n); so that fa(n) =H F(p), where p< pi 


According to a the function f®(n) of n has the period p» and possesses 
the Fourier expansion 


(6) 10 (n) — 1 + PDT exp (Ori Mn), 


which is, in fact, nothing but the formula of equidistant trigonometrical 
interpolation. According to (4), the function f(n) of n has the period 
Pk = fife’ ` ‘Pxi1Px and possesses, in view of (4) and (5), the Fourier 
expansion 


á fp) — m 
(6)  fe(n) = œt c cis woa ne) eat + z 00 (2x i n), 


where a= Il x (1 + f(p) ite"), 

2. For a function g = g(n) defined for n=—=1,2,3,---, put 

ech 38 
(7) M{g} = M{g(n)} =lim= 3g(n), 
noo meal 

if this limit exists. . 

All considerations will be based on the following elementary lemma: 

If a strongly multiplicative function f(n) satisfies the condition 


then the mean value M{f} exists and 
re f(p) —1 
o) up =u (1+2 =). 
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In order to prove this, let g(n) denote the multiplicative function which 
is defined as 


nf) 1 
P 


according as n is not or is quadratfrei. Then, for every positive integer m, 
| f(m) = 3 dg(à); 


n n h 
teres mn À [E] mac, a 


g(n) ==0 or a(n) =i 


n ‘ n G n 
f(m) =n g(m) + 0(3m | g(m)|). 
Since the definition of g(n) and the assumption (8) obviously imply the 
(absolute) convergence of the series Š g(m) to the sum represented by the 
i m=i 


product on the right of (9), it follows that in order to prove (9), it is 
sufficient to show that 


a . 
‘0(3.m | g(in)|) = 0(n). 
But the last relation is clear from the absolute convergence of the series 


wo 
3 g(m) ; so that the proof is complete. 
m=1 


The proof which we had originally for the above lemma was function- 
theoretical in nature. The above elementary approach was then suggested to 
us by Dr. Paul Erdôs. 


8. A corollary of (8)-(9) is that for a strongly multiplicative f(n) 
one has : 
n o _1—f@)\ galal Le 
(10) ln pes À =n(: FP) ) | FOP) < 


In fact, on writing 5 tirain OOs obtain (0) naines 


32 


z n m=1 m 





if 23 üm — 4, then also 
Similarly, if f(n)\ denotes the A-th power of F(n) when either f(n) > 0 or 
A is an integer, then 


(11) sim ir 3 mo nf) 


it pH C o and à > —1 


110 M. KAC, E. R. VAN KAMPEN AND AUREL WINTNER. 
‘(and (10) may be thought of as the limiting case à = — 1).. In order to 


prove (11), it is sufficient to replace f(n) in (8)-(9)° by .the strongly 
multiplicative function f(n) and- then apply the Abelian lemma: | 


n n y 
if 3 am~ an, then (1A) 3 max — an for every À > — 1. 
m=1 | m=1 te R 


As an illustration, consider the example (2); so that f(p) = 1 — p>. 
In this case, (10) is applicable and goes over into Landau’s relation 





a g 1 e¢eye(a) 


en am) TON ; since u(1+ 


a) 
u(1— p°) 
TI p” (pt)? 





while (9) is applicable to any power of (n)/n and gives Schur’s relation 
D o I 
«(anti 
| n p p\ p 
for every real l (and, as seen from the proof of (9), for every complex / also). 
4. For every strongly multiplicative, positive f(n), let ft(n), f(n) 


denote the strongly multiplicative, positive functions which at an arbitrary 
n = p attain the values 


F(p) — Max (1,f(p)) and f (p) = Min (1,f(9)), | 
respectively. ‘Then (2) shows that | | 
(1%) AAP (M) 0<f(n) SIS p(n); 
while (4) clearly implies that | 
Ta P) Efel F) S feln); 
(8) fo fe) + tet. | 
Notice that either of the functions fa* is uniquely determined by f aid k, i. e., 
that (Je = (fe)* 


Using these notations, it will be easy to deduce from (9) the following 
theorem: 
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E Every strongly multiplicative, positive function f (7) which a ee (8) 
48 almost porodi (B); furthermore, 


COS © M{|f—fs|}—>0, a k> o. 


In fact, it is clear from (7) and. (6) that M {fx} == c». Since cs in (6) 
was defined as the k-th partial product of the infinite- product (9), it follows 
that 
(14 bis) = Uh) > Mf, as k—> oo. 


Hence, (14) is certainly true if either f(n) = f(n) or fe) = < fs(n) for 
Tv n and k. It follows therefore from (13,) that 


(15)  M{|f— f |}—>0 and M{|f — fe |} 0, as k— co. 


But the function (6) of-n is periodic for every f, hence also for f*; so that 
either of the functions fi? of n is periodic for every k. It follows therefore 
from (15) that either of the functions f*(n) is almost periodic (B). Since 
(122) shows that f(n) is a bounded function, it follows from (12:) that: 
f(n) is almost periodic (B). 

In order to prove (14), notice first that, by (13) and (133), 


(15bis) = M{|f—fAl}S M(t — NH HMP — fe) fer}. 


The sum M + M on the right of. (15 bis) may readily be written in the form 
2M {fx ft} —M{f} — M{fe}. It follows therefore from (14bis) and (15 bis) 
that in order to prove (14), ib is sufficient to show that M{fe ft} ~ M{f} 
as k—> œ. But this is obvious from (9) and from the definitions of fx 
and f*. ‘ 


5. The almost periodicity (B). of f(n), proved in 84, implies that the 
n-average M{f(n) exp 2riAn} exists for every real A. It turns out that this 
Fourier coefficient vanishes for every irrational A; so that f(n) is limit periodic 
(grenzperiodiech) ; more se a the Fourier series (B) of f(n) is 


#(p)—1 m 
(16 n) ~M M 335 e cos (2r — n), 
(16) K ) + {f} den MT) — 1, ( F i 
where the first (exterior) summation is over all quadratfrei q > 1, while, if 
q is fixed, the index p runs through all prime divisors p of g, and m through 
the (q) values which satisfy (m, q) =1 and 1= m <q. 
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In fact, _(16) follows from (14), (14 bis) and (6), since Py in (6) was 
. defined as the product of the first k primes. 

The restriction of the first summation index of (16) to quadratfrei q > 1 
may be eliminated in the usual manner, if one introduces the Möbius function 
u(r), where r==1,2,3,; > -. In fact, (16) may then clearly be written in’ 
the form 


f(e) 
QD. fm) ~ MS meyer) n ATOZTE” 


if cr (n) is an abbreviation for the finite sum 


` 


(18)  cr(n) = 3 cos (ra), where (m,r) —=1 and 1=m<r. 
s m 


Since the (r) angles which occur in the sum (18) are symmetrically placed, 
the sum which one obtains by writing sin for cos is 0; so that | 


(18 bis) cr(n) = 3 exp (2ni = n), where (m,r) =1 and 15m <r. 


Thus, the cr(n) are precisely the Ramanujan sums,‘ and so the Fourier series 
(B) of f(n) ts tdentical with Ramanujans formal trigonometric series for 


- .f(n). The coefficients of the series 


(9) fin) ~ È motn) 
are | 


(20)  @r =a (f) = M{f}a(r) H Wy tT (r—=1,2,38,- ++), 


by (17); while the expansion functions (18) of (19) may be expressed © in 
terms of the Euler ¢- -function and the Möbius -function as follows: 


ED a(n) (2) —s(n(£), where t= (m,r). | 


6. According to (16), the frequencies (Fourier exponents) of the almost 
periodic function f(n) are rational numbers between 0 and 1 (or, rather, 
between — 1 and 1): Let the terms of the Fourier series (16) be ordered 
in the Ramanujan fashion (17)-(18), and suppose that each of them actually 


#8. Ramanujan, loc. oit., pp. 180-181. 
50. Hdlder, Lichtenstein Memorial Volume, Prace Mains tome: Fizyoone,. vol. 43 
(1938), pp. 13-28. 
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occurs, i.e., that none of the coefficients (20) of (19) vanishes. Then the 
frequencies of f(n) are uniformly distributed on the interval [0,1] (or rather, 
{—1,1]). This may be proved as follows: 

Since | z(m)| <1, while ¢(m) — œ as m—> æ, Hélder’s formula (21) 
implies an observation of Ramanujan, according to which e-(n) =O(1) 
when either r is fixed and n— œ, or n is fixed and r— œ. In particular, 


(21 bis) . lim GAN), = 0 for every fixed n = 1. 
roo p(r) - 


Now, (21 bis) is equivalent to the equidistribution of the frequencies of (19). 
In fact, let S denote, for any fixed r = 1, the finite sequence 


. (r) (r) mo) 
m Me ( 

(22) SRE oe Oa ea A 
r r r 


of those fractions m/r whose numerator m satisfies the conditions (m,r) = 1 
and 1=m <r. And let pr(t), 0X x < 1, denote the distribution function 
of the ¢(r) fractions contained in 8‘. Then it is clear from (18 bis) that 
the ratio occurring on the left of (21bis) is the n-th Fourier-Stieltjes 
coefficient of pr(#), i.e., that 


1 


(22 bis) f exp 2xinadp, (x) = af) a Met 





0 


Thus, it is clear from the criterion of Weyl for equidistribution (mod 1), that 
the content of (21 bis) may be expressed as follows: The ordered infinite 
sequence of fractions which is obtained by writing r= 1,2,: >> in (22) is 
uniformly distributed on the interval [0,1]. This fact, which is equivalent 
to a result of Pólya, may be obtained without the Fourier analysis (22 bis) 
of the sequence (22) also, and contains the corresponding fact concerning the 
ordered infinite sequence Farey sections.’ 


7. The considerations of §4 and $ 5 may be modified in such a way as 
to lead to (B?) instead of to (B). To this end, one merely has to replace 
the condition (8) by the pair of conditions 


(23) PAC e llegar ge AAG Distal eed 
4 : p z 


Cf. G. Pólya and G. Szegi, Aufgaben und Lehrsdtze aus der Analysis, chap. II, 
no. 188. 
T-C£. loc. cit.*, chap. II, no. 189. 
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In fact, a strongly multiplicative (real) function which satisfies (23) is almost 
periodic (B°) and has the Fourier expansion (16) or (19); furthermore, _ 


(24) . M{(f—fx)?} 90 as k> œ, 


and'the Parseval relation lakes the form 
ao 
(25) | MAP} = 3 (r)a. 


In fact, if (23) is satisfied, then (4) shows that (9) is applicable to any 
of the three functions f(n)’, fx(n)*, f(n)fe(n). Thus, the three averages 
M{f°}, M{f#}, M{ffe} exist and have the respective values ` 

n(1—4=Le") It (ie), 
BP ? PSP» P 

g (aaia e, 

VSP, P P>P» P _ 

Hence, M{f(n)*} + M{fi(n)®} — 2M{f(n)fi(n)} —> 0 as k —'o. This 
proves (24). Since fx(n) is, by $ 1, a periodic function of n, it follows from 
(24) that f(n) is almost periodic (B°). Finally, (25) is clear from (17), 
since (19) and (18) show that every amplitude (20) occurs in (17) exactly 
p(r) times. — l 

As an illustration, consider the example (2). Then f(p) ==1—9p; 
so that (23) is satisfied, and (20) shows that the coefficients (19) are 


(26)... ae AL BAP de (f(n) = $(n)/n). 
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AN ASYMPTOTIC FORMULA FOR EXPONENTIAL INTEGRALS.* + 


By Pan HARTMAN. 


It is known that if f(r) is a function possessing a continuous second 
derivative in the interval 021, then 


(1) f. fe )exp (tas) da — exp(ri/2a)T (1 + a) F(0) 1-48 + O(t-2/2), 


as ¿—> + co for alla=2. Professor Wintner pointed out to me the problem 
suggested by the relation (1), namely, to determine conditions for the asymp- 
totic formula (1) which are less restrictive than the assumption that f(x) 
have a continuous second derivative, and to replace, at the same’time, (1) by 


(2) J” ato)exp(— sat) de ~ Cg (0j, |s|—> ©, 


where s = g -+- té is a complex variable. The object of this paper is to provide 
the answer to these questions. | 

It is clear that a necessary condition for (2) is that o, the real part of s, 
should be non-negative.. The case o==0 requires more stringent conditions 
than the case o > 0. For this reason, the main results are stated in two 
theorems. : 


Tueore 1. If 


(i) g(x) is of bounded variation in 0 S£ S 1, and 


(ii) è> 1, 
then 


(8) 8f g(a)exp(—sa")de — T(5+)g(+ 0)" + o(| s |=), 
uniformly as | s | —> œ in the half-plane | args | S 7/2. 


* Received December 14, 1938. 

} Presented to the Society, February 25, 1939. 

*The case a—2 was treated by O. Perron, “Über das infinitäre Verhalten der 
Koeffizienten einer gewissen Potenzreihe,” Archiv der Mathematik und Physik, Series 
HIT, vol. 22 (1914), pp. 329-340. The formula (1) was proved for a 2 by. A. Wintner, 
“On the asymptotic formulae of Riemann and of Laplace,” Proceedings of the National 
Academy of Sciences, vol. 20 (1934), pp. 57-62. 
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THEOREM 2. If 
(i) the integral? of g(x) over 0 S x <1 exists, 
(ii) g(+ 0) = lim g(x) exists, and 
- B40 
' (iii) 8 > 1, 
then (3) holds uniformly as | 3|—> œ in the ni region | args | = r/2@—e, 
where « > 0 ts arbitrary. If condition (i) holds, but (ii), (iii) are pue by 
GY) | g(x) —g(+ 0)] |logz |8 —>0, z—0, and 
(ii) ê > 0, 
then (3) holds uniformly as | s |—> œ in the angular region | args I< ic Tj? — e. 
It has ‘been recognized * that in the Laplace case, i.e. s real, ri + 51) 
X g(+ 0)s is only the first term of an extended asymptotic formula for the 
“integral in (8) if g(x) possesses a sufficient number of derivatives. | Actually, 
the same is true if s is not real. However, in the Riemann case, i.e. s purely 
imaginary, the remainder term cannot be better than O(t). Furthermore, 


the condition that g(x) possess a number of derivatives may be replaced by a 
much weaker condition. In this direction, we have the following corollaries: 


i 


COROLLARY 1. Let n= 1 be an arbitrary integer ; Bx, Ce, k =m ie “4M, 
arbitrary constants such that ‘ 
(i) OS Bi < f<: ` `< By 


(ii) f(z) = > cea» + h(x)ao%, where h(x) is of bounded variation in 
(Scii KO Lo, and 


(iii) a > 1+ £» | 
then : 


(4) a f f(x)exp (sz) de = Boar (1 + Ba) a] o(| s 0e), 
holds uniformly as | s | — co in the half-plane | args | S 1/2. 


COROLLARY 2. Let n 21 be an arbitrary integer; Pr, Cr k==1,---,n, 
arbitrary constants such that 


‘In this paper, an intergral over a finite interval is to be considered as an ordinary 
Lebesgue integral. An integral over an infinite interval is to be interpreted es an 
improper Riemann integral. 

3Cf. O. Perron, “Über die näherungsweïse Berechnung von Funktionen grosser ` 
Zahlen, “ Münchner Siieungsberichte, (1917), pp. 191-220; A. Haar, “Über Asympto- 
tische Entwicklungen von Funktionen,” Mathematische Annalen, vol. 96 (1926), pp. : 
69-107; A. Wintner, “ Untersuchungen über Funktionen grosser Zahlen ” paar re 

Zeitschrift, vol. 28 (1928), pp. 416-429; A. Wintner, loc. cit. r 
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(i) OS Bi < Be <: . -< Ba 
(ii) f(z) == E oar: + h(x) ah, where the integral of h(x) over 
1 g 

OSTAS 1 erists, 

(iii) A(x) | log x |G) -> 0, z—> 0, and 

(iv) «>Q, 
then (4) holds uniformly as | s | —> œ in the angular region | args | S r/2—«. 


These corollaries reduce to the corresponding theorem if n = 1, 6, = 0, 
cı = f(+ 0). It will be clear from the proof that a corollary analogous to the 
first part of Theorem 2 is true, i.e. if one replaces condition (iii), (iv) of 
Corollary 2 by 

(i) h(t) — 0, — 0, and 

(iv) a> 1+ $h 

It may be noted that Corollary 1 for n = 2, 8, = 0, 8: = 1 is a slight 
improvement over (1) without any assumption as to the differentiability of 
f(x). Also, if one does assume that f(x) possesses m (> 0) continuous deriva- 
tives, an application of Corollary ? gives an asymptotic formula with (m + 1) 
terms, while earlier results in the Laplace case give a formula with only m 
terms. 


First, the corollaries will be proved. By changing the integration variable 
from z to c/@4), B= 0, 


1 1 
6) f Pep(—s)jdr— (148) f espi se“ Jas. 
o o 
Thus, one must consider integrals of the type 
f exp(—se)dz, ym a/(1 +8). 
Now, 
1 fee) 0 
(6) f, exp (— 527) dz == J, exp(—sa)dr— f exp (— sx7) da, 
0 0 l 


where the two integrals on the right of (6) exist if either 


(7) | args | S 7/2 and y > 1 
or 


(8) | args | S 1/2 — e and y > 0. 


4Cf. A. Wintner, loc. cit. 1. 


118 PHILIP HARTMAN. 


In either case ° 


In case (7), ore has 
- (10) | f “ exp(—s0") dz | S4y7[s]7. 


The appraisal (10) is obtained by changing the integration variable from s to 
«vy and applying the second mean value theorem to the resulting integral 
_ (over a finite interval) | 


b : 
y” f sO- A exp(— sx) dr 
5 ; 


; | . 
-r Í, exp(— sr) de + enn fÀ exp(— sr) dz. 


(It is understood that the second mean value theorem is applied separately 
to the real and imaginary parts of the integral and that, in the above formula 
and in the sequel, the following notation is used | 


(11) fi Ge Jde fÀ R( - jata fi IÇ + -)de, 
whenever ¢ is a limit of integration.) By integrating, it is seen that the 
absolute value of each of the integrals on the right is less than 2|s|7. The 
inequality (10) now follows by pes bo+ o. 

On the other hand,’ in case (8) . 
(12) : if exp(— sz’) dr | S Ox | 8 LA, 


where N > 0 is arbitrary and Cy depends only on N and y- To prove (12), 
note that ; 


| f ep smar] s f “exp(— ox") dz, 


where s =— o +- it. By changing the integration variable from x to giia; the 
last integral becomes ; 


(18) ot f exp(— 27) de =o Vl J exp (— 27/*) exp(—— 27 + gl) de, 


SPut s =r exp (i0); then 
f (— ea?) de = r / frs exp (i8) ]de. 


` Jt can be shown by a straightforward application of Cauchy’s integral theorem that in 
both cases (7) and (8) 4 


© exp[— sY exp (19) ]do = apin f” exp (— wT) da, 


while the last biésrdl 4 is T(1 + y2). 
° This appraisal ig given by A. Wintner, loo. cit. 1. 
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where the limits of integration are o/Y and + œ. It follows from (13), that 


ai exp(— sa”) dx | < o an exp(— or) fe exp(— 27 + gY?) da, 


from which one obtains (11). , 
Now, (5), (6), (9), (10), (12) imply 


(14) a Í, È ooh exp(— sx) dx 


— È oP [ (1 H Ba) aJs o(| s -080a 
uniformly as | s |—> œ if either 

| args | S 7/2 and a>1+8, 
or 

| args | S/2—e and a > 0. 


Thus, to complete the proof of the corollaries, it must be shown that 
1 
f h(x) 28 exp(— szt) de = 0(j s |-*/2), 
0 
By changing the integration variable from æ to æ'/(*#), this becomes 


| A[a/+8) exp (— szt) ) dz = o (| s |-U+8/8), 
On placing ‘ 
g (2) = hfe), 
it is seen that the Theorems 1 and 2 must be proved in the case g{-+-0) = 0. 

Define the non-increasing function m(|s|) —m(s) as follows 
(15) m(s)==lu.b.|g(z)| for 0< aS |s |7, 
so that 
m(s) — 0, ls] co. 
Let $(s) = ¢(] s |) be a non-decreasing function of | s | which approaches %2 
with |s | so slowly that 
(16) m{| s| $(s)*}o(s) 30, |s|> . 
Eor example, one may let 
p(s) = min[]| s |178, m(] s [/#) 77]; 

under the last set of conditions of Theorem 2, it may be supposed that? in 
addition to (16) 


TIt may be supposed that 128 > 0, otherwise the second part of Theorem 2 is a 
special case of the first part. In this case, let 
¥(s)= p(|s|)=m(s)log|s]70,] s | > %. 
The function ¢(s) may be defined to be 
min[y¥{| a |2/28-9) -1/2 log3/8 Jeh | 4 [2/28 logi/8 | 2 ER 
for a small constant y > 0. 
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(17) i $(s)®/log | s |> «, [s|— c. 
Now, 


(18) f g(z)exp(— sz?) dx == 5 fe) 21-98 exp(— sr) dz. 


Consider the last integral from 0 to b, 0 <b=1, 


b b 
| Í g (2/8) 5-8 exp(— sx) de | £ m (b) f 200 bdo 
0 0 
= m(b1/5)5b1/8, 
Thus, if one places 
(19). b = | s |7 (8)?, 
it is seen from (16), that 


b $ 
(20) f, g (P) z0 exp(— s2) de = o(| s |10), 
0 


uniformly as | s | —> oo in the half-plane | arg s | & 7/2. In order to appraise 
the integral on the right of (18) from b to 1, apply the second mean value 
theorem (to the monotone function z“~)/*), one obtains 


(21) eaw ("g(a exp(—se)de + f 9(x°)exp(—se) de, 
b £ 

where, cf. (11), é= é(s) = (&, &) satisfies 

(22) D<h<1,, b<&<l. 


The treatment of the integrals in (21) is essentially different for Theorem 
1 and Theorem 2. Under the conditions of the first theorem, g(x/°) is of 
bounded variation and is, therefore, the difference of two non-decreasing func- 
tions. Thus, it may be supposed without loss of generality that g(x") is a 
bounded monotone function, so that the second mean value theorem may be 
applied to each of the integrals in (21). It follows that 


1 
(23) lf g(a?) exp (— sz) da | S Mb 8 | s |7 + 8H |s |, 
b 


where M =l. u.b. | g(x)| for 021; so that by (19), and the fact that 
(s) —> © and (1—8) < 0, the integral in (23) is o(|s|*#) uniformly as 
|s|— œ in the half-plane | args|=a/2. This completes the proof of 
Theorem 1 and Corollary 1. 

Put s == r exp(19) ; in the angular region | args | == | 6| = 7/2—«, one 
has cos 6 = € > 0, for some constant c == Ce Then for any0<p<cqSl 


(24) | Í " 9(a¥*)exp(—su) dz | S f “j g(t) | exp(— erz) da. 
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Denote by § the set of points z in 0 Sz 1 such that | g(z¥/* | > 1; and by 
Ta the set of points in the interval p = x q which are not in S. Thus, 
if z is in Tyg, then | g(2/*)| <1; if æ is in S, then z > for some constant 
7 > 0 since g(x) +0, s— 0. Therefore, by the first mean value theorem, 


es f g (z1) | exp (— cra) dz s f ep(— crz)dz < (cr)™ exp(— crp) ; 
also 


(26) f | 9(@”) | exp(— ors) de S J exp(— ern), 
where 

Tom f° | 9(a*)| ae. 
Combining (24), (25), (26), 


(27) | f gep sa) dz | < J exp(—erm) + (or) exp (— erp). 


Thus, (19). (21), (22), (27) imply 


(28) | (00e 79208208 exp(—sa)de | < RITA + 0-9/9 (s)*] exp(— om) 
+ 236 (s) exp[— ce (s)®] + 2 (cr) exp[— op (s)*]. 
The first term on the right of (28) is clearly o(r1/8) = o (| s |"). Since 
$(s)** exp[—cg(s)"] 30, | s| >, 


the second term is 0(11/5) = 0(|s]-1/#); while the last term is 0(1°) 
=0(|s|"/) if8> 1. Thus, the first part of Theorem 2 follows from (18), 
(20) and (28). 

Under the conditions of the last part of Theorem 2, the last term on the 
right of (28) is also o(r/) even for 1=8>0. For 


7™ exp[— cp(s)°] = 1? exp{(—logr)[cp(s)?/log r — (1 — 8)/8]}, 
and the factor of 1-1/7 is o(1) as 7 —> co in virtue of (17). This completes 
the proof of Theorem 2 and Corollary 2. 


QUEENS COLLEGE, 
FLUSHING, NEW YORK, 


‘ALMOST PERIODICITY AND THE REPRESENTATION OF 
| INTEGERS AS SUMS OF SQUARES.* | 


By M. Kao.** 


: Let re(n) be the number of different representations of n as a sum of 
s squares, Then i ' 


©. na (+239 


Hardy has given an analysis of the arithmetical properties of 1.(n) based on 
the theory of elliptic #-functions. The. object of this note is to point out the 
close connection between the investigations of Hardy + and the theory of almost 
periodic sequences. In particular the “singular series” of Hardy and Little- 
wood turns out to be a formal Fourier expansion of n‘#r,(n). The main 
result of Hardy that for 5 S s <8 the sum of the “singular series” is pre- 
' cisely nter, (n) will be shown to be equivalent to the statement that niir, (n), - 
where 5 S s <8, is a uniformly almost periodic sequence. The case s= è? 
‘is of particular interest, since r.(n) is? not even an almost periodic function 
of class (B), although the Fourier coefficients exist and tend to 0. The in- 
vestigations on almost periodicity of functions occurring in the analytic number 
theory and given by formal trigonometrical series, as originated by Wintner,* 
have led, thus far, to functions which are almost periodic of the class (B?), 
at least. This may emphasize the interest of the situation mentioned above. 


* Received May 11, 1939. 

** Fellow of the Parnas Foundation, Lwéw, Poland. 

1G. H. Hardy, “On the representation of a number as the sum of any number of 
squares, and in particular of five,” Transaotions of the American Mathematical Society, 
vol. 21 (1920), pp. 255-284. Cf. also S. Ramanujan, “On certain arithmetical func-. 
tions,” Collected papers of Srinivasa Ramanujan, Cambridge (1927), pp. 136-162. 

` aA, S. Besicovitch, Almost Periodio Functions, Cambridge, 1932. In particular 
` pp. 91-109. 

3 A, Wintner, “On the asymptotic distribution of the remainder term of the prime- 
number theorem,” American Journal of Mathematios, vol. 57 (1936), pp. 634-548; “ The 
asymptotic behavior of the function 1/§(1 + 4¢),” Duke Mathematical Journal, vol. 2 
(1936), pp. 443-446. Cf. also M. Kac, E.R. van Kampen and A. Wintner, “ Ramänujans 
sums and almost periodic functions,” American Journal of Mathematics, this number, ` 
. pp. 107-114. 
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1. If f(n) is a function defined for n = 0,1, 2,- - + and À a real number, 
the averages Af{{f(n)exp(2riAn)}, where 


M{g(n)} = lim n+ 3 9(i); 


are called the Fourier coefficients of f(n) if these averages exist. If these 
averages exist and if they vanish except for an at most enumerable set of 
A-values, say for A = A1, À,” °°, the series 


à ak EXP(— riin), where a, = A{f(n)exp(2riun)}, 


will be called the Fourier series of f(n), also when f(n) is not almost 
periodic (B). 

It will be shown that the Fourier coefficients of f(n) == n*#r,(n) exist 
and that the Fourier series of n'3r,(n) is the “singular series” of Hardy 
and Littlewood, namely | 


(2) ` p(n) = St S (Sx)‘exp(— 2rthn/ke), 


(Ayn) =1 


k=1 
where Sis = X exp (2rihj?/k). 
j=0 
Suppose first that À is irrational. Then 


n-i 
lim n $ exp (2riàj?) = 0, 
n00 J=0 
and so 
[se] 
(1— q) (1 + 23 qfexp(?riaf)) +0, as g—>1—0. 
T 


Thus, (1) implies that 


[ae] 
(1— q)#8(1 + Sre(j) qi exp(?rixj)) — 0, as q—1—0. 


Making use of a well known Tauberian theorem,* one obtains 


“Cf. for instance J. Karamata, “Neuer Beweis und Verallgemeinerung einiger 
Tauberian Sätze,” Mathematisohe Zeitschrift, vol. 33 (1931), pp. 294-299. The theorem 
cannot be applied directly since the coefficients are not positive. One can make use of 


n 

the fact that n-is Sr, (j) > màs/T (16 + 1) and then to apply the theorem to the series 
1 : 

Br, (7) (1 + cos 2rAn)q" and yr,(n) (1+ sin 2ran)q". See also J. Karamata, “Sur 


les moyenne arithmetique des coefficients d’une série de Taylor,” Mathematica (Cluj), 
vol. 1 (1929), pp. 99-106. 
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a-i à 
nts 3 r,(j)exp(Rmiaf) — 0, as n> co. 
o i z . 


It follows now immediately that, for irrational À, 
M{n= tr, (n)exp(2riàn)} == 0. 


Next, suppose that À = h/k. iene as Hardy does, that 


Six 


: 14-23 gesp (wiht /k) ~ x8 DE (tog 1/9) ~ à Sht Se gy a8 ps0, 


one obtains 
| o a) g 
1+ 3 1,(j) qi exp (2rihj/k) ~ air ( Sw)! (Lg), as g—>1—0. . 
1 r 
Applying again the Tauberian theorem,* one arrives at 


nar ws isi | 
EP : Poe Ak 
n 3 (Dep Cri) > D i je as n'o. 


This obviously implies that 
. A mis Sux wt SaN 
M (wre (esp) = 3 sare (FH) arol) 


and the proof of the italicized statement is complete. 

2. From the classical results concerning the Gaussian sums Örs one 
readily deduces | Sa. | 24% and it is clear that for s==5 the “singular 
„series ” (2) is absolutely convergent. Since it was already proved that the 
“€ singular series ” is the Fourier series of nitr, (n), it follows that 


nit, (n) — pa (7) 


holds if and only if mar, (n) is uniformly almost periodic. The function 
misr (n) is not uniformly almost periodic for s > 8 and it is almost periodic 
for 5Ss=8. This is only a restatement of Hardy’s results; it seems to be 
very difficult to prove or disprove elementarily the uniform ‘irae ae 
of nitr, (n). 


3. In the case s= 3 or s= 4 the “ aab series” is not any more 
absolutely convergent and n‘-#7,(n) is not uniformly almost periodic. Never- 
theless it still has some properties of almost periodicity. In fact, it is almost 


` 
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periodic (B?). For simplicity, only the case s— 4 will be discussed. Let 
-wj;(n) be 1 or 0 according as 7 is or is not a divisor of n, and let y(n) be 
defined by 
9%(a) |n, QV(MAT ny, 


Jacobi’s well known theorem concerning the representation of r,(n) in terms 
of o(n) may then be written as follows: 


1+ ?w2(n) o(n) g Lt 2u2(2) oe oj (1) 
oyma] n 8 ina — 1 j 





n(n) = 8 





N 
It can be easily verified that M{ (no(n) — X -lw;(n))}?} — 0, as N — co. 
f=1 
This proves the almost periodicity (B°) of n“*o(n), since the finite sums 
N 
Z j“lw;(n) are periodic. Observing that the ratio of 1 + 2o» (n) and 2709 — 1 
1 


is also almost periodic (B*), one sees that so is nr, (n). 
From Jacobi’s theorem one deduces by an elementary computation that 
M{n?r;%(n)}— 420€(3). On the other hand the Parseval relation gives 


M{n?r£(n)} = DE LAN tee x | Su: | where r 1S k° 3 a i Sur |8 == 4206 (8). 
= Lei n 

4. The Parseval relation is evidently valid in the case õ S sS 8, and 
it seems to be quite probable that it holds also for s > 8. This would mean 
that if s > 8, then n't,(n) is, though not uniformly almost periodic, at 
least almost periodic (B?). Furthermore it would follow immediately that 
the inequality | n*-#1,(1) — pa (1) | > e holds for “ almost all” integers (e. g. 
except for a sequence of integers of density 0) whatever is e > 0. 


5. The case s==2 is the most exceptional, which is due to the fact that 
r2(n) is 0 for “almost all” integers. r,.(n) is evidently of class (B**), since 
M{r (n) =} == 0. 

As mentioned in the introduction, r,(n) is not even almost periodic (B). 
The following simple proof of this statement was communicated to me by 
Dr. E. R. van Kampen. 

Let n == 2¢p,Fip.bs- - -g,%g,7- - -, where the p’s are primes = 3 (mod 4) 
and the q’s primes = 1 (mod 4). It is well known that | 


14 (—1)% 14 (— 1) 
2 2 





T2(n) = e(t Itl) oo 


Denote the product depending only on B’s by B(n). It is easily seen that 
Bin) is 0 for “almost all” integers and that 8(n)ra(n) = ra(n). Suppose 
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now that r.(n) is almost periodic (B). According to a known theorem * one 
could find a sequence of finite trigonometrical sums (which are in fact certain 
means of the partial sums of the “ singular series”) W4(n) approaching r,(n) 
in the mean, i. e., M{(r2(n) — Wi(n)|} 20 as k— œ. Obviously 


M{ 





ra(n) — p(n) Wa(n) |} == M{B(n) | re(n) — Wi(n)|} 
tends to 0 as k —> œ. This implies a contradiction, since 


Mira(n)}—=m, . M{B(n) | Wala) |} = 0, 
| r2(n) —B(n) We(n) | = ra(n) — 8(n) | Wa(n) |. 


6. It may be mentioned that the remarks of § 1 allow us to compute the 
limits of the expressions 


nè Sr, (jk) 
jal 


as n —> œ and k is a fixed integer. In fact, let o(n) have the same meaning 
kel 
asin § 3. Then og(n) =k" 3 exp(2rihn/k), and so 
A=0 
eee CE eh eee ee 
ly = a is À = ;) 5 
Da pe E E A a a 
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5 Loo. oit., 2) p. 106 (Theorem Il). 


ON THE EXPANSIONS OF CERTAIN MODULAR FORMS OF 
POSITIVE DIMENSION.* 


By HERBERT S. ZUCKERMAN.! 


1. A definition of a modular form of positive dimension has been given 
in a paper by Rademacher and the author.? In that paper we found the 
Fourier expansions of those forms F(r) which belong to the full modular 
group and which have only polar singularities at the parabolic point r — ico 
when measured in the uniformizing variable x == e?it, In the present paper 
we shall also restrict ourselves to functions which belong to the full group 
and which have only polar singularities at r= ico, but in the definition of a 
modular form we shall omit the restriction that F(r) be analytic in the upper 
half-plane and shall merely assume that F(r) has, as singularities in the 
fundamental region,’ at most a finite number of poles: and possibly a polar 
singularity at to. 

This problem of determining expansions of modular forms having poles 
in the upper half-plane was partially ‘considered by Hardy and Ramanujan.* 
However they considered only forms of positive integral dimension which have 
no singularities at the parabolic points. The generalization to forms of real 
positive dimension presents no difficulties. We have only to introduce the 
roots of unity e(a, b,c, d) and e?*#® in the transformation formulas (1.11) 
and (1.12) below, and to carry them through the analysis. However in order 
to take care of forms having singularities at ico we have to evaluate certain 
integrals which Hardy and Ramanujan were able, to eliminate by means of 
simple estimates. This part of the work is contained in sections 3, 4, 5, and 6, 

In this paper we shall consider most of the integrals in the r-plane rather 
than in the z-plane, where x == e°7i". The original path oy of section 2 is 
taken in the r-plane so that we may avoid the poles of F(r) which are easier 
to treat there than in the z-plane. After this point much of the work could 


* Received May 16, 1939. 

1 Harrison Research Fellow. : 

2“ On the Fourier coefficients of certain modular forms of positive dimension,” 
Annals of Mathematics, vol. 39 (1938), pp. 433-462, especially section 1. 

*I¢ is convenient to choose a particular fundamental region. We take the region 
|7|=1,—1/2 = Rr) 1/2, and throughout this paper we shall refer to it briefly as 
the fundamental region. 

+“ On the coefficients in the expansions of certain modular functions,” Proceedings 
of the Royal Society, A, vol. 95 (1919), pp. 144-155. 
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be done in either plane but by keeping in the r-plane we are able to maintain 

a closer contact with the properties of modular forms and to eliminate certain 

rather artificial parts of the proof. For example, instead of using the mediants 

of the Farey series to break up the path of integration we are able to choose 

a set of points which are more closely connected with the form whose expansion 

we desire. i 
We write the transformation equations for our modular form as 





cr + d 
(1.12) F(r+1)—ereP(r), 0<a<l, 


(1.11) p (SLE) ela b,c, d) (er + d)) "F (H), e>0, 


where | e(a, b, c, d)| —1 and where the branch of (—t(cr + d))" is chosen 
as in the original definition of a modular form. 
From (1.12) we have 


(1. 2) era) E (r + 1) == erta F (r), 


and hence e-**#7F{r) has a Fourier expansion for each region in which it is 
analytic. Since #(r) has only a finite number of poles in the fundamental 
region, we can find a number A such that the only singularity of P(r) with 
Y(r) =A is at rico. To simplify the later notation we shall always take 
for À a value = 1. Then, noting that F(r) was restricted to have at most 
a polar singularity at r = ico, we see that we have a Fourier expansion 


(1. 3) etriarF(r) = S Apg? TIRT =a S dut”, üp 5E 0, t= erir, 
n=-B B= ; 
which is valid for $(r) = 4, | a | Seer, 
For all + in the upper half-plane we write 


f(z) = eF (7). 


Then, within the unit circle, f(z) has a pole of order y at a= 0, poles at 
each point corresponding to the poles of F(r), and no other singularities. In 
section 2 we shall determine a closed curve Cy which lies within the unit 
circle, encloses the origin, and does not pass through any poles of f(z), Then 
if y is a point within Cw and not a pole of f(z) we have 


Bi Soy de — (9) + RON), 


where R(N) is the sum of the residues of the function f (2) / (z — y) at the 
poles of f(z) which are enclosed by Cx. We then have 
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(1.4) Of, eg RO), 
from. which to obtain our expansion. This expansion will consist of two 
parts. The part arising from R(N) corresponds to the Hardy and Ramanujan 
results ‘while the part arising from the integral is analogous to the series 
obtained for forms having polar singularities at too and no other singularities 
in the upper half-plane. Despite this similarity it is not true that these two i 
separate parts are each modular forms of dimension r belonging to the full 
group. | 

2. For the case in which F(r) has no poles in the upper half-plane, the 
curve Cy could be chosen as the circle «== exp {— N°}. However this 
curve is not suitable in our case since we must find one that avoids the poles 
of f(z). The curve used by Hardy and Ramanujan can be used but it is 
geometrically more complicated than the one that will be used. 

We first determine a path wy in the r-plane and then take its image in 
the z-plane as Cy. For a positive integer N we let he/ks be the s-th fraction 
in the Farey series” of order N, 


OF EE test, Ose a ar eepe eel 
1> N’ ? Les” ke? ken’ ATNA 





(2.1) 


and we consider the transformations 


j hs he 
(2.2) ` A | Te (7 el 


where we define hy to be — 1 and ko to be N. This transformation belongs 
to the modular group because of well known properties of Farey fractions. 
Under T, the point iœ goes into k,/ke, 0 into hs-ı/ks-1, and the line #(r) = 0 
from 0 to too goes into the semicircle, through the points hes/he-1 and he/ke, 
lying above the real axis. The first quadrant of the r-plane is mapped into 
the area bounded by the semicircle and the real axis. If any poles of F(r) 
lie on the line R(t) — 0 we detour around them with small semicircles 
extending into the right half-plane and deform the semicircles through he/ks 
and he+/ks1 accordingly. These deformations will all extend downward into 
the semicircular area but will not reach the real axis. 

These deformed semicircles join at the points h./k,s to yield a continuous 
path from r == — 1/N to t=- 1, Finally we detour around the points he/ks 


s Certain simple properties of the Farey series will be used without mention in this 
paper. For these properties and their proofe see E. pig Vorlesungen über Zahlen- 
theorie (1927), pp. 98-100. 
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along the line Y(r) = B from each semicircle to its neighbor. The constant 
| B, which may vary with N, is to be taken small enough so that the line will 
meet all the semicircles. It is also to be such that this line does not contain 
any poles of F(r) and we shall later add a further restriction. 











74 e oe 4 $ t $ 4 l i 
The path wy for the value N = 4. 


. For oy| we now choose the part of this path between the ‘points 
—1/(N +14) and (+ N—1)/(N +4) which are the images of the point , 
t = į under the first and last of the Ts. The path oy is entirely above the 
real axis and\it does vot extend further above it than the largest semnicirele. 
The radii of the semicircles are 


| | 

A de OÙ re 

(e =) ia EN EI) SN? 
and hence we have, for r on oy, | | 


(2.3) | | << 


The end ‘cits —1/(N +4) and (t+ N—1)/(N +i) of oy differ 
by unity and hence the imagé Cy of wy in the z-plane is a closed curve. Aleo, 
by (2.3), we ant, for z on Cy, 


| |z| == e rot < 1, 


| ic |a| =i eso < aa \ 


“and, therefore, Cy. - lies within the unit circle and apaneeeles it as N> œ. 
We can now a ae (1. 4) as 


Fy) + f Er (ete yeas — RCN) 
| a 
(2.4) | 
g Le f e*rs(i-a) 7 (q) (e247 — y) “1dr — R(N). 
oy ; 
3. In order to evaluate the integral of (3.4) we break the path wy into ` 
parte, We let Q, be {hat part of oy which joins hs-1/ks-1 + 4B to he/ke + iB 


for all s except the first and last, and for these values of s we let Q, be the `` 
corresponding ae parts of wy. If we write 


| 
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(3.11) lm Í, ei) (r) (arr yin 

we then have 

(3.12) f eri-aT P(r) (eri _y) dr = $ L, 
wy a 


where Ÿ denotes the sum over all s for which there is a corresponding Farey 
a 
fraction (2.1). In (3.11) we now make the change of variable 
| hao + hor 
ter ks + Esi ? 


corresponding to the transformation (2.2). If r lies on the part of Q. that 
does not consist of the line $(r) — B then o lies on the line R(oœ) = 0 or on 
one of the detours around a pole of F(r). If (r) =- B then we have 


(3.21) 





fe) 
The path Qs. The path O,. 
Bene (e+ hes) = (0) 
“8 keo + ke (kf (0) + aa)? + RS (a)* | 





Go +) + Ro) -laa 
ks 2k B kB) ? 
so o lies on the circle with center at —%4/ke + i/(2kB) and radius 
1/(2k:?B). It can easily be verified that the points corresponding to the end 
points of Q, are 
ks- t _ — Bhs 
n aB "7 Bila Fi’ 





g = — 


except for the first and last values of s, in which case one end point corre- 
sponds to o—1 Then as 7 runs along Qes, o runs along the circle from 
— Bk’s1/(Bksks-ı +1) to the line R(o) —0; along this line, making the 
necessary detours, until it again meets the circle; and then along the circle to 
— ks-ı/ka -+ i/ (kB). We shall call this path G.. By taking B sufficiently 
small we can keep the two points of intersection of the line and circle outside 
of the strip 1/4 S S(r) SA. We now have, using (1.11), 
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o g + kai kao + Kean 


(3. 22) a exp ee yee + Res | (Met bes) | 


x (exp { tat eae 4) e+ be) de. 


R 7 | hao + Re } . 
oe Peg) et 
€ S: S 1 Rri( a) ae ca ka 


hse + V PER 


x (ex { Pi ee by)" Et 


where we have used the abbreviation 


(3. 28) 


E = e(hs, ho, ks, ks-1) . 


As wej are later going to let W tend to infinity, we can suppose that it 
has been chosen so large that the curve Cy wholly contains the circle 
z| = (4+ |y|)/2 for the particular point y, within the unit ciele; that 


we are discussing. We then have the inequality 


(3.3) 


— ly] 
en 





for all z on| the path Cy. The integrand of (3.22) can be written as: | 


rte (r y)* (— t(kyo + Kes) JTE (e), 


where r, which is given by (3.21), lies on Q. and x= 6247 lies on Cy. Then, 
for the part of Q, in the strip 1/4 S 9(c) SA we have R(o) = 0 and hence 


| ka + ka [2 = 


I 


| gars arr | == g`?T (17a) 9 (7) < i 
(kR (0) + ka-1)? 4 kX (0)? =, att 
N? 


= Gs (Hoa + ht) > (N — E)? + k) a 


_ Also we see that the path is of finite length, is independènt of N, and is free 
„of poles, so | #(c)| has a bound on this 3 path. Pombining these resulta % we Bee 
that the part of I, due to this part of , is | 





| 


Nt : 
J — |y] , : 
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fe) 0 
The path Qy’. The path Q”. 


The remainder of @, is in two parts. The upper part we call ,’ and 
the lower 0,’”: It is to be noted that Q,” and Q,’ for the largest admissible 
value of s are missing. For o on N,” we make the change of variable 
p =— l/s and let 0,” be the corresponding path in the p-plane. It can 
‘then be seen that Q,’ consists of the path R(c) = 0 from Ai to the circle 
| o + ka-1/ks — t/(2k,?B) | == 1/(2k,°B) and then the arc of this circle on to 
the point — ks-ı/ks +1/(k.2B), while 9,” consists of an arc of the circle 
| p— ks/ka-1 —1/(2h*,.B)| = 1/(2h*s..B) from ks/ke-ı +1/(k:1B) to the 
line R(p) = 0 and then the segment of this line to At. These two paths lie. 
entirely above the lines (co) — Y(p) == A and hence are free of the detours 
that we made to avoid the poles of F(r). We now have 


IENS : hao + her H( f. uae 
(3. 4) L- o f exp f miaa a) OT | (exp | amie 
X (— i(k H ks) ) TEE (0) do | 
a ys ee hs- Ms-1 Pp —— tha Ra fl j et 
Tés Sex 1 2m (1-— Pine Bu exp are 2i 
; ke TA - dp Nw? 
x ( i( He :)) (= P ae 1—]y] 
Nw . ` 
=H; Hs” 0 J 
TUSE = ly] 
pce the tonton equation (1.11) to H4” we ie. 


(3. 51) H” =— ihoa fi exp j mi(1— a) pente } 
| | o ka -1P — Ka 


hs- 1P ha ° i | -r-2 
x (ep 1 Le od i. l— )" (—1(ke-1p — be) ) TE 
where 


(3. 52) e= e(0,— 1,1, 0). 


On 0,” we have 3 (p) 2 A and hence the Fourier expansion de 8) is valid. 
Thus we may write 
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(3.53) | i,t eT + Eel, 
where f 
(3.54) I= otto f e exp Í gadag Pe l 
kaip — ke 


| heip — hs | |: 
| x (cep {ans re (8 | 
\ X (—i(ksip— ee) ) Teria Sayer dp, 
(8. 55) xl avr f. exp j mi(1— a) heap — hı } 

h À hasap ihe E 

wl a 

| A (or ie y 

X (—i(Iesap — hi) ete" À anotrinrdp. 


If p is on QO,” then rm (hs 1p — he) / (Bac 1P — ka) is on wy and = er 
is on Cy and : may write (3.55) in the form | 


Ri’ ge f, Piar (5 — 9) 
| X (—i(kerp— hx)) 1202700 Ÿ again, 


| | Ky" | Sis 





Si. exp{2rt(1 — a)r} (z — y)"*(—4( keep — ka) ) Terrier > aae rined 


„where w” is the part of wy that r runs over as p runs over Q”. From, the 
geometry of the path 0,’ and properties of the Farey fractions we have | 
Tka- e pz l ka- 14i — k, |? = k’, tr: ka 


| , ZA +B, > (N — k)’ += 


aM | 
2 


Also we have S(p) Z A, the inequality (2.3) for r, and (3.3) for x, and hiik 


(8.56) Ka” He ca e Jal) 


E 
a ly | a x | 


In a similar (way we find 





(3.61) H= +E, 
where ne | 
| hao + he- hee + hoa) 
(3. 62) P= f exp À rit — a) pet Bes } (exp | D he boy 


ks -+ Ket 

X \(—i (kao + kaa) terria Ÿ q petted, | 
pel 

| 


po 


i 
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and 


x ' | 4 Nr: f 
3.63 | K,—0(*_ f d ). 
( ) os 5 Doe i—|y| a . 
Now 4’ and w,” are both parts of Q, and they do not overlap. Hence we may 
combine (3.56) and (3.63) to obtain 


(3. 64) RIFE” —0 (aS. lar). 
4 We now consider the integrals I,’ and 1,” of (3.62) and (8. 54). 
In I,’ we make the change of variable 


z= — i(k t keai), 2 emt 
and have 


(4.11) Hu f exp 2ri(1— a) (+5) 


dre ( E) } Baran] he 
where the path of integration consists of the line (z) = — ks-ı from the , 
‘point Ak, —tk,.. to the point at which this line meets the circle correspond- 
ing to the circular part of Qs’, and then an arc of this circle on to the point 
1/(k:B). 
In I,” we set 





_ lg ke 
z == — 1(ks1p — ks), ES Ra AC 
and find 
h i_\} 
I ey nan KOMAT [EZ I = = 
(412) Is F5 eee L f exp} Pri a G -1 rl 








OCR 
: exp js Kea ka-12) $ y 
ka wz \ f . (= tz ) 
y Raw | — 
co a4) Sema) a 
where the path of integration consists of a circular arc and a straight line 
joining the points 1/(k:,B) and Aks- + kat. 
We now wish to combine J’, and I”.v into a single integral. Since their 
paths of integration abut at the point 1/(%.B) it will be necessary only to 
transform the integrand of I”, to show that it is the same as that of I's. 


It is because of this that we chose the path oy in such a way that the first 
and last 9, were incomplete. . The effect of that choice is that I’, and I’. 
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. for the largest admissible value of s are missing and hence when we sum the 
I, of (8.11) to obtain the integral of (2.4) we are to sum the expression 
(Z's + Ti) over the s corresponding to all Farey fractions ee De ‘with the 
exception of the last one, 1/1. 
In order to transform the nee of Is. we first note that from the 
property | 
| hakai — harks = 1, hoake aa haker = 1, : 
of the Farey fractions, we have 





(4. 21) harke ae. heaksat = he- 1 + han : ka 1 + ksr E I 

he ke | 

. Using this, the transformation equations (1.11) and (1.12), mog the 
definitions (3. 23) and (3.52), we find . 


! hart + ho hası TAT + harika- PR ho-1kss1)" + hs 
ERIE ie + = TA Ge eee TF x) 
| > (—: kar + kei | yz( — 1 ' ) 
a * + haska- —Ieakar J © Nr + harka- — haha 
=. €41€00" 7/2 (— (ker + ks-1) TF (r + haute = he-1ksu) 
= nee" 17/2 exp {ria (Rese. — hs-1kon) } (= t (her + ks) ) "F (r), 











and 





(4.23) F Get) = €3(—i(ker + ker)) "F (r), 


and hence we have : 


| | ai 
(4. 3) erir/ie ess exp Iria = exp { = Priv Hest . i 


ue {tte }en | see (D) 


Equation (48) is obtained by comparing (4.22) with (4.28) ‘and: using 
(4.21). Using this result we may write (4.12) as 


L'on tal ffrio th 
(eine = | 
on] na i1 E) he fa (HE) 
which we now combine with (4.11) to get ! 
(4.4) Pin ia ge f exp | Pri(1— a) (E +É) l 
x (exp | emi (32+ E) gra 


iC . ks- H . ka: 1% ‘| | 
X exp | ania (— Hes i + iz) LS a exp — rw (— pa +7) ae 


t 














(2 











yal 
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The path of integration of w 4). extends from Ag — thes t to Ake + thea: 
The exact form of this path need not be known. It is sufficient to note that 
if we trace back through the changes of variable that we have made, we ‘find 
that as z runs over this path then 


(te, i 
ae) 
runs over a part of Cy. By (3.3) we then have - 
hs 
ep fari (Et r) ENT 


and we can expand the denominator of the integrand of (4.4) in a uniformly 
convergent geometric series of non-negative powers of y. Doing this and 
interchanging the orders of summation and integration we obtain 





Eea 


ka pat n=0 
Xf ten] Eo azt (nays de 


5. The integrand in (4.5) has no singularities except at the points 
z—0 and z= so we can deform the path of integration. Ifn+a>0 
we cut the zplane along the negative real axis and take the following path: 


Qt(z) — Aly from Ak, — iks, to the point at which |z | — If 
| z | = M from the point at which R(z) = Ak, to the point — M, 
a loop from — M around the origin and back to — M (on the upper border), 
|z |= M from — M to the point at which R(z) == Ake, 
R(z) — Ak, from the point at which | z|==M to Aks + thes. 


a 


(4. 5) Let Las me te $ Š av oxp | 2r (= (nta) E (veo) 





The path of integration 
for Te + fire 
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‘Calling integrals along these paths Jı, Js, Js, Ja, Js, respectively; we have 
[Ja + tals f Wer exp | E T (7a) + (m+ a) p) Fast : 


= Rar EXP j 2 ((v—a) Al + (n + a) iz) baer, | 
and hence Ja + J,—0 as M œ. Also, on the path of J, we have 


z = Ak, — y, hea = y SM, 


| 

| 

L 1 Ak, Ake 2Aks 
OR RO ES WF? 


PEAR + ei Ske + (N — ke jaz 


E 


and hence i ` 
Aks-400 | Le sf 
nis fi din Pa + (n+ a) 2) | de | 
or exp{4du 3 (n + a) i z |? | dz I) | 
| Aether 


PE © K 
” o( E CER f | dw I); 
(Ak—tha-1) 
where, in the last integral, we have set w= 1/2 and where the path of 
integration is along an arc of the circle tangent to the imaginary axis at the 
origin and passing through the point (Ak, — tks). The length of this path 
is at most /2 times the length of the chord: 


s | Aks — iks | -7 (Atk,® + ke?) € Fi N+, 


H 
and therefore we have 


| Jy O(N exp(4dxN*(n-+2)}), 

and, similarly, | 
Js = O(N exp{44rN(n + a)}). 

Finally we have ê 


lim Js = Fe ar exp ECS (n+a)2) l de 


iji o (y— a)rt fi {74 exp f pp Set |à Er ae } dt 





=i (ES) ra (GE Vu +964). 


° Q. N. Watson, bas of Bessel Funotions (1922), p. 181, (1). 


À 
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Combining these results and letting M tend to , infinity we then find that the. 
integral of (4.5) has the value | 


GD mi ae i (= Conc 5) 
LE O(N exp{4dN2(n + a)}). 


If n + a = 0 we have n = & == 0, It has been shown in the first paper” 
refered to in section 1 that a == 0 implies r = an integer. The integral can 
then be evaluated in this case, as was done in that paper, without cutting the 

z-plane. The result is that (5.1) may still be used provided we use the con- 
‘vention of that paper regarding its meaning in this case. 

We can now combine (2.4), (8.11), (3.4), (3.53), (3.61), ui 64), 

(4.5), and (5.1) to obtain 








(5.2) jy) — 23 {Sard Seem { 2 oni(— (aa) P+ (oa) Se) À 


M) 


+ O(N exp(4dnN*(n-+ a))) ) y" 


HT Stel) +0 (Sa gr) 200. 


The error terms in (5.2): reduce to 





(5.3) O ( z { E Zy exp{4Ar N> (n + a)} | y|” 


Nr jo Ne | 
tia Poe }) 
— O(N exp{4ArN +a) Z Nk? X (exp(44xN} | y |)*) 


+o eof lat) 
= (Nr cpt) (1 —exp (440) | y ANAR) 


ÉTÉ ONCE) 


Now it is easily seen that the length of the path wy has an upper bound 
independent of N. Also we have. | 


SNS EN, “sR SN SN, 


k=1 A=O 


* Loo. oit., footnote 2, sections 6 and 7. 
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and De the error terms (5.8) have the estimate 
|. O(N exp(44xN-a} (1 —exp(44eN} | y|)*), 


` and (5. 2)\may now be written as ! 


68) 10) Sard Suen {ai (0+0 Ba oma He) } 


— q \ (tt) /%, ap pa 
x (5) La (E VEFA) ; 
— R(N) + O(N- exp{44rN+a} (1 —exp(44xN*} | y |)+). 
6. If we let N— œ in (5.4) the error term will fend to zero and we 
shall have the desired expansion of f(y). In order to perform this we must 


free (5.4) F its dependence on the Farey series of order N. Thatjis, we 
must replace |the term i En 








€s EXP j mri (v — a) fea 
; š 
. by an equivalent expression involving h, and k, but not ha- and k,1. | It is 
convenient at | his point to omit the subscript s, writing À and k for, ha and ka 
but still using he. and k,:. We first define k’ to be any solution of the 
congruence i i ; | | 


(6.1) | = hh’==—1 (mod k), ` 

from which welhave, at once, LOU i 
W + ke1 = 0 (mod k). CT | 

By (3.23) and (1.11) we have a í i 


hr + hea a | i 
p (hea) L e (ilke + km) FCO). | 


On the other hand we use (6.1), (1.11), (1:12), and the fact that he1/Ke-1 
and h/k are successive Farey fractions to get | | 


hr + Ron | (h + (ksa + W)/E) — (1 + hk’) /k 
A (GE) 4 ( Er + ea EW) /k) —# ) 


/ 
= (r= 1 tAr sh —W) (ier + ksi)" 
| z H 
exp | mia HAEN | p), Ys 
Comparing these two results we have 


7 s $ 
a= (nH kW) exp j aria HEREN, 
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and hence, since » is an integer and k divides (h + ks), 





Using this we may now write (5.4) in the form 


y — qa \(r+)/2 
n+ a 


X Im (= V(y—a) (n+ a) ) y"— E(N) 
+ O(N- exp{44rN-ta) (1 — exp{44rN=)} | y |)), 


A 


N n 1 Co 
(6.21)  f(y) = 27 à Sas S Avr) ( 
kzi vi n=O 





where 


(6.22)  Anv(n) = = (nt rx) 





0 
Gk)=1 


x exp} — ari ((n a) + va) EF). 


If Nc the first term of the right member of (6.21) becomes an 
infinite series which is easily seen to be convergent. ‘The second term becomes 
R(o) which also stands for an infinite series. Since the error term becomes 
zero, this second series then converges, provided its terms are summed in the 
proper order. By R(«) we shall mean the infinite series whose value is 
jim R(W). Then we have: . 


THEOREM 1. If F(r) isa modular form of positive dimension r, having, 
as singularities in the fundamental region, at most a finite number of poles 
and a polar singularity at ic, then we have the expansion 


F(r) pane e?rtarf ( garir] ; 


o: 12 y — g \(rt1)/2 
(6.3) f(y) =ar] Yared duv(n) =) 


X Ina (E VO QE )n Re), 





where Axv(n) is defined by (6.22) and the remaining constants are de- 
termined by the transformation equations (1.11) and (1.12) and by the 
“ principal part” of the Fourier expansion (1.3). 

7. We shall now obtain the series representation for R(co) in the case 
in which F(r) has a single simple pole in the interior of the fundamental 
region and no other singularities in this region except a possible polar singu- 
larity at io. The value of R(oo) in other cases can be obtained in a similar 
manner, with obvious modifications. 
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We let F(r) have its simple pole at the point r— 0 and suppose that 
its residue there is R. Then, expanding F(r) about this point, we have 


(7. 11) : Fr) = + È e(o)” 


Now if r lies in the triangle which is obtained from the fundamental region by 
the ianatoemeHion . 


(7.12). Ç J 


then (— dr + b)/(cr— a) lies in the fundamental region and we have, by 
(1.11), 


(7. 18) F(r) = e(—d, b, c, — a) > (—i(cr— a) yF ==). 


CT — a 


In this triangle we have a single determination of the branch of (—i(cr—a))" 
and therefore F (r) is analytic except for the parabolic points and the points 
(as + b)/(ca + 4d). For r near the latter points we combine (7.11) and 
(7.18) to obtain the expansion 


F(r) = e(— d, b, c, —a) > (— (or —a))* 
R © f—dr+b n 
x À == +Èe( Cr — a —) 


medio —o))r{ PE 


Se ier a 


from which we see that F(r) has a simple pole at (ao + b)/(ca + d) with 
the residue 


ire ( (EE) (EH) EE 
me e(— d, b, o, — a) 4(— (co + d)) TAR. 





Corresponding to this pole, F(z) == g °rtarF (7) will have a simple pole at 
the point 


am © e(a), 





with the residue 


(7.22) — Sri exp { mi(1— a) — gti 5} e(—d,b, c, — a)" (— i(ce + d)) "k 





Not all transformations (7. n will give distinct points (7.21). Since 
& is not a vertex of the fundamental region we shall get each point 
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(ac + b)/(co + d) once and only once if we take the identity transformá- 
tion and all transformations (7.12) with c21. Two of these points, 
(ao + b)/(co+d) and (mo + b;}/(cio + d), will. yield the same point 
(7%. 21) in the z-plane if and only if their difference is an integer t, 


moths goth, (a+ tele + (b+ td) 

0 + di cr+d cot d : 
Therefore we can take each pair of integers p and q such that p21 and. 
(p,q) = 1, choose a single solution p’ of the congruence pp’ ==— 1 (mod q), 
and then take for our transformations (7.12), all the transformations 


wo GDP) 


We also must consider the poles in the z-plane due to the parabolic 
points. The point rte corresponds to x= 0 while the other parabolic 
points correspond to points on the unit circle and hence are not included in 
R(o). For |s| small we have, by (1.3), 





O0 
f(z) — Sanz, 
n=-4 
and hence, since y 54 0, we find, at z = 0, 
Dal aha A Pe ae à (2) 
T—y y 1—sa/y > m=o \Y 
= — Í, l-my”. 
m=i : 
From this, (7.22), (7.8), and the fact that y is distinct from the . 


singularities of f(x), we find that the sum of the residues of f(x)/(«—y) 
at all the poles of f(x) within the unit circle has the value 


& a 
— À amy il -+ RriRerTtl-a)a (gario EM y) -1 


(7.4) —%iRS S (—0r, nH" 


X (ilp + g) reari (abris —y) 
where, in an effort to simplify the notation, we are using the abbreviation 
| 1)/g)o + 
7.5 ,_ (Pr + f 
oe) ee po + 9 


so o” is a function of the indices of summation as well as of o. 
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The infinite series (7.4) is absolutely convergent. To see thia we first 
note that, since o is in the fundamental region, there are only a finite number 
of its imagés of the type (7.5) which lie above any fixed line parallel to the 
real axis and ‘above it. Also we have | 
| 
axi, Ye)>0, met 2) | = 





(= q; r P: — 


If y is in a region 0 < yo S jy] <i mets ne aside the finite number 
‘of terma for which ' 


Jare | < LE, 


we see that the infinite series in. (7. 4) is majorized by the series | 


ár | R f 
IES S| po + gj, | 
Yo p q i 
which is known to converge for r > 0. | 
Since (7.4) is absolutely convergent we can rearrange its terms to agree . 
with our definition of R(). Then we have: 


` THEOREM 2. If the only singularities of the modular form F(r) of 
Theorem |1, in the fundamental region, are a possible polar singularity at ico 
and a simple pole with residue R at the point o in the interior of the funda- 
mental. region, then the quantity R(o) of Theorem 1 has the value (7.4) _ 
which ts absolutely convergent for any y inside the unit circle which is not a 
singularity of f(s). 


if P(r) has several simple poles the value of R(co) : is the sum of 

- corresponding expressions of the type (7.4). If F(r) has a simple pole at a. 
vertex of| the fundamental region the restrictions on p and q in Ge 4) have 
to be strengthened. 


8. As in the case in’ which F(x) is analytic in the upper half-plane,® 
we can characterize the class of all functions which satisfy the conditions of 
Theorem'1. We shall find this characterization in a somewhat different way, 
basing ition a formula connecting the number of zeros and poles of la modular 
form. We suppose that F(r) has Z zeros and P poles in the fundamental - 
region omitting the vertices, ico, i, p == e**/*, and p°. Also we suppose that, 
at 40, F{r) has an expansion of the form (1.8) with the associated constants 
a and a, while it has zeros of orders s and t ae i and p respectively. The integers 





8 Loc. oit., footnote 2, section 8. 


ha 
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s and ¢ are to be taken negative if F(r) has a pole at the corresponding | 
points. Then these constants are connected by the formula : 


(8.11) P+r—a— (z+ Le DE Z. 


This formula can be found by considering the integral, 


(8.12 A f adog F(r)) — 2 —P, 


taken over the path formed by the sides of the fundamental region and a line 

parallel to the real axis and sufficiently far above it. Circular detours are 

made around the points i, p, p°, and any poles of F(r) ; and then (8.11) is 

found as the limit of (8.12) as the horizontal line approaches 4 and the 

circular. detours shrink down to their points. | . 
We now multiply F(r) by factors to obtain a new function which has 

no zeros or poles in the fundamental region, including the vertices. To cancel 


the Z- zeros and P poles, which we may suppose to be at the points p1, pe'**, pz. 


and o1,02,° `, or, respectively, we use the factor 


(8.21) @(r) = TT (+) —I(o1)) LT Gr) —Io) 


which may introduce new zeros or poles at sco but nowhere else. To take 
care of the finite vertices we use the factor ° 


-+ = 
(8.22) 86) = (VIG) —1) (WH) 
Finally the factor »(r)?" will suffice for the point T==10. To see this we 
expand the function 
(8. 23) U(r) = F(r)n(r)#8(r)8(r) 
about the point 4 iœ, using the known expansions of Je) and a(r) and the 
expansion (1.3) of F (7). We then have 
} ‘ 
Pa Tr ,8,t X S. aein 
U(r) = exp{2rir(a — p + St ÊH É — P + Z) } $ onerinr — S opetin, 
12 2 3 n=0 n=0 


where we have used (8.11) to obtain the last equality. 

The form Y(r) is a modular function. To prove this we need consider 
only. the two eee of the modular group. For the function (8.21) 
we ‘have 


° We use the usual determinations of the roots, loc. oit., footnote 2, p. 449. 
10 
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(8.31) ` @(r + 1) —@(r), e(—1)—e(), 


since J(r) is a modular function. The function (8.22) has the transforma- 
tion formula 1° 


(8. 32) Set) exp { 2ni($+ 2) Lan, a (=)= e"s), 


3 
and »(r)?" the formula 
(8.88) afr Dep (ai) ae)", A(Z inner 
From (8.23), (1.12), (8.31), (8.32), and (8. 33) we then find 
| t 

(8. 41) Y(r + 1) — exp | ri(at +42) ban, 
but from (8.11) we have 

r s, t 

TER UE Ve dec ou À 


an integer, and hence (8.41) reduces to 


(8.49) ` U(r 1) = P(r). 

Also, from (8. 23), (1.11), (8.81), (8.32), and (8.33), we get 
(8. 43) | Y (=) = ee (7), 

where | | 

(8.44) ` & == e(0,— 1,1,0). ° 


Now the value of e is known ™ to be 


(8.45) o= esp À 2ri (2e?) ; 
and therefore we have, using (8.11), 


EE "#5 — exp ri (—3a— t À) } — exp{?ri(— 3P — Bu + 3Z + t)} = 1, 


and hence (8. 43) reduces to 


(846) v (=)=. 


19 Loo. oit., footnote 2, p. 449, formulas (8.63) and (8.64). 
+ Loo, vit., footnote 2, p. 445, formula (6.7). The proof used there applies without 
change to our case in which Fr) has poles in the fundamental region. 
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Formulas (8.42) and (8.46) show that ¥(r) is a modular function. 
Since it has no singularities it is a constant, and, by (8.23), we have 


(8.5) F(r) == ui Sd 2 
Z 
_ Ent) TI (J(r) evel) II (I(r) —J(pr)) . 


x (VE) (Gy 


Any form F(r) that satisfies the conditions of Theorem 1 can then be ex- 
pressed in: the form (8.5). Conversely, it is clear that any form (8.5) 
satisfies these conditions provided we have r > 0 and the c; and px distinct 
from the vertices of the fundamental region. The numbers y and a associated 
with this form may be obtained from (8.11). Since p is an integer and 
0Sa < 1 we have | 


r s t 
(8. 61) p——[-L+r-5-i i , 
and 
r 8 t [ r S t 
(8.62) a= FLE Men ae du HP z]: 


We can express the roots (8.22) in terms of g:(1,7), gs(1,7), and n(r) 
and then write (8.5) in the form 


(8.7) Fr) == Ky(r) 77-2686 E : 
X ga(1, 7) *gs(1, U (F(x) —F (o3))* O =l 


where the constant K has changed its value. 


9. The discussion of the e(a,6,c,d), given in the paper +° referred to 
above, for forms analytic in the upper half-plane, applies directly to our case 
also. Hence we can immediately state the | 


THEOREM 3. The modular form, (8.7), of dimension r, satisfies the 
transformation equations 





om P(SEE) —e(a,d,.8)(—i(or + DFE, 6 >0, 
and 


(9. 12) F(t +1) = eF (7), 


12 Loo. cit., footnote 2, section 9. 
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with the multiplier 


a+ d 
12c 





1 


(9.21) (a, b,c, d) — exp } Pi (r NO 


and with the a of (8.62), 


(9.22) 


Ho DE z(t [“]- 


+8 (b(a +d) + a(0— e) +e) 
$i G+2@—0 adto) ) }; 


D 


where 


Conversely, all modular forms satisfying the conditions of Theorem 1 ‘are 


contained in (8.7). 


1 


We can apply Theorem 1 to the form (8.7) and evaluate the Axy(n) by 


means of (9.21) to obtain 
THEOREM 4. If r> 


0, the modular form (8.7), in which the a and oj 
| 


“are distinct from the vertices of the fundamental region, has the expansion 


(9.31) fy) =$ Sart 


X Tru (= 


P(r) == etrtorf( oer), | 
SE Là 


VESER) — R) | 





“where a and p are given by (8.61) and (8.62); the a-y are the coefficients ` 
of er in the expansion of e?7tarF (7), valid for S(r) sufficiently large; 


R(co) is the sum of the residues of f(z)/(x— y) at the poles of f(x) within 


the unit circle, as described in section 6; and where 


(9.82) Arb(n) = E or(h, k)é (h, k)'é(h, k)t 
és 


with 3 
or (hk) — oxp{2rir: 8(h, k)}, 
(9.88) &(h,k) = exp ns (D in 


f 2i 


(A, k) = exp i 78. 


| 
1 
t 
| 


Xen] (ae HIW Get P in |, 


= nt) | ‘ 


exp { xi (— “OD ae +1) }, 
(h —h’) (+ tomy +} whe 


f 


18 These are the same as the functions (9.54) of the paper referred to in footnote 
© 2. It should be noted that there is an error in the expression for å (h, k) in that) paper. 
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10. The modular form 


(10. 11) F,(r) = -8r(J(r)—J(c))+ 


1 
1738 77) 


affords an interesting example. We limit r to be positive and o to be distinct 
from the vertices of the fundamental region. Then F,(r) is of the form 
(8.7) with 

(10. 12) s=i=Z=0, PEN 


and (8.61) and (8.62) yield the values 


(10. 13) leg ge a+], a—— 2 41-[—5 41]. 


Corresponding to (1. 3) we have the expansion 


(10. 14) er tar J", (7) a qgr/12-1-u x gr (r/13) (1 + g + Ig + ny “yr 
X (22 -+ (744 — 17287 (o)) + 1968847 + + : -)* 
= rie (z — (144 — 2r — 17287 (o) je + > -) 
= rt — (744 — 2r — 17287 (o) are +: 


and, therefore, the values 

(10. 15) ul. yy = 128I (0) — T44 + Br, - 
If 0 << 712 we have p= 0 and hence, by Theorem 4, 

(10.2) P(x) = etant (er), fly) = R( 0). 


The value of R(c) may be found by Theorem 2. The residue of F,(r) at 
the point r= 18 ` 


(10.3) R= ggg Wo) I") 


where J’'(e) is the derivative of J(r) at r — o. The value of 
(— q P, p,—PE ++) in (7.4) may be replaced by its value (9.21) and 
we then have 


(10.41) R(œ) = SS glo) EJ (a) errtt-ade (erte — y) 


—È À eoit menaren — y) 7), 


(PQ =1 
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with the a of (10.13) and where we have used the abbreviations ( 5) and 


co fpo arf (sap EE OEE NA) | 


With this value for RS) we then have the desired spies of P,(r) in. 
(10.2). ` ' 

| A formula concerning modular forms of negative dimension can be 
` obtained from our expansions of F,(r). For X(r) > S(o) we expand each 
‘summand of) (10.41) in a geometric series and then have, using (10. 2) 


P(r) ae — no) I (o) semen (otter S D gtrin(ro) 


-$ È te, q)" (—i( pe + 9) pripa $ pre, 


(po)=1 . 

"On the other hand, by (10. 14) with the values (10.18), we also have 
` Fr) = otar (1 — (744 — Dr — 1928 (0) or 4 , 
‘ valid for 3(7) > $(o). A comparison of the leading terms now yields the 
‘result - | | i Lu. 
(10.51) 1%284(o)" (o) 

a — À = tp, g)" (—ilpr + g) yira, 


Fer a 





for 0 <r 12 and o inside the fundamental region. By analytic continua- 
tion the restriction on o can be removed and we have (10. 51) for X(c) > 0.. 

For the case r= 12 this formula simplifies considerably. Making ‘use 
of the result ** i 








(10. 52) isk k) ait (mod 1), 
| a . pp +1 4 
for hh’ =—1 (mod k), we have, a I aR (mod p), ; 
1pe(— g p) = BEEN (mod 1), 
and therefore ` . 
(10.58) . 1. £(p,q)? — 


i 
: “H. Rademacher, “ Zur Theorie der Modulfunktionen,” Orelle, vol. 167 (1981), 
«pp. 312- 336, in particular p. 321, formula (2.51). 
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t ` 


Then (10.51) reduces to the expression 
1 | : v o 
17889 (0) "J" (o) = — 2at {1 + 2 2 (pot gy, 


Foa =t 


which can easily be reduced to the more usual form 


; 2-1. 
V128y(0) J (= Gaon A Da (po + q)7*. 


‘Returning to our original te (10. 11), , We now consider thè case 
12 <r & 24. We now have p= 1 and hence, by Theorem 4 with the values 
(10.13) and (10.15), we find | 


F, (t) = = erriarf, (g2rir), 
(10.61) fr(y) = =% St pA Qa" 





x In (# ca) a) 

where A(n) has the value . ` | 

(10.62) AQ(n) — Er “exp { — i — Zi w+ (n+ 2)h) L. 
Gee | 


The value of R( œ) may again be found by Theorem 2. It is found to have 
the value (10.41) with the added term —y, 

For r= 24 the expression for F,(r) again simian, In this case we 
have a == 0 and hence, making use of (10.53), we find 


| ‘ CE RS Ar — | 
(10.71) Fur) = ?r Dd 72 ALO (n) ni 6913 (Eva) e2tinr 1. garir 
i= Kno oe 
i — Fs lo)’ (0) À (1 — etx t(r-))-2 


HS È (po+ gyi erie) l, 
pel q=—— 00 


: (2,0)=1 
where, by (10.62) and (10.52), | 
(0.2) < A@(n) = BS exp) — 2 (nh +0") l. 
D eee 1 


Tt is of interest to ask whether the two parts of our expansions are each 
separately modular forms. We shall not answer this question completely but 
shall merely give some indications as to the answer by considering the 
particular. example (10.71). We first write Fa4(7) = G(r) + H(r) where 
G(r) is the part of (10.71) which is a power series in 6***7 and H(z) is the 


” 
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remainder. The series G(r) converges for all r in the upper half-plane so 
G(r) has no finite poles. If G(r) is a modular form of positive dimension 
belonging to the full group then it can be seen from its expansion in 
(10.71) that it is of dimension 24. The characteristic constants of G(r) 
then have the values r = 24, a = 0, p = 1, P= 0, 720,520 andt=0. 
However these values contradict equation (8.11) and therefore G(r) is not 
a modular form of positive dimension belonging to the full group. 

We now turn to the function H(r). From its value in (10.71) we have 
H(r-+1) = H(r) and hence a—0. Also H(r) remains finite as r — to 
and therefore we have #0. Since P = 1, 720,520 and t= 0 we see, 
by (8.11), that if H(r) is a modular form of positive dimension 7 belonging 
to the full group, then we have u = 0, Z = 0 and 

os r s t 
(10.81) > Leii. 
Then by Theorem 3; H(r) is one of the forms 


(10.82) (r+) = Kn(r) (Jr) —J(o)) (VI (1) —1) (YI (4) ) 
where K may depend on o. From our definitions of the functions we now have 


1 
1728 


which we combine with (10.81) to obtain 





H(r) = a(r)“ (J (r) —J(a))*— G(r), 





(10. 83) ue) — G(r) (I(r) —I(o))* = Kal) (VI) = 1) (WI). 


This equation must be valid for r 4 o and ø within the fundamental region. 

By continuation it then holds for all + and o in the upper half-plane. From | 
(10.83) we see that K is a modular form of dimension O in the variable ø. 

However if we set r — o in (10. 82) we see that K is of dimension (24—r). 

Therefore we have r= 24. This contradicts the condition (10.81) and we 

therefore see that H (r) is not a modular form of positive dimension belonging 

to the full group. 


UNIVERSITY OF PENNSYLVANIA, 
4 
PHILADELPHIA, PENNSYLVANIA, 
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35 Loo, ott., footnote 2, p. 458, footnote 18. 


THE EXPONENTIAL REPRESENTATION OF AUTOMORPHS OF A 
' SYMMETRIC OR HERMITIAN MATRIX.* 


By Jonny WILLIAMSON. 


In a previous paper : the exponential representation of canonical matrices 
was studied. These canonical matrices are automorphs of the normal form of ` 
a skew symmetric matrix. The corresponding problem, when the skew sym- 
metric matrix is replaced by a symmetric or hermitian matrix, is considered 
here. Three cases are treated: when the matrix is symmetric over the complex 
field, when the matrix is hermitian, and when the matrix is symmetric over the 
real field. Of these three the last is by far the most interesting. The methods 
employed are similar to those of the paper quoted above and, as in many cases 
the proofs are practically identical, they will not always be given in detail. 


1. We shall consider square matrices over the real or the complex field 
and, if H is such a matrix, we shall mean by H* eithér the transposed or else 
the conjugate transposed of H. Let H be non-singular and let H == H*, so 
that H is either symmetric or hermitian. If G—— G* and 


(1) C = exp(HG@), 
then 
(2) ; CHC* = H ; 


for CHC* = exp(HG)H exp(HG)*— exp(HG)H exp(G*H*) 
= exp(HG)H exp(— GH) = exp( HG) exp(— HG) H = H. 


Since the determinant of a matrix is the product of the latent roots of the 
matrix, . 
| exp(HG)| = exp (trace of HG). 
If 

H = (his) and G = (943), (i, j == 1,2, -:,n), 


n n iad n a 
t = trace (HG) =D D hiugai=— È D huju = — i. 
3. 4=1 j=l 4=1 j=l 


Therefore, when * denotes conjugate transposed, the trace of HG is a pure 
imaginary number and the determinant of exp(HG) has absolute value one. 


* Received April 20, 1939. 

1 John Williamson, “ The exponential representation of canonical matrices,’ Ameri- 
can Journal of Mathematics, vol. 61 (1939), pp. 897-911. This paper will be referred 
to as I. 5 


153 


154 | JOHN WILLIAMSON. 
| ; ' l 
_ In the other case, when * denotes transposed, t==} and therefore ¢ is zero. 
Consequently, 
(3) 1 | exp(H@)| = exp(0) = +1. 


We shall determine here necessary and sufficient conditions, that a matrix C, 
which satisfies (2), shall have an exponential representation of the form (1). 
If a matrix C satisfies (2), since H is non-singular, | C | | C |* = 1.; There- 
fore the cote value of |C | is unity and, when * denotes transposed, 
| C |= -t 1. As a consequence of (3), when H is symmetric, we must restrict 
our consideration to those matrices C, whose determinants have the value 
` plus one. | | | f í 
It A = HG, ae | i 
| AH — HGH = — HA* — Hf(A*), 


where f(x) =—£. Hence À is normal * with respect to H. | 
Let P bé a non-singular matrix and let . if 


KON PHP* = H, and POP — C.. i 


Then, if CHO* = H, 0,H,0*,—H,. Similarly, if C = exp A = exp(HG), 
Cı = exp A, + exp (H,G), where G, = — @*,. _ 
Since | 
(5) - Ay= PAP, 


` Ái is normal vith respect to H,. Further the matrix C is also normal with 
respect to H and the matrix C, normal with respect to H,, where the defining 
polynomial is f(z) =a. For brevity, when equations (4) are satisfied for 
some matrix P, we shall write (H, C) ~ (Hı, 01) and, when (4) and (5) are 
satisfied, (H, 4) = (Hi; 4). Since both the symbols ~ and = have all the. 
properties of an equivalence relation we have, 


Resvrr (a). A matrix C, which satisfies (2), has an exponential repre- 
sentation of the form (1),1f, and only if, there exists a pair (H,,C,) ~ (H,,C) 
and a pair (H;, 41) = (H, A), where Cy = exp A1. í 
. 0 . Q 0 i 
Since canonical pairs (Hı, 4ı) ~ (H, A) and canonical pairs (Hj, Qi) 
~ (H,C) are kuowa, it is only necessary” to compare the matrices exp À; 
with the known matrices C.. | . i i 
: f a l 
-3 John Williamson, “Matrices normal with respect to an hermitian matrix,” 
American Journal of Afathematios, vol. 60 (April, 1938), pp. 355-373; “Normal matrices 
over an arbitrary ffleld of characteristic zero,” American Journal of Mathematios, 
vol. 61 (April, 1939). These papers will be referred to as II and III respectively. 
e Cf. I, page a ; 


| 
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2. Hermitian case. Let H be a non-singular hermitian matrix over 
the complex field and let H* denote the conjugate transposed of H. Since C is 
normal with respect to H and f(r) = r4, the canonical forms (H;,C,) ~ (H, C) 
can easily be written down as particular cases from the general results of IT 
or III. The actual forms of H, and O, are, however, not necessary. It is 
sufficient for our purposes, that the matrices H, and C, are similarly par- 
titioned diagonal block matrices. These blocks depend, though not always 
uniquely, on the elementary divisors of C — AFE or, as we shall say, on the 
elementary divisors of G These elementary divisors are of two distinct types ; * 


Type (i): the pair (A — a)", (A—a™)*, where | a | 1, and 
Type (ii): (A — a)", where |a| — 1. 


Since the matrices of the canonical pair are similarly partitioned diagonal 
block matrices, the general case reduces to the consideration of two particular 
cases; that, in which C has a single pair of elementary divisors of type (i), 
and that, in which C has a single elementary divisor of type (ii). In the 
first of these cases the canonical pair (H,,01) ~ (H,C) is unique; in the 
second there are two non- oe pairs (pX,,C,), where C, and X, are 
unique but p= +1. 

The matrices of the ER pair (H, Ai) = = (H, A) are again similarly 
partitioned diagonal block matrices depending ‘on the elementary eee of A. 
These elementary divisors are of two distinct types; 


Type (a): the pair a— p), (A+ p)", where py — pi, and 
Type (B): (à— p), where p=— 5. 


' If A bas the single pair of elementary divisors of type (a), A, and H, are 
unique. The matrix exp A, has the single pair of elementary divisors (A— a)’, 
(A— a)", where a == exp p. Since p4— jp, | a| 1. Further, if | a | 1, 
we can always determine p= loga where p-4—p.. Consequently every 
matrix C with the single pair of elementary divisors of type (i) has an 
exponential representation of the form (1). If A has the single elementary 
divisor of type (8), the canonical pair (H;, À:) = (H, A) is not unique. The 
matrix A, is unique but H;—pY¥:1, where.p = + 1 and YF, is unique. The 
matrix exp A, has the single elementary divisor (A— a)", where @== exp p. | 
Since p == — ĵ, |a| = 1. Conversely, if | a | = 1, pete satisfies a Ponce 


“II, page 360; John Williamson, “ Quasi-unitary matrices,” Brake Mathematical 
Journal, vol. 3 (December, 1937), no. 4, page 414. | 
8 The matrix Y, may be taken to be the same as the matrix Z, in ‘the canonical 
pair H, O, of type (ii). : : 
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Since there are two distinct canonical pairs Y,, 41 and — Y, Ai, we get two 
distinct canonical pairs + F,, exp A; for the pair H, C, where C has a single 
elementary divisor of type (ii). Therefore we have the theorem: 


Turorpem 1. If C ts a conjunctive automorph of the non-singular her- 
milian matric H, C = exp(HG@), where Q ts antt-hermittan® 


If H = F, the identity matrix, C is unitary and we obtain the well known 
corollary : 


CororLAry 1. Every unitary matrix U may be written in the form 
U = exp G, where G ts anti-hermitian." 


Since, if H is hermitian, +H is anti-hermitian we also have the corollary: 


COROLLARY 2. If O ts a conjunctive automorph of the non-singular antt- 
hermitian matric H, C == exp HG, where G is hermitian. 


3. Complex field. Let be a non-singular symmetric matrix over the 
complex field and let * denote transposed. The canonical pairs Hy, Cy and 
H;, Ax are now uniquely determined by the elementary divisors of C and A 
respectively. As in § 2 the general case of a matrix C can be deduced from 
two particular cases; that, in which C has a single pair of elementary divisors 
(A— a)", (à— at)", and that, in which C has a single elementary divisor 
(A+ 1). The two particular cases, from which the general case of a 
matrix A may be deduced, are those, in which A has a single pair of elementary 
divisors (A— p)”, (À + p)” and a single elementary divisor A***. It follows 
immediately that every matrix C with a single pair of elementary divisors 
(A—a)’, oS a-')? does have an exponential representation of the form (1), 
- as does a matrix C with the single elementary divisor (A—1)%##*, On the 

other hand a matrix C with a single elementary divisor (À + 1)*** does not 
have ® an exponential representation of the form (1). We therefore have the 
theorem, i 


THEOREM 2. Let C be an automorph of the non-singular symmetric 
matrix H over the complex field. The matric C has an exponential repre- 
sentation of the form C = exp HG, with a skew symmetric G, if, and only if, 


“This answers for the case of finite matrices a question raised by Aurel Wintner, 
“Über die automorphen Transformationen beschränkter nicht-singuliirer hermitescher 
Formen,” Mathematische Zeitschrift, vol. 39 (1933), page 263. 

‘ 7 Aurel Wintner, “Spektraltheorie der unendlichen Matrizen,” (Leipzig, 1929), 
page 217. 
` * This is obvious directly, since | 0 | =—1. 
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no elementary divisor (A +1)? occurs an odd number of times among the 
elementary divisors of C. ` 


Let C have the single elementary divisor (À + 1)", where r= 2% + 1. 
Let E,, Uy ‘and T, be respectively the unit matrix, the auxiliary unit matrix 
and the counter unit matrix of order r. Then, if? 


(6) X, == [1,— 1,- s=], 
we may take H, == X, and C, == T,, where 


(7) | T, =— exp U,. | 
If B, is the diagonal block matrix given by B,— [bEr, 1, b7 EK], 
1 B,X,B*, = X, 

and therefore, if 

(8) D, = B,T,;, | 

D, is an automorph of X,. The elementary divisors of D, are obviously 

(A+ 6)‘, (A + b7)" and (A+ 1). Further - 
. (9) | Lim D; = Tr. 


If C has only the elementary divine a+ 1)", where r; = 2h; + 1, 
i= 1,2,: > -,8, we may take 


(10) ` C= [Try Tr * * *,Ty,] and H,= [Zro Les * *, Xr,], 
where T, and X, are defined by (6) and (7). If. 
(11) D (Dy, Dro: + +, Dr,], where D, is defined by (8), 


then, as a consequence of (9), 
Lim D = Q. 
b1 


The elementary divisors of D are the s pairs (A + 6)", (A + bt)": and 
the s elementary divisors (A-+-1). If s is even, D has, as a consequence of 
Theorem 2, an exponential representation exp (HG). Now, if C is a proper 
automorph of H, so that | C | is +1, and C does not have an exponential 
representation of the form (1), the matrices H,, O, of the canonical pair must 
include as submatrices the matrices given by (10), where s is even. If, in C:, 
this submatrix be replaced by the matrix D in (11), the resulting matrix has 
an exponential representation exp(H,G,) and its limit as b — 1 is Cı. There- 
fore we naye the theorem : 


° See II, page 356, or II, page 337, or I, page 903, footnote 13. 
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THEOREM 3. If C is a proper automorph of the non-singular symmetric 
matrix H over the complex field, C is either of the form exp(HG), where G is 
skew symmetric or is the limit of automorphs, which are. 


It is of course obvious that, if |0| =—1, 0# Lim D, where ¥ 
D = exp( HG). 

If C is not representable in the form (1), we may write C, = [C;, C3], 
where all the elementary divisors of O, are of the form (à + 1)” while none 
of C, are of that form. If Æ; is the unit matrix of the same order as Ci, the 
matrix J == [E.,— E] is an automorph of H, and is of period two. The 
matrix JC;:is an automorph of H, and, since no latent root of JC, has the 
value minus one, JC, == exp(H,G,). Accordingly we have the theorem ; 


THEOREM 4. If O is an automorph of H, which does not have an 
exponential representation of the form (1), there exists an automorph D of H, 
such that DC does have. such a representation. The automorph D +s of 
period two. | ; 


If C is proper, the number of elementary divisors of D of the form À + 1 
is always even and we therefore have 


COROLLARY 1. If C is proper, the matric D of Theorem 4 has an 


exponential representation of the form (1). 


4. The real field. Let H be a non-singular real symmetric matrix and 
let * denote transposed. The general canonical pair (Hi,C,) ~ (H, C) can 
again be deduced from that of several simple types of the matrix C. The 
simple matrix C has elementary divisors of the following types: 


Type (i) : a single pair of real elementary divisors (Aa); (A—at}r. 
Type (ii): the four elementary divisors 
(A— a)", (Aa), (A~a4), (a) Jall, asā. 
Type Gin: a single pair of elementary divisors 

(Aa), (A—4)°; | a | = 1, asa. 
Type (iv): the single elementary divisor (A = 1). 


In types (i) and (ii) H, and C, are unique but in types (iii) and (iv), 
while C, is unique, H, == pX,, where p = + 1 and X, is unique.” - 
The general canonical pair (H, 41) = (H, A) can also be deduced from 


1° II, page 371, or III, page 361. 
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that of several simple types of the matrix A. The simple matrix À has 
elementary divisors of the following types: 


Type (a): a single pair of real elementary divisors 


(A—p)*, (A+ p)*. 
Type (B): the four elementary divisors 
(A—p), Ap), A+ 2) AH) pp, PAD. 
Type (y): a single pair of elementary divisors 
(A—p)"; Ap)"; p=. 
Type (8): the single elementary divisor A. 


In types (a) and (8) the matrices H, and À, are unique, while in types 
(y) and (8), 4, is unique but H, = p¥,, where p= + 1. The canonical 
forms are of course all real. 

If A is of type (a), expA has the single pair of elementary divisors 
(A— a)", (A — at)", where, since p is real, a == exp p is positive. If A is 
of type (8), the matrix exp A has the four elementary divisors (A—a)’, 
(A— a)", (A—at)r, (À — 41)", where a = exp p. Since the real part of p 
is not zero the absolute value of-a is not one. The number a is real if, and 
only if, the imaginary part of p is an integral multiple of +, in which case 
a= à and a% = ã*. Therefore, if A is of type (8), exp À is a matrix C of 
type (ii) or else a matrix C with exactly two equal pairs of elementary divisors 
of type (i). In this last case it should be noted that a may either be positive 
or negative." . | 

If A is of type (y), the matrix exp A has the single pair of elementary 
divisors (A— a)", (A— a7)", where a = exp p and, since p = — 9, | a | = 1. 
Further, if | a | = 1, and a is not real, we can always determine p to satisfy 
both of the equations a = exp p and p——#. Since H,==p¥1, p= +1, 
the matrices H, exp A have two distinct canonical forms. If the imaginary 
part of p is an integral multiple of r, a = +1 and exp A has the two ele- 
mentary divisors (A + 1)”, (À + 1)”. In this last case, where r is odd, exp A 
is a matrix C with only two equal elementary divisors, both of type (iv). 
It is important to notice that the two p’s associated with the elementary divisors 
must be equal.?? 

Finally, if A is of type (8), exp A is a matrix C of type (iv), with the 


Cf. I, page 900. 
12 Cf. I, pages 907 and 908. 
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| single elementary divisor (A— 1). Once again there are two distinct 


` . canonical: pairs = (H, exp A). 


The simple matrices C, which do not have an exponential representation 
are therefore, 


(a) a matrix O with the single pair of pi nero Dir (Aa), | 
(A— a)f, where a is negative and distinct from minus one; 


_ (b) g matrix C with the single elementary divisor (à + 1); 


(c) a matris Q with the pair of elementary divisors (A + 1), where 
one of the associated p’s has the value plus one and the other the value 
minus one, 

An illustration of (c) is the following. | 
. If c(T; =) and H == G E: C and H are in canonical form. The 
| elementary divisors of C are (A+ 1) and (A+ 1), and the first associated p 
has the value + 1 and the other the value —1. Let @ be the teal skew 
symmetric! matrix G = (3 2). Then exp(HG) — exp y 2) and this 
cannot be. equal to C. On the other hand, if H= 5 : , 80 that the two 


associated p’s have both the same value plus one, C = exp i 1) 0 ae 
— f 


In this last simple example a change in the associated p’s altered the 
value of H. That this does not always happen is shown by the following 
example. [Let C = [1,—1,—1].and H = [—1,— 1,1]. The elementary 
divisors of C are (A—1), (A+ 1), (à+ 1). The first p associated with 
À + 1 has the value — 1 and the other the value + 1. Therefore there is no 
‘real skew symmetric matrix @, such that C.—= exp(HG). On the other hand, 
the matrix H has the same elementary divisors as C.and [—1,— 1, 1 


= EDS wire G= LC j al o]. | a 


Comparison of the canonical forms ‘clearly shows that case (o ia, 
limiting case of case (a) as a tends to minus one. 

Since a matrix C, whose only elementary divisors are two Li pairs 
(A— a)", (A — at)", always does have a real exponential representation of 
the form (2), when a54—1, we have the theorem ; | i 


THEOREM 6: Let C be a proper real automorph of the real non- ‘singular 
. symmetric matriz H. Then the matrix C has a real exponential representation 
of the form C = exp(HG), with a skew-symmetric G, if and only if, every 
real elementary divisor of the form (A—a)* where a is Bega ves occurs an 


E 





1 


3 
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even number of times among the elementary divisors of C and, when a == — 1 
and r is odd, the number of positive p’s associated with (A— a)" ts even. 
We next determine those matrices ( which can be obtained as limiting 
Cases of matrices with an exponential representation. Let C be of type (i), 
where a is negative, and r — 2k is even. Then C, is of the form 


A, 0 0 E, 
r= (4 Pe) and (sr dr 
where 4, = 4E, + Ur. If B, = À,— b?V*,, where V, is obtained from U, by - 


replacing the units in the even numbered rows by zero, and D, = [B,, (B*,)“], 
then Lim B, == A, and Lim D,<T,. The matrix D, is obviously an automorph 
b-0 bo0 


of H, and, since all the latent roots of D, are complex, D, has a representation 

of the form exp(H,G,), as long as b is different from 0. If r— 2% +1, 

and F, = Gs A where e is the column vector of dimension 2k defined by 

e* = (0,0,0,- - -, 1), Lim F, == 4,. The latent roots of F, are all complex, 
d0 


except for one, which has the value a. If K, = [F,, (F*,)*], K, is an auto- 
morph of H,, and has elementary divisors which are all complex except for the 
two simple ones (A—a) and (A— a). If C has only the s pairs of ele- 
mentary divisors (Aà — a)", (A—at}"t, i— 1,2, + -,s, where r; = 2k; + 1, 
then Ci [Tr Tro: T]; and 0, = Lim K, where K = [Krn Kry Kn]. 


The elementary divisors of K are all complex except for s pairs of simple 
elementary divisors (A—a), (A—a*). If is even, K has an exponential 
representation of the form K = exp(H,G,). If s is odd, K does not have such 
a representation and K is not the limit of a matrix which does; for otherwise, 
a matrix C with the single pair of elementary divisors (A — a) (A— a) would 
be the limit of a matrix with an exponential representation, and this is im- 
possible. Consequently, if C is a matrix with only elementary divisors of the 
form (A— a)’, (ÀA— at)", where a is negative, C is the limit of matrices of 
the form exp(H@), if, and only if, the total number of pairs of elementary 
divisors for which r is odd, is even. 

We now consider matrices C with elementary divisors of the form (A +1). 
If C is a proper automorph of H, with only s elementary divisors of the form 
(A+ 1)", by the same argument that was used in the complex case, (§ 3), 
C = Lim D, where all the elementary divisors of D are complex except for s 
simple elementary divisors (À + 1). With each elementary divisor of C of 
the form (A + 1)?**1, there is, in the canonical form, associated a p== + 1. 
Hence with C there is associated a set of sps. In a canonical form for D, H, 


11 


l 
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with the s elementary divisors À + 1 of D is associated, the same set of sp’s. 
Since C is a proper automorph of H, s is even and D has'an exponential repre- 
sentation, if the number of positive p’s in the set is even. If the number of 
positive p’s is odd D does not have such a representation nor is D the limit rs 


— 1 0 ` 
0 = would be a 


limit of matrices of the form ( Pa j On combining these results 
| expd 0 
we have the theorem: 


of a matrix which does. For otherwise, the matrix ( 


Taxorem 6. Let C be a proper real automorph of| the real symmetric 
matrix H. Then C has a real exponential representation of the form (1) 
or is the limit of real automorphs which do, if, and only if, when a is a negative 
latent root of C, the total number of elementary divisors of C of the form 
(A — a) *** is even and, when a = — 1, the total number of positive p's asso- 


ciated with these elementary divisors is even. 
Finally, by the same proof as that of Theorem 4, we have 


THEOREM 7. If O is a real automorph of the real symmetric matrix H 
and C does not have an exponential representation of the form (1), there 
exists a real-automorph D of H, such that DC does have such a representation. 
The automorph D is of period two. 


If © is improper, D is improper and D cannot have an exponential repre- 
sentation. But, even when C is proper, it is not possible in\every case to find 
a D, which does have an exponential representation. This is best shown by a 
simple example. Let: 


i — 1 0 1 0 

(T5 9) ma (1) 
If D is to ob proper and of period two, D must be + ©. Therefore D = C and 
D does not have an exponential representation. On considering the proof of 
Theorem 4, we see that D has an exponential representation if, and only if, 
the number of positive p’s associated with Cs is even. If H is definite, all ps 
must have the same value and in this case D always has a representation et 
the form (1). 





5. Lorentzian matrices. We now consider in more detail automorphs 
of the non-singular symmetric matrix H of order n and index n— 1. If C 
is an automorph of H, the elementary divisors of C must alk be linear. At 
most one pair of real elementary divisors (A— a), (A—a™), where | a | 341, 
can appear: among the elementary divisors of C. If no such pair occurs, 
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C must have one elementary divisor (à + 1), with which is associated a 
p==— 1. The ps associated with all other elementary divisors of C all have 
the value + 1. We therefore have 


Txeorex 8. Let H be a real non-singular symmetric matris of order n 
and indez n—1. If C is a real proper automorph of H, O = exp(HG) with 
a skew-symmetric G, unless C has a pair of elementary divisors (À — a), 
(A— at), where a is negative, and, when a — — 1, the associated p’s have 
different values. 


In other words C has an exponential representation of the form (1) 
unless C has a pair of elementary divisors (A—a), (A—a™), where a is 
negative and | a | £1 or is the limit, as a tends to — 1, of an automorph, 
which has such a pair of elementary divisors. From theorem (6) we deduce 
the Corollary ; 


COROLLARY 1. No automorph, which does not have an exponential repre- 
sentation, ts the limit of automorphs, which do. 


If n is even and (H, C) ~ (H, Cı), we may take 


(12) HS H 5) Bas | and Ci [Ca Cs]. 


The matrix C, is a diagonal block matrix with blocks of the form +1 or 
( à B ) , Where a? + 8° =— 1. The matrix C, is a diagonal matrix, 
—B« 


== (6 aja) 


If a54—1, each elementary divisor À + 1 of C is associated with a p, 
which has the value plus one. Therefore C has an exponential representation 
of the form (1), unless a is negative. When a is negative, each elementary 
divisor À + 1 of —C is associated with a p, which has the value plus one. 
Accordingly — € does have an exponential representation and we have the 
theorem : 


THEOREM 9. If H is a real non-singular symmetric matrix of even order 
n and index n— 1, and, tf C is a proper automorph of H, at least one of the 
matrices C or — C has an exponential representation of the form (1). 


Tt is of course obvious that no such theorem is true if n is odd. 
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| In conclusion * we exhibit the canonical forms H,, C, and the exponential 
representations of C,, when n = 4 and |C; | +1. If 


i 


- 0 1 
H,=[(? aa] 
C; is of a single type; 


i 


a 0 0 0 

: de 0 1/a 0 0 

I 0 0 cos sing 
0 0 sing cosé 


The matrix C, == exp(H,G,), where 


0 —loga 0 0 
log a 0 0 0 

G = 0 0 4 |? when a is positive. 
0 0 —# 0 


l 
As remarked earlier, if a is negative, C, does not have such an exponential 
representation. l 
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19 Of. C. C. Macduffee, The Theory of Matrices (Berlin, 1933)! page 68; F. D. 
Murnaghan,: “On the representation of a Lorentz transformation means of two- 
rowed matrices,” American Mathematical Monthly, vol. 38 (1931), pp. 504-511. 
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` UNBOUNDED CONVEX POINT SETS.* 
_ By J. J. Sroxes. 


1. Introduction. We are concerned here with ‘the properties of un- 
bounded convex point sets § in three-dimensional Euclidean space H*, though 
many of the theorems and'proofs are obviously valid in Æ”. In addition to 
convexity we make the following assumptions on the sets S: a) S is not the 
entire E°, so that boundary points of 9 exist, b) S is assumed to possess inner 
points in H*, c) S is a closed set. Frequent use will be made of the following 
well-known property of all convex sets (bounded or unbounded): there exists 
on every boundary point-of § at least one support plane (German: Stiitzebene) 
of S, that is, a plane containing the boundary point and having the property 
that § lies entirely in one of the two closed half-spaces bounded by the plane.* - 

A special case of unbounded convex sets, the convex cone, is treated in 
some detail because of its importance in the discussion of the general sets 9. 
- By.a convex cone we mean a closed convex set C consisting of infinite half- 
rays all emanating from the same point O, the vertex of the cone. However, 
in dealing with the cones C it is not convenient to assume that C must possess 
inner points in Æ or-even in #?, but we explicitly omit the case in which C is 
the entire Æ. It is hence clear that the vertex O of C is always a boundary — 
point of C relative to Æ. The terms inner and boundary point are used, 
however, in dealing with the cones Q, in relation to the dimensionality of C, 
even though C is always considered as laid in Æ%. This terminology is free ` 
of ambiguity since C is convex.? | 

With every set 9 there is associated a unique cone O, the characteristic 
cone of S, defined as follows: Through any point p C S all infinite half-rays 
rC § are drawn. The resulting point set is shown to be not empty. and to be 
closed and convex, ie. it is a cone C. Two such cones erected on different 
points pı C § are shown to differ only by a translation. The dhietgoteristie 
cone plays a central rôle in the discussion of the sets 8. 

Most, though not all, of the theorems on convex cones as well as the idea 
of the characteristic cone are contained in the paper of Steinitz: “ Bedingt ` 
konvergente Reihen und konvexe Systeme,” Journ. f. d. reine u. angew. Math., 


* Received March 8, 1939. | 
1 Bonnesen-Fenchel, Theorie der konvewen Körper (1934), p. 4. The proof given 
here applies to bounded sets, but could be extended easily to unbounded sets. ` 
` * Bonnesen-Fenchel; loo. oit., p. 2. 
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| , ENE ; 
Bd. 148/(1913) ; Bd. 144 (1914) ; Bd. 146 (1915). The theorems, of Steinitz 
are formulated in a terminology which is not suited to our purposes, and since 
his proofs can be replaced by others quite concise, we shall includé proofs of 


these ig ER in order to preserve the continuity of the discussion. 
, ’ It is well known that the set of boundary points of a bounded convex set 


with inner points in H® is homeomorphic with the surface of the sphere. The 
corresponding problem for unbounded sets S is more complex, as the following 





. simple examples indicate: 1) The space between and on'a pair; of parallel 


planes— the set of boundary points is not connected, 2) a solid right circular. 
| cylinder with an entire straight line as axis—the set of boundary points forms 
an open cylinder, 3) a half-space—the set of boundary points is a plane. One 
of our main purposes will be to show that these. examples exhaust all posst- 
bilities jin so far as the topological structure of the boundary points of 8 ts 
concerned. 7 | 
‘The spherical image of a convex set is defined in the usual manner: The 

‘ outward normals on all support planes of S are displaced parallel to themselves . 
and erected at a point O; the points of intersection of such normals with the © 
‘unit sphere having O as center constitute a set Z, the spherical/image of 8. 
In theicase of bounded convex sets, J is the entire surface of the isphere. We 
prove that J of any unbounded set S lies on a closed heïnisphere, that every 
inner point of the spherical image J, of the characteristic cone; C of S is a 
point of Z, and give conditions under which I of § is a closed or an open set. - 
At the close of this paper we- prove the following theorem : There exists 

a support plane T of S on a certain boundary point b, of S such that .the 
infinite half-ray taken along the inward normal to T at b| lies entirely i in 8 if, 
and only if, the set of boundary points of 8 is homeomorphic with the plane. 
We obtain also a sufficient condition that the set of boundary points of 8 
may possess a representation in the form z = f(z, y) with f one-valued and 


` continuous, ` Z . . | ' 
A | t 





2.. Convex cones. To the definition of the convex cone C already given 
above, we add the definition of the cone Cp polar to C: On every lsupport plane 
of C (all of which evidently contain the vertex O of Cy. we erect the normal 

turned away from C at O. The totality of all such normals (considered as 
infinite half-rays) forms the polar cone Cp of C. Since O is always a boundary ` 
point of C (relative to H*) it is clear that C, is not an empty set. It is also ` 
easy to show that C, is closed and convex, i. e. it is also a convex cone. (See, 
for example, Bonnesen-Fenchel, loc. ctt., p. 4). 

We begin the discussion of the cones C with 


i 
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Turonni I The polar cone Cpp of the polar cone Cp is identical with 
the original cone? C. 


It is clear that the above definition of the polar cone Cp can be given in 
the following form: the necessary and sufficient condition that a half-ray rp 
should belong to Op is that ẹ (r, 1p) = »/28 if r is any half-ray belonging to C. : 
If, then, r C C, it follows at once that r € Cpp from the above definition of 
the polar cone; hence we have C C Cpp On the other hand, if r Œ O, it is 
clear that a support plane T of C can be found which will separate r from C. 
(Here we use the fact that C is a closed set.) The outward normal rp on T 
at the vertex O of C belongs to Cp by the definition. Hence ẹ (1,15) < 7/2, 
and r can not belong to Cpp Hence from r Œ C, follows r Œ Cpp and we have 
thus completed the proof that Cpp = C. 

It is convenient to divide the cones C into the following classes, in which 
C is: (1) an entire straight line (that is, the entire #*), (2) an entire plane 
(that is, the entire °), (3) any of the remaining possibilities. It is further 
convenient to subdivide class (3) into three additional sub-classes, i. e., those 
in which: 

(a) C possesses inner points in Æt, but not in #?. C can be only a single 
infinite half-ray: if C possessed more than one such half-ray, it would either 
possess inner points in H?, since C is convex, or it would consist of an entire 
straight line, both of which cases are to be excluded. 

(b) O possesses inner points in E? but not in E3. C can be only the 
convex portion of the plane between and on two infinite half-rays emanating 
from the same point, since C is convex and can not be the entire plane. 

(c) C possesses inner points in Æ’. 


It is clear that these classes and sub-classes are mutually exclusive and exhaust 
all possibilities in the ES. | 

lt is of interet to note the nature of the polar cone Cp in each of the above 
classes. In cases (1) and (2) this is quite simple: Cp for class (1) is the 
entire plane, evidently, i. e., Cp is of class (2); Cp for class (2) is an entire 
Straight line, i. e., Cp is of class (1). 


Lemara 1. The polar of a cone of class (3) is itself of class (3). 


This follows from Theorem I: If the polar Cp of a cone C of class (3) 
were of class (1), say, then Cpp would be of class (2) as we have just seen. 
But O == Cpp which shows that our assumption is absurd. It is clear, also, 


3 See Steinitz, loa. cit., vol. 144, p. 10. 
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that Cz could not be of class (2), by the same argument, Since o, is not 
empty, it must be of class (3). - 

From now on, in this section, we shall consider cones of class (3) only. 
Evidently this class is digtinguished from the other two by the following . 
property: The vertex O of C is a boundary point of the cones of.¢lass 3), but 
is an inner point in classes 1) and 2), according to our definition of the terms 
inner and boundary points when applied to cones C. We prove now a theorem 
on cones of class (3): |. 





‘ THEOREM II. There exists a support plane T of C with the following 
property: the inward normal (that is, the normal turned toward C) on Tat 
O lies in C. 


Since Cand Op are both of class (3), by Lemma 1, the boing O is a 
boundary point of both sets. It is obvious from the definition of Cp that C 
and Cy have no points in common except O. It follows at once that C and Cp 
possess a common support plane through + O. The outward normal rp (relative 
to C) jon T at O belongs to Cp, the inward normal r on T at O jto Cpp, hence 
r C O! since C = Opp- | 


“We can present this theorem in a sharper form, as follows: 


SS 


‘THEOREM III. There exists a support plane T of C with the following 
property: the inward normal on T at O contains (with the exception of O) 
only inner points of C. 


We know from the preceding theorem that there exists a support plane T 
and an inward normal r on it at O, such thatrCC. Ifr contained an inner 
point of C, then our theorem would be proved, clearly. If rı contained no 
inner) point, it would lie in the boundary of C. There would! hence exist a 
support plane 7, containing r, which would be perpendicular to T. Through 
O we take a third plane Ta perpendicular to both T and T. It I, contains 
an inner point p of C, the ray Op lies in the interior of C and, in addition, ` 
a plane Ta through O normal to Op. would clearly be a support plane of C: 
the ray Op would hence possess the required property. If T} contained no 
inner point of C, then T, would necessarily be a support plane of C, and C 
would lie in the convex portion of space between three planes mutually at right 
angles. In this case it is cléar that T ray: in the interior of C would have 
the required property. ` 


8, The characteristic cone. Since any set 9 is unboundbd, there exists 
an unbounded sequence of points pi, Pa,” * * , pv," <in S. Consider any point 


t Bonnesen-Fenchel, loo. cit., p. 4. The same remark applies here as in footnote 1. 
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Pp E S. 8 ene closed and convex, all line segments Pø» as well as all limit, 

points of such segments belong to 8. The half-rays ppv cut the unit sphere 
with'p as center in. points qy which possess a limit point q. The half-ray p4 
is made up entirely of limit points of the segments ppr. We have then 


-Learara 2. very point pC 8S contains at least one infinite half-ray 
CS. 


ae half-ray with this property we lot from now on as an astal half-ray. 


Lara 3. If ris an axial half-ray on point 5 C g and p any other point 
of 8, the half-ray F on p parallel to r is also an aztal half-ray of 8. 


For if Pı, Po," ` *, py is an unbounded sequence of points of r, the line segments 
Pp» lie in S and all points of F are limit points of these segments. Hence 
FZS. . 


LEMMA 4. The set C of all axial half-rays emanating from a point p C 8 
4s a convex cone, the characteristic cone of S. 


That C is convex is shown as follows: Let p, and p, be any two points of C. 
We must show that the segment Pipe C C. This is evidently the case if the 
half-rays ry == PP and To == Pps are identical or opposite in direction, or if 
either p, or pz is identical with p. In any other case the-plane convex sector — 
defined, by rı and rz belongs to § since § is convex. All half-rays in this. 
sector which emanate from p belong to C, and with them the segment Dips. 


From Lemmas 2 to 4 we conclude: To every point p Cos there exists a 
characteristic coné with p as vertex, and all such cones go into one another by 
translations. 


4, Topological structure of the boundary points of S. Consider any 
inner point p C § and an infinite half-ray r, going out from p, which contains 
at least one boundary point of SS. On going out from p along r one must come 
upon a first such boundary point, say b, since the set of boundary points on r- 
is closed and also bounded on the side toward p. Any support plane P of 8 
on b cuts the ray r at b as it would otherwise contain p, an inner point of J, 
and this is manifestly impossible. The line segment pb is thus the set of points 
common to § and r, all of them being inner points except the manque boundary ` 
point b. We conclude in 


Lemma 5. A half-ray drawn from an inner point of 8 contains a unique 
boundary point of S, or it lies SR in S and therefore pes to the char- . 
acteristic cone of 8. 


i 
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Let p be an inner point of S, K the unit- sphere with centerjat p, and C 
the characteristic cone of S with vertex at p. . The points of intersection of C 
with the surface of K we denote by Č, the remaining points on the sphere by 

. B, i.e: B is the > complement of C relative to the surface of K. From Lemma 5 
we conelude : B is the set of points formed by the intersection of the surface 

` of K with infinite half-rays drawn from p to the set B of boundary points of S 

and thé correspondence thus set up is one to one. Since the characteristic cone 

C for any S is independent of the point pC S chosen to define it, the set G, 

and with it B, is uniquely defined by S. C being a closed set jan and not the 

entire #*, it follows that C is not the entire surface of K and is closed and 
that Blis a non-empty open set. 

The sets B and B are homeomorphic, that is, the correspondence set up 
between B and B by central projection'is not only one to one, as we have seen, 
but also continuous in both directions. We show the continuity first in the ` 
direction B—B. Consider any point bC B and let b CB be the point 
| corresponding to b. We have to show that to any set b; C Ë and having D as 
limit point corresponds a set b; C B with b as limit point. To prove this 

‘it is sufficient to show that the set b, possesses a limit point b* on the half-ray 


' pb, for, assuming the existence of b*, it is clear that bt C B and Lemma 5 
shows that b and b* are identical. "The existence of b* is readily shown: The 
‘boundary point b contains a support plane P of 8 on one side of which b, as 
well as p lie; we can construct a finite cone with vertex on p and base on P which 
will be bounded and contain if not b, itself at least an infinite sub-sequence b’; 


of bj, from which, if necessary, a further sub-sequence can clearly be taken 





which |will converge to a limit point b* on pb. The existence of 6* is thus 
assured and with it the continuity in the direction B—> B. The continuity in 
the direction B > B can be shown in a similar manner; in fact, this is simpler, 
since it is evident a priori that every infinite set b; C B possesses a limit point. 
ake To sum up, we have seen that Ë, the complement of 0 relative to the 
surface of K is uniquely determined by 8, and is “homeomerphi with the set 
` of boundary points B of S. The problem of determining the possible topo- 
logical structure of the boundary points of § is thus resolved into the following 
problem: Determine the possible topological structures of. ‘the open sets on the 
surface of the unit.sphere K which are obtained by removing from the surface 
of K its intersection with any convex cone with vertex at the center of K. 
Before continuing with the solution of this problem, it is of interest to 
note that Theorem I, Lemmas 2 to 5, and all that we have shows in this sec- 
tion with the proofs, as given, are valid for the E*, independent ot n. 


` 
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We consider the classification of cones C given in section 2 above. These 
classes were those in which C is: (1) an entire straight line, (2) a plane, 
(3) all other cases. In case (1), is made up of two diametrically opposite 
points of the sphere and B is homeomorphic with an open cyclinder. In the 
second case © is a great circle on the sphere and B is homeomorphic with two 
distinct planes. 

In the third case it is convenient to consider a further division of the 
cones into sub-classes in which Ọ possesses: (a) inner points in H? but not 
in F°, (b) inner points in Æ? but not in F’, (c) inner points in E*. In sec- 
tion 2 we saw that C in case (3a) is a single half-ray; hence C is a single 
point and its complement B is homeomorphic with the plane. We saw also 
that C in case (3b) is a convex sector of the plane; Č is a closed segment of a 
great circle and again B is homeomorphic with the plane. 

In the case (3c) it is clear that © possesses inner points relative to the 
surface of the unit sphere. We shall show that C is convex on the sphere, that 
is, © lies on a hemisphere and with any two of its points also contains a great 
circle arc of length + joining the two points. An immediate consequence 
of this is that C is simply connected, and, since it is also a closed set, its 
complement B relative to the surface of the sphere would be homeomorphic 
with an open circle, hence also with the plane, which is what we wish to show. 

We have, then, to show that © in the case under consideration is spheri- 
cally convex. This follows from the convexity of C. The vertex of C (and 
center of the sphere) is a boundary point of C, a support plane of C at this 
point exists, and, since Ë C C, it follows that © lies on a hemisphere. Con- 
sider any two points p, and pe which belong to C and which are not at opposite 
ends of a diameter of the sphere (that such points exist is clear since Č 
possesses inner points on the sphere). The entire convex plane sector formed by 
rays from the center of the sphere to p, and pz belongs to C ; the intersection of 
the sector with the sphere belongs tc C: it is thus clear that the shorter great 
circle arc joining p, and pa belongs to ©. If Ë contained no diametrically 
opposite points, the spherical convexity would be proved. If Ô should contain 
a pair of diametrically opposite points p, and pz, it would also contain a point 
Ps different from these. The great circle arc joining ps with p, and p, would, 
as above, belong to Ë. Hence Č is spherically convex. 

With this we have also determined completely the possible topological 
structures of the boundary points B of sets S in E°. Summing up, we have 


THEOREM IV. The set of boundary points of a set S in E*® possesses one 
of the three following topological forms: (1) an open cylinder, (2) two dis- 
tinct planes, (3) a single plane. f 
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therefore the space between two parallel planes. 


_by 


J. J. BIOKER. 


Le of the above theorem: 
Case (1). 


Tt is of interest to characterize in more detail the sets S in 


We have seen that the characteristic coné in 
single straight line L. Consider the set of points P common to 


cases (1) and 


this case is a 
S and a plane 


perpendicular to L. Since such a plane contains no axial half-ray, it follows 


that P is a bounded set. It is also, of course, closed and convex. 
there 
to L) which lies in S. § is thus an infinite cylinder. 


' Case (2). The characteristic cone is a plane. 
generated by moving a plane parallel to itself through a finite 


` We can now state an evident corollary to Theorem IV: If 


The set S 


By Lemma 3 


exists through every point of P one and only one straight line oats 


can clearly be 
distance. § is 


the set.S in E3 


is not the space between two parallel planes, the set of its boundary points is a 


connected set. 


confine ourselves here in general to sets in EF, it is of interest 


This corollary is true for sets 8 in the Æ”, and although we 


to give a proof 


of this fact 5 valid in the H*, This is readily done with the aid of the following: 


Lemma 6. Consider a closed unbounded convex set S wi 
all of whose boundary points lie on a pair of parallel planes 
at least one boundary point on each plane. 
parallel planes. 


h inner points, 
. and T with 


Then S is the space between the 


From the Lemmas 2 to 5 (valid for any dimension) we ulna? the char- 


acteristic cone of J is a plane parallel to T, and Ts and all points of T, and Ta 


are boundary points of S. This proves the lemma. 


Taxorem V. Let S be any closed unbounded conves 
points which is not the space between parallel planes nor.t 


set with inner 
6 entire space. 


Any two boundary points of S can be connected by a continuous prang curve 


lying in the boundary of 8. 


| Let b, and bs be any two boundary points of S, T; and Ta support planes 


T, dod T, which contains bbz. 
taining b, and b, and cutting the intersection of T, and T in 


- on these points. We may clearly assume without loss of generality that T, 
„< and T; are different. Two cases are to be aces a @) 
not parallel, (b) Tı and T, are parallel. 


Tes and T are 


| (a). T, and T, intersect. § lies in the convex portion of space bounded 
Consider a two-dimensional plane con- 
point a. We are 


free to assume that b,b; contains an inner point. p C S, since otherwise biba. 


* This result, but not our Theorem IV, is due to Steinitz, loo. oit., 





‘vol, 146, p. 10. 
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itself belongs to the boundary. of 8. From p lines are drawn to all points of 
the segments b,a and ab. Every such line clearly contains a single boundary 
point of 9, and we see without difficulty that 6, and b, are connected by a con- 
tinuous plane curve lying in the boundary of S. (See the proof for homeo- 
morphism of the sets B and B in section 4). 


(b). Tı and T, are parallel. Lemma 6 and our fmmdamental assumption 
insure the existence of a boundary point b which does not lie on T, or T, and 
which must lie between T, and Ts, S being convex. Any support plane of 8 
on b must clearly intersect T,.and T,. The proof that b, and b, are connected 
by a plane curve lying i in the boundary of 8 may now be conducted in exactly 
the same way as in (a). 


5. The spherical image of S. Consider any boundary point b of 5, 
together with the characteristic cone O and its polar cone Cp erected at this 
point. Any support plane T of § on b must be a support plane of C also: 
it follows at once from the definition of Cp given in section 2 that the outward 
normal n on T must lie in Cp. We erect C and C, with their common vertex O 
at the center of the unit sphere and denote by Cp the intersection of Cp with 
the surface of the sphere. From the definition of the spherical image I of 9 
given in section I and from the foregoing we conclude: I C Op. 

We proceed to investigate the relations between J and ©, in more detail. 
Since ©, is the spherical image of C, we are in effect investigating the relations 
between the spherical image of 8 and that of its characteristic cone. It i is con- 
venient to introduce at this point the same classification of convex cones dis- 
cussed in section 2 and apply it here to the polar cone Co of the characteristic 
cone of S: 


(1) C is an entire straight line, C is a plane, and S the space between . 
two parallel planes. - In this case J and Cy, are evidently identical. 


(2) Ch is a plane, C is an entire straight line perpendicular to it, S an 
infinite cylinder erected on a bounded convex plane set and again J == Öp (each 
is a great circle on the unit sphere). We use here the known result that the 
“circular image” of a bounded convex set in the plane is the circle: circular 
image of the plane section of S and spherical image of S are identical. 


(3) All other cases, subdivided as follows: 


(a) Op possesses inner points in E+ but not in Æ*. We know that Cp 
consists of a single infinite half-ray. The cone C is thus a half-space and one 
can easily show that S must also be a half-space. It is then evident that ` 
I= Öp | l 
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(b) Cp possesses inner points in #? but not in E*. C,lis the convex 
‘space between two half-rays. The cone- C is a wedge: that is, the convex space 


‘between and on two half-planes having a common boundary. (The two planes ` 


are at right angles to the boundary half-rays of Op). The set Slis easily seen ` 
to be an infinite cylinder with generators parallel to the edge of the wedge and 
having as base an unbounded convex set in the plane, in contrast with case (2) 
of Theorem IV in which the cylinder is erected on a bounded set. The set 
O, is 8 closed segment of a great circle (at most a semi-circle) and the spherical 
image of S is clearly the same as the circular image of the plane, unbounded, 
convex set upon which the cylinder is erected. It is convenient to defer 
farther consideration of this case until after discussion of the next case, which 
is exactly analogous for thtee dimensions. | 


(c) Cp possesses inner points in E°. Öp possesses inner points rela- 
tive to the surface of the sphere. We shall show that every inner point of Op 
is a point of the spherical image of S. Let s be an inner point of Cp, P the 
plane! through the common vertex O of C and Cp which is perpendicular to the 
line joining O with s. P is (1) a support plane of the cone C which (2) con- 
tains ino point of C except its vertex O; (1) P is a support plane of C by the 
definition of Op and (2) P contains no point of C except O, pince it would 
otherwise contain an entire half-ray r C C which would clearly make an angle 
<*/2 with some of the half-rays of C,-which pass through the points of a 
neighborhood of the inner point s C Öp, in contradiction with] the definition 





_ of O}. Consider next any inner point p C § and the characteristic cone C of 


8 erected on p'as vertex. The set of points common to S and that one of the 
two half-spaces bounded by a plane T parallel to P and noti containing C 


“We denote by S. S is evidently convex; it is moreover bounded, since 


every, infinite half-ray emanating from p and lying in the half-space 


‘containing 9’ contains a boundary point of S because of the choice of T. 


The set S’ possesses inner points in Æ? (since p is an inner point of 9) and 
T is a support plane of this set. By a well known theorem on bounded convex 
sets there exists a second support plane T” of § parallel to T. T is clearly 


also à support plane of S: The outward normal on T” is parallel to Os (the 
direction of the outward normal on T relative to 0). Hence point s belongs 
to the spherical image of S, as was to be shown. A Ste Te F 
We can now consider case (b). The method of proof used for case (c) 
is not valid here without change, since no support plane of C exists which. 
contains only the vertex of C because of the fact that O contains an entire 
straight line in its boundary—the “edge” of the wedge—and every support 
plane of C must contain this line. However, as remarked above, the spherical 


b | 
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image of 9 is the same as the circular image of the unbounded convex plane 
set which consists of the intersection of 9 with a plane perpendicular to the 
edge of the cone C. We might call this set S?. A discussion exactly analogous 
to that of the above but carried out one dimension deeper would show that 
every inner point of Č, would be a point of the circular image of S?. (The 
term inner point means, of course, inner point relative to the great circle which 
contains Oy). We sum up these results in 


TuxorEm VI. The spherical image I of 8 is contained in the spherical 
image C, of its characteristic cone. If Opis one-dimensional or two-dimensional, 
at least every inner point of C, (this term being used with reference to the 
dimensionality of p) ts a point of I. In all other cases I ts identical with Cp. 


The cone Cp possesses always a support plane through its vertex. From 
this we deduce an evident corollary to the above theorem: I of S lies on a 
hemisphere. 

At this point a natural question arises: Under what conditions will I be 
an open set (that is, contain none of the boundary points of Op), or a closed 
set? The cases in which this question remains open are the cases (3b) and 
(3c) above, I being identical with ©; in all the others. Additional restrictive 
assumptions must be made on the sets S in order to answer this question 
definitely : Consider the convex set which is bounded by a paraboloid of revolu- 
tion: the spherical image of this set is evidently an open hemisphere; on the 
other hand, a semi-infinite right circular cylinder possesses a closed hemisphere 
as spherical image, though the cone € and with it Cp are the same for both 
sets—a single half-ray and a half-space. An example of a set for which J is 
neither open nor closed could easily be given. 

We begin our discussion of this problem with the case (3b), which redudes 
to the consideration of an unbounded convex set S? in the plane whose char- 
acteristic cone is the convex sector between two hälf-rays. If C? of S°'is in 
particular a half-plane, S? is also a half-plane, as one readily shows. In this 
case the circular image I of S? and the circular image Cp of C? are evidently 
the same—a single point on the unit circle. If C? is not a half-plane, which 
we assume from now on, then ©, clearly possesses inner points in #1, all of 
which as we have seen belong to J. We wish to obtain conditions on 8? which 
determine whether the two boundary points of ©, belong to J or not. That 
C, has only two boundary points is sufficiently clear. It is of importance to 
‘note explicitly that the boundary points of Cp are determined by the two 
boundary rays of C, and that the latter are the two half-rays at right angles to 
the boundary rays of C. 

. Suppose that a boundary point 6 C G, belongs to I. As remarked above, 
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the half-ray Ob is perpendicular to an axial ray 7 C O; at the same time 0b 
is parallel to the outward normal on a certain support line L of 8%) L contains 
a half-ray (parallel to 7) by Lemma 3, all points of which must be boundary 
points of S? since L is a support line. We.conclude: If S contains no infinite 
half-ray in its boundary, J contains no boundary Po of Cy, l-e, I is an 
open set. 

On the other hand, suppose that there se g circle with radius sufficiently. 
_ large that all boundary points of S? outside of the circle lie on infinite half-rays 
belonging to the boundary of S. We may choose a circle with this property 
with its center at an inner point p C 8%. On p as vertex we erect the cone C. 
. The boundary. rays of C intersect the circle say in points q, and qj. The angle 








between Pq and Pe is definitely < ~, since C is not a half-plane. The out- 
ward normal ñ to O (regarded as a half-ray) erected at q) must contain a 
‘boundary point b of S? (since n Œ C), which must, in addition, lie outside 
the circle, n being a tangent to the circle. By our assumption, there exists a 
certain half-ray 7) starting from b which lies entirely in the boundary of 42. 
It follows that 7, lies on a support line of 8, and, by Lemma 3, r must be 
paralel to a ray in the boundary of C; if it were in the interior of C all points 


of To except b would be inner points of the characteristic cone C erected on b 
as vertex and hence also inner points of 82, which is not possible. Moreover, 


To must be parallel to pg.; if it were parallel to pq: (the only other pompk) 
it would necessarily intersect pq, since the angle between n and To would be 
> x/2, evidently. Again we see that 7 would contain inner points of O and 
oghsequently also of S?, which is impossible. Hence ? is tes cum to To- 





It follows at once that the point of I corresponding to # is one of the two 
boundary points of ©. In the same way, by considering an outward normal ` 
to Ci at point’ ga, one shows that the other boundary point of Ọs also belongs 
to T. I is therefore closed. 
* [We pass to consideration of case (3c), that in which ü p possesses inner 
points relative to the surface of the unit sphere. Theorem III insures the 


existence of a support plane T of Cp with the following property: the inward 
normal 7, on T at the vertex O of Op contains (with the exception of O) only 
‘inner points of Cs. Since the polar cone Co of C; is identical with C (Theorem 
IL), it follows that the outward normal m on T at O lies iniC. Because of 


` the fact that n; lies in the interior of Cp, the plane T contains no point of C 
except O. Let p be the point of intersection of the inward normal on F with 
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the surface of the unit sphere: p is thus an inner point of G, and hence a point ` 
of I by Theorem VI. Consider any boundary point of Cy, say q. The points 


p and q determine a great circle on the sphere, a closed and connected segment ` 


of which belongs to C,, since Cp is convex. One boundary point of this seg- 
ment is g while p lies in its interior. | 
We consider the set S, consisting of the orthogonal projection of all 
points of S on the plane P determined by the points O, p, and g. (As C; lies 
on a hemisphere with p as an inner point, it is clear that p and q do not lie 
on a diameter of the sphere and O, p, and q determine a unique plane). Sy, is 
unbounded, since P contains an axial half-ray of 8, i.e. the half-ray in the 
direction Po. Sp clearly possesses inner points in the plane, but is not the 
entire plane, as S possesses a support plane parallel to T (p being a point of I), 
which is by construction perpendicular to P. As we have seen, the plane T 
contains no ray of the characteristic cone C of S; consequently the intersection 
of S with any plane T, parallel to T is a closed bounded set. Since the T4 
are perpendicular to the plane of Sp, we conclude that the correspondence 
established between 8 and Sp is such that to each boundary point of Sp corre- 
sponds at least one boundary point of S. 9 being unbounded, it is not obvious 
that Sp, the projection of 8 on a plane, is a closed set even though S is closed. 
This we show as follows: Let p be a limit point of a set pv C 8,. -We must 
_ “prove that p is the projection of some point of Son the plane P. Consider a 
circle with p as center which contains an infinite number of points p’y of the. 
points pv. As we have seen, the intersection with: 8 on any plane T4 per- 
pendicular to P is a bounded set. It follows that the intersection of 9 with the | 
right cylinder erected over the circle with p as center is also bounded; a set of 
points in S corresponding to the p's C S, must then possess a limit point pa Q 
the projection ray through p and p, belongs to § since J is closed. This 
establishes the fact that 8, is closed. The projection 8, of S on the plane P is 
- thus a closed unbounded conver set such that to each boundary point of Sp 
corresponds at least one boundary point of S. Also, support lines of 8, corre- 
spond to support planes of S: perpendicular to the plane of Sp and vice versa. 
We may now apply to Sp the reasoning used above for the sets S?. The 
assumption that q, a boundary point of >, belongs to I of S and consequently - 
also to the circular image J, of Sp insures the existence of an infinite half-ray 
in the boundary of Sp, as we have seen, and hence the existence of an unbounded 
set of boundary points of S all lying in a certain support plane T, of S per- 
pendicular to the plane of Sp. The intersection S,? of T, with 8 being convex, 
. it follow that 9,° contains an axial half-ray. Since q is any boundary point 
of I of S we may conclude: I of S is an open sét if S possesses no half-ray - 
‘in its boundary. 


12 
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On the other hand, assume now that'a sphere exists such that every sup- 
- port plane of S on any boundary point outside the. sphere contains at least 
one half-ray in common with 8. Let bp be any boundary point of Sp which 
corresponds to a boundary point b of S outside such a sphere. We show that 
` bp lieslon an infinite half-ray r, in the boundary of Sp: There exists a support 
plane Tof § on b perpendicular to the plane of Sp T, contains an infinite 
half-ray r belonging to the boundary of 9 by our assumption, and r can not 
be perpendicular to the plane P containing Sp since, as we have seen, the 
characteristic cone C of S is such that none of its rays are at right angles to P. 
. It follows that the projection of r on P is an infinite half-ray which evidently 
lies in the boundary of Sp. The spherical image of Sp is thus a (closed set, as 
_ we baye seen above, and we conclude that the point g in the boundary of Č, 
belo 8 to I of S. ` The point q being any boundary point of Öp, it follows that 
all kounia points of p belong to I and I is closed. We thus have 





’ TrreoreM VII. If I of 8 is of dimension two, it is (a) an open set if 
S contains no infinite half-ray in its boundary, (b) a closed set if every support 
` plane|of S outside a certain sphere lies on an infinite half-ray| belonging to 
the boundary of 8. | 


I of 8 is of dimension one, the above theorem does not) hold without 
modification, as the following example shows: Consider the set S consisting 
of the convex portion of the plane bounded by a parabola together with all 
straight lines through such points at right angles to the plane. |The spherical 
image I of S is evidently an open semi-circle, though the condition in (a), of 
Theorem VII is violated and that of (b) is fulfilled. However, ag we have seen 
in dealing with the sets S? above, Theorem VII holds in this case also if it is 
Splied, with obvious changes in the terminology, not to § itself but to the 
plane, section of § which is normal to the plane conatining|I. .We have 
already seen that I of S in this case is identical with the circular mage of 
such a section. l 


4 


6. Additional results. We begin this section with a theorem which 
gives a characteristic property of the sets S whose boundary points are komeo 
morphic with the plane: 


Taxorem VIII. -If the set of boundary points of 8 is homeomorphic 
with he plane, there will exist a support plane T on a certain Youndary point 
bof 8 with the following property: the infinite half-ray taken along the inward 
normal to T at b (that is, the normal turned toward 8) lies entirely in 8. 


It is clear that this property is iat shared with the seta S whose boundary 
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points are homeomorphic with a cylinder or a pair of planes, since such sets are 
geometrically right cylinders or the space between parallel planes respectively. 

Any set S whose boundary points are homeomorphic with the plane 
possesses a characteristic cone of class (3), as we have seen in the course of 
proving Theorem IV. It follows (Lemma I, section 2) that the polar cone C, 
of C is also of this class. We may therefore apply Theorem III of section 2 
to Cp. This theorem insures the existence of a support plane T, of Cp such 
that the inward normal n on T, at the vertex of Op contains, with the exception 
of the vertex, only inner points of Cp. The outward normal on T, at the 
vertex lies in C, by Theorem I and the definition of the polar cone. Since n 
contains an inner point p of Cy, it follows that T, is parallel to a support plane 
T of 8, since pC I of S by Theorem VI. The inward normal on T at the 
boundary point through which it passes is thus parallel to a ray belonging to 
C and consequently belongs to 8, which proves the theorem. 

For certain sets 9 it is possible to find a plane P with the following 
property: the orthogonal projection of the set B of boundary points of S on 
P is such that a one to one correspondence between the two sets of points is 
established. It is clear that only the sets S whose boundary points are homeo- 
morphic with the plane can possess this property. In fact, only certain sets of 
this type possess it, as the example of an infinite half-cylinder shows. We have, 
however, in the following theorem : 


Taxogem IX. If the characteristic cone C of S possesses inner points in 
BE, a plane P can always be chosen to serve as x, y-plane of a set of orthogonal 
cartesian coGrdinates such that the points of the set B will be given by 
z == f(x,y) with f one-valued and continuous. 


Consider any half-ray + which lies in the interior of C. We show that a 
plane P at right angles to r has the required property. By Lemma 3, there 
exists an infinite half-ray parallel to r through every point 6 C B which lies 
entirely in S. This half-ray contains with the exception of b only inner 
points of S, since r lies in the interior of O. It is moreover clear that b is the 
only boundary point of § lying on the straight line containing r. This is 
sufficient to show that the projection of the points of B on P is one to one. One 
shows also without difficulty that f is continuous: for example, one might use 
practically the same method as that used at the beginning of section IV. One 
sees also that the domain of definition of f is the entire plane. 
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ON THE SMOOTHNESS PROPERTIES OF A FAMILY OF 
BERNOULLI CONVOLUTIONS.* 


By Paur Enpôs. . 


Let L(u, o), — © < u < + œ denote the Fourier-Stieltjes transform, 
oO * t 
f e*do(x), of a distribution function o(s), — œ <s < +3. Thus if 
RE 

A(x) is the distribution function which is 0, 4,1 according as = —1, 
—1<z3£1,1<%, then L(u, 8) —cosu; and so, if b is a positive con- 
stant, cos (w/b) is the transform of the distribution function B(bx). Hence, 
if a is a positive constant, the infinite convolution i 


| oo(t) = Bar) * (ae) *B(az)#- >> 
is convergent if and only if a > 1; its Fourier-Stieltjes transform being 


G); L(u, oa) — TI cos (u/a*), (a> 1). | 


It is known? that the distribution function os is continuous for every 
a > 1 and, in fact, is either absolutely continuous or purely singular, depend- 
ing on the value of a. In this direction it is known? that the set of points z 
in the neighborhood of which oa(z) is not constant is either the interval 
tÆ a/(a—1) or a nowhere dense perfect set of measure zero contained in 
this interval according as 1 << a2 or 8 <a. While this implies that oa(z) 
is singular if 2 < a it does not imply that ca(z) is absolutely! continuous if 
a< 2. In fact it has recently * been shown that there exist oa algebraic 
irrationalities a < 2 for which L(u, sa) does not tend to zero with 1/u and 
sò og cannot be absolutely continuous. (It was conjectured, loc. cit., 3, that such 
values of a are clustering at a == 1 -+ 0 which would imply that! they lie dense 
in the interval 1 <a < 2). On the other hand it is ow" that those a <2 
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for which og is absolutely continuous are certainly clustering at a = 1 +0, 
since if a==2/™, where’ m is a positive ee then og has a continuous 
derivative of order m — 1. 

The object of the present paper is to. show that the successive smoothing 
of og can be considered as the general case when a—>1-++ 0. In fact it will 
be shown that there exists, for every positive integer m, a positive y(m) such 
. that the set of those points a of the interval 1 < a < 1 + m(m) for which og 

does not possess a continuous derivative of order m— 1 is a set of measure 
zero. To this end it is sufficient to prove that there exists, for every positive 
integer m, a positive 8(m) such, that He set of those points a of the interval 
1<a@<1-+8(m) for which ` 

(2) ~ L(u, oa) = 0(|u |"), b> 0, 
‘does not hold-is a set of measure zero. | 


Let C1, Ce, > CN be N positive MISES which satisfy the folowing 
conditions : 


(i) a 2; 
(ii) Ce [L y (¢—1,2,-°°,N—1); 
(iii) Chr < 304 (t=1,2,---,N—1); 


(iv) there-exists an a such that 2i < a < 2 and | Gen — ac; | <2, 
| (i=1,2,---,N—1). 


Lemma 1. There exist two positive absolute constants yı, ye such that 
if M is ary fixed number > yz, there are less than [M] different sequences 
C1, Co," °°, Cy satisfying the requirements (1)-(iv), the inequality cy SM, - 
and the condition that the number of those indices i (t—=1,2,---,N) which 
satisfy | Ci — acy | > Yo ts less than y: log M. à | 


Proof. Suppose that | cur — aci |S Ko and | ce — acm | S Ho for 
a fixed + Then 








Char 
Ci 1 | < 10c4 ? 
hence 
Cu y Ci+1 3 
a 41 < Ton < 10 








by (iii). Consequently, since | c13 — aes | < o by assumption, 


CAE er 1 
C4 = 2 





3 1 
< 10 + 10 < 
and 80 C442 is uniquely determined as the nearest integer ® to c*441/¢. 


“The above considerations are suggested by the investigations of Ch. Pisot, “La 
répartition modulo un et les nombres algébriques,” Annali d. R. 80. Norm. Sup. di Pisa, 
ser. II, vol. VII, p. 238. 
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Consequently if t, t'>, i denote all those among the N indices i which : 
the inequality | cı — ac; | > Yo then all indices ¢ which are not of 
the form i + 1 or 4, + 2 for some r = 1, 2,---, 1, are such that c; 


is uniquely 


determined by c+, and c;-2. On the other hand, éven if j is of the form i, + 1 
or ù + 2, so that cp is not uniquely determined by cj. and cys; then there 


are, by 
-Hence 


(iv), (or (i)), at most 4 choices for c; after cj, has been determined. 
there are at most 4°! different peas Ci, Co," °°, Cy which have a 
given set of exceptional indices h, tz, © +, in 


Finally (ii) and (iv) together with the notion ax = M clearly imply 


that N 
of exce 


Lemma 1, it is seen that the number of distinct possible choices 


` excepti 


| (IREM) + (Isle M à. | + (pre) 


< 5 log M for sufficiently large M, say for M >y Since 


onal indices cannot exceed 


[y: log M] 


ithe number 
ptional indices i, %,- > -, 4 is less than y: log M, by the hypothesis of 


for a set of 


‘and is therefore less than M*/* if yı is chosen sufficiently small. Since it was 


shown 


above that there are at most 4?! sequences c1, C2,‘ ‘+, ¢w with a given 


set of ee indices, it follows that the number of distinct sequences 


C1, Ca, ° 
is less Dan , 
Ms . 421 < Mus + 42% log M < M'/s 
if yı is|sufficiently small. This completes the proof of Lemma 1. 
© If'a, À are positive numbers let Az = = Ax (a, À) and a —œ(a,À 


for k= 1,2, >, by placing 


© (8) 


‘Aak == Ár + a, Ay integer, —$< ah. 


> ON which satisfy the requirements of Lemma 1 for a fixed M >y: . 


be defined, 


Limma 2. There exists an absolute constant ys, which shall be chosen 
to be > ya, such that if M has a fixed value greater than ys, then the measure 


of the 3 I‘ of those values a in the interval 


(4) 


for which there exists in the interval 


(5) 


a à—A(a) stich that the inequalities 


(6.1) 


A<a<? 


1<A<2R 


J 


Mak < M: (6.2)  [e(aa)l > Hg 


| hold for at most dy, log M distinct values of k, ts less than M+. It is under- 
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stood that œ = œ(a, À) is defined as in (3), and that yı, ya are the absolute 
constants occurring in Lemma 1. 


Proof. Suppose, if possible, that Lemma 2 is false. Then there exist at 
. least [M+] values of a in (4), say 
ay, G12: o BEPA), 

which are in T and which are separated by [4] — 1 intervals each of which 
has a length not less than A1-#/#; so that 
(7) ` | aj — a | Z Me. 
Since a; is in T, there exists a A = à (aj) in (5) such that 

æ(as, A(a;)) < %o 
holds for all but $y: log M values of k satisfying 

aà (az) < M, 
where, according to (3) 
(8)  afA(a;) = Ar(ay, A (a3) ) + (as, A (a4) ) = Ar? + a, say. 
It will be shown that 


(I) The finite sequence of integers A? belonging to a fixed j 
(— 1,2, «+, [MY]) satisfies the hypotheses of Lemma 1 if this sequence 
of integers is identified with the sequence of integers c1, ¢2,° - *, cy occurring 
there; and that 

(II) The sequences Ax‘? corresponding to different values of j are 
distinct. Since there are [Jf*/*] such sequences this will contradict Lemma 1 
and so complete the proof of Lemma 2. 


In order to prove (I) notice first that (i), (ii), (iii) are obviously 
satisfied for c; == 44%. Furthermore, by (8) 


A pe 1h — ar (di +e) 
and 50, by (3) and (4) 
| A) — ayy | = | ages’) — e HT < 2; 


so that (iv) is also satisfied, with «œ == aj. The hypothesis (6.1) assures that 
the assumption cy = M of Lemma 1 is satisfied. In order to verify the 
remaining assumption of Lemma 1 recall that there are at most 4, log M 
values of k satisfying (6.1), (6.2). Thus there are at most y. log M values 
of i such that (6.1), (6.2) are satisfied either for k — à or for k =1 + 1. 
But if i has a value distinct from one of these yı log M values, so that 


a? | < Yo and ef < Ho, 
then, by (4), 


184 | = - PAUL ERDOS. 


[AP — aide | = | ayer? —eP |< Yo. 
Thus there are at most y, log M indices à for which 
| A? — ajk” | > Ko. 


This completes the proof of (I). 


Í there exists a pair of distinct indices j and k such that 
, Å; G) = A, 
for all ¢—=1,2,---,N. Thus, by (3), 

(9) O [æa —ax(a)| <2 


In order to prove (II), suppose, if possible, that (ID) is false. Then 


' holds, for all 7 such that œA(a) S M. In particular (9) ‘holds if | is an 


index! for which 


| 1 1 
; : ty = M. 
(10) TM> a> M 


Now it may be assumed that ax > a; so that, by (7), ax Zaj HM, Then 





| eee) = aà (a) (as + M-34) 
end 80, by (9), “ei ; 


dA (dy) 2 (atà (ay) — 2) (ay + MA) = aja (as) - 
Hence, by (5) and (10), 


if M is sufficiently large, say M > ys. Thus 
Li | ae tA (ae) — aa (a) | & 8. 


ag (dy) = ay***d (as) + a M — 32 — 2 (aj + M-*/4) lay 


+ ay" (ay) M — 2 (a+ M), 


wA (a) +3 


This contradicts (9) (since by (10) ay) (Oe) < M) where one could write 


I oa I for l. This contradiction proves (II). 
The proof of Lemma 2 is now complete. 


|LEMMA 3. There exists, on the interval (4) a zero set Z which has the 


following property: if a is a point of (4) not contained in Z |then there is a 
postive B = B(a) suchi that tf M is any ficed number larger than B and if À 
` ig any number in (5), then there are at least fy. log M values of k which 


satisfy both conditions (6.1), (6.2). 


-thel interval (4) such that (6.1), (6.2) hold (for some À LA 


foriless than $y, log M values of k if M — 2>. Then, by Lens 2, 


meas Ty < 2% if 24 > ya 


Proof. For any positive integer h let Ta denote the set of points a on 


(a) in (5)) 
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Thus if Tu denotes for any fixed » > ys the a-set 2 f 
(11) Tæly= X., Ta then measly < dy. 
By 


It is clear from the definition of T, that if a is not in Ty and if M > w, then, 

even if M is not of the form 2* for some A, there are still at least $y: log M 
values of k satisfying (6.1), (6.2) for any value of À in (5). Thus if a is 

not in T then there is a 8 = 8(a) satisfying the requirements of Lemma 3; 

in fact one can choose f == u. Then the set of points a in (4) such that there 

does not exist a B— 8(a) satisfying the requirements of Lemma 3 is con- 

tained in TL for every positive u. Thus by (11), Z is a zero set. This 

completes the proof of Lemma 3. 


LEMMA 4. For every q > 0 there exists a p = p(q) > 1 and a zero set 
Z = La of a-values contained in the interval 


(12) 1<a<p(q) 


with the following properties: tf a is a point of (12) not contained in Za ` 
then there exists an a = a(a) > 0 such that if M is any fived number greater 
than a, and if À is any point of the interval (5), then there are at least q log M 
values of k satisfying (6.1), (6.2). 


Proof. Let a be a point in the interval 1 < a < 24 such that no integral 
power of a is a point of the zero set Z occurring in Lemma 3. Let pı, pe," , pr 
be those prime numbers such that 


Bc ama Caer. 
Now if v is such that af = 2 then, by the elementary inequalities of Chebyshev, 
there are two absolute constants y, ys such that 


(13) | HT 


Since a” (j == 1,2,---+,1) is in the interval (4) and not a point of Z, there 

are, by Lemma 3, for every À in (5), at least 4y: log M values of k satisfying 

(14.1) [Ase | < H, (14.2) | æla)a)] > Yo 

provided M > B(@). Thus, if M > ar B(ar:), there are at least 

siS 

An log M values of k satisfying (14.1), (14.2) for each + (= 1,2, : :,r). 
zlog M 
pip; log 2 

(aP )* = (Qriri/#)k < iy <M. 


But there are at most values of k such that 


Thus there are at least 
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: log M 
1StSi<r Pip; log 2 
values of & satisfying (6.1) and (6.2). Then by (13) the miber of values ` 
k which satisfy (6.1) and (6.2) is nòt less than 


ry: log M — 


dyrys —— log M — 443 ——— (log log M. 


Toz x y 


But tie expression can be made greater than glog M if s is chosen suffi- 
‘ciently large, i. e., if a is chosen sufficiently small, say a < p(q)! This com- 
- pletes the proof of Lemma 4 since Za may be defined to be the zero set of . 
points|a in the interval (12), some integral power of which is a point of Z. : 


THEOREM. For every positive integer m, there exists a positive 8—5(m) 
such that the set of points a of the interval 1 < a < 1 + 8(n) for which ` 


L(u, oa) == 0(|-u |"), . b> ©, 
does not hold is a set of measure zero. T 
-Proof. According to (1) 

3 œ 2 
a © D(u, ca) = I cos (u/a*), (a > 1). 
i : n=1 f 5 
Thus, üf u is in the interval a® < uss ax 

| Dee 

L{u, oa) < T cos (a? (u/a*)). 
4 fil. 
Now lbt à — u/a¥ so that 1 <à <2. Then 

! k | 
L a u àa )| = N (àa) |. 
| (uoa) < D | cos (A)| — IE | cos (aur)| 
. By Lemma 4, with M — u, if a is chosen in the interval (12) and not in Zą 


and ifiu > a(a) there are at least glog u factors in this, last product which 
are less than cos 7/30 so that . 


| L(u, oa) | < (cok /30) 110 «, u > «a(a). 


Since, according to Lemma 4, q a 0) can be chosen arbitrarily this completes 
the proof of the theorem. 
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ALGEBRAIC VARIETIES OVER GROUND FIELDS OF 
CHARACTERISTIC ZERO* 


By Oscar ZARISEI. 


Introduction. In an earlier paper (see footnote 18) we have derived a , 
number of characteristic properties of simple points of an algebraic r-dimen- 
sional variety V+. There the ground field K (field of coefficients, or field of 
constants) was assumed throughout to'be algebraically closed. In the present 
paper we generalize our results to any V, defined by a field 3 of algebraic 
functions over an arbitrary ground field K of characteristic zero. We do not 
assume that K is maximally algebraic in 3. | | 

Our generalization has an immediate application to simple subvarieties 
of V,, of any dimension. This application is given in the last part (V) of 
the paper. An irreducible s-dimensional subvariety V of Vr can be treated 
as a point P, provided we,pass to a new ground field K—a suitable trans- 
cendental extension of K in 3—and regard our V, as an (r —s)-dimensional 
variety Vrs over K. From our definitions it will follow that V, is simple 
for V, if and only if P is simple for V;.. The properties of the simple point 
P yield corresponding properties of the simple V.. It is this application 
that should justify (in the eyes of.a geometer) our consideration of ground 
fields which are not algebraically closed. 

Let é,’ ++ és be the codrdinates of the general point of Ve and let 
o denote the ring K[é&,---,é J]. An irreducible V, on V, is given by a 
prime s-dimensional ideal p in o. Let & be the quotient ring of Va, 
(Y= 0p, a/b eX if a,beo, bS40(p)) and let P= Ÿ-p be the prime ideal 
of non units of X. We define a simple V, by the condition that there exist: 
r — s elements m,°* +, 9r-.in X such that Y(m», °°", me) =P. The elements 
mare referred to as uniformizing parameters along V., or of Vs. Our main result 
concerns the characterization of a simple V, and of its uniformizing parameters 
with the aid of the different F”, of primitive elements win o. In this characteriza- 
. tion we start with an arbitrary set of r elements ¢,,---,{- in o such that o is inte- 
grally dependent on K[£:,:::,£;]. Let F’arbe the different of an element win 9 
if 1°" -t are taken as the independent variables. Just as a matter of 
arrangement of the indices it is permissible to assume that fı, > ', s are 
algebraically independent mod p. Let fifln’ ` +, te; doi) ==0 (mod p) be 
the irreducible congruence modp which sui satisfies over K(fi,-° > -, fs) 
(i= 1,2,-++,—98). We show that if there exists an elément w.in o such 


* Received September 28, 1939. | 
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_» that Ph 0(p), then Vs is siniple gid the r —s een hh 
aré uniformizing parameters of Vs; and conversely. ` 
An almost immediate consequence of this result is that the 
S of a simple V, is integrally closed in 3. 
The burden of the proofs résts naturally on the e case of 
We consider the residue class field K, of a point P, i.e. the fi 
This field is a finite algebraic extension of K. Let K* be th 
“extension of K which contains Kp. Upon extending the ground 





a ne 


number of points P*,,- - -,P*,. 


| 


roles ban) ' 


quotient ring 





+ dimple points. 
ld K, = 0/p. 
least normal 
eld K to K*, 


y variety V*, is obtained, and on V*, the point P splits into a finite 
The most difficult step of the theory is the 


proof that P is simple for V, if and only if the points P*; are simple for V*, 


and tf the quotient ring X of P contains the relative algebraic c 
3. 


losure of K in 
ith the aid of this result, the various theorems concerning the simple 


point P can be readily deduced: from the corresponding theorems concerning 


the points P*;, 
This reduction succeeds because at on point P*,; we have 
state of affairs, namely the residue class field at each point P*, 
the new yround field K*. This is therefore a special case of 
it is! characterized by the condition K, = K. This special case 
(Part III). 
determined by K. It is shown that this ground field extension 
any splitting of. the point P. We then use the results already 
the case of an algebraically closed ground field. 
The method just outlined necessitates a preliminary study o 
We 
van 
special case in which K is maximally algebraic in 3. 


could not take over directly the results established in this 


a very special 
coincides with 
our problem : 


is treated first 
Here we pass directly from K to the. algebraically closed. field 
does not cause 


established in 


Í the splitting 


of prenie ideals in o under algebraic extensions of the ground field (Part I). 


connection by 


der Waerden and Krull, because these. authors have only dealt with the 


The systematic study of simple points and of simple subvarieties under- 


taken in this paper is a necessary preliminary to the problem of local 


! 
u 


P 
I. Normal ground field extensions. 


1. 





T 
al 


' 


ebraic extension field K* of K and we wish to show how 


on. on algebraic varieties which we shall treat in a forthcoming 
aper f : 


Let & be a field and let K be a subfield of 3, of characteristic zero. 
e field K shall be referred to as the ground field. We coneidet a normal 
this extension of : 


the ground field defines a corresponding extension field of x, which we shall 


denote by =*, or by K*3. 


Let © be the algebraically closed field determined by K and let K’ be the 
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relative sabre closure of K° in $, i.e. the fell ‘consisting of ‘all those 
elements of 3 which are algebraic over K: The fields K’ and K* can be 
imbedded in 9. This imbedding is defined to within relative automorphisms 
of K’ and K* over K, but since K* is a normal extension of K, the intersection 
of K’ and K* is a subfield of K’ which is independent of the imbedding. Let 
this subfield be denoted by A. 

The elements of 3* shall be the formal finite sums £* = ab, ++ a* aga? 
a*,C K*, & CS, h-arbitrary. Addition, subtraction and multiplication are 
defined formally in an obvious fashion. We need a rule for identifying two 
‘ formal sums, and for this it is sufficient to give a rule for identifying a formal _ 
sum é* with the zero element of 3*. . Let ét = até +e o H até and 
let b*e --,0%, (b* C K*) be an independent A-basis of the algebraic 
extension field A(a*,,: : -, a*a) of A. If we substitute formally into the.sum 
a?i% +: < © + a*ré the expressions of.a*,,- + - , a*s in terms of b*1,- © >, b*n 
(linear forms with coefficients in A), we get an expression of the form 
bhim + H btan mS. To indicate this substitution we write: 
ét > biy + + Dim. We identify the element &* with the zero element 
of X*, if and only tf m0, ` -,m—0. It is self-evident that this identi-. : 
fication rule is independent of the choice of the base 6*,,---,5*,. More, 
generally, let c*;,: - -,c*m be a set of elements of K* ‘which are such’ that: 
(1) they are linearly independent over A; (2) the a*; can be expressed as 
linear forms of the c*; with coefficients in A. The elements c*; need not 
belong to the field A(a*,,- - -,a@*x). By condition (2) we get, through formal 
substitution: £* —> c*a +°- -p c%mem. We assert that £* — 0 if and only 
tf fi" + -sfn—=0. For the proof, let d*,,: - :,d*, be an independent 
A-basis of A (b*e + + ,b%4, M1," °°, 0%m) and let É*—> des + > -+ d*w. 
It is clear that b*im +: e + b*ann > d*o +--+ d*ywy and also 


. O. 
c*ab ++ CF mom —> d'u ++ d*yov. If 6%, = > kud*;, kij Cc A, 

j=l . 
then the matrix (4:;) is or rank n, since b*,,- - -, b*, are linearly independent 


# ld 
over A, and moreover w; = © kjij Similarly, if c*; = D 1,;d*;, then the 
: j=l jk 


matrix (ip is of rank m, and we have w; = Š With. Hence, if € = 0, i.e. 
gat 


if m =: + + = ya = 0, then o, = * - = o = 0, and since (l;;) is of rank m, 

it follows that f, =: > - == m == 0. Conversely, if & —: ` = fm = 0, then 

opm oy Ó, ie. Shiai 0, t= 1,2,---,2, and since the matrix 
3=1 


(kij) is of ranken, it follows that me: > - == na = 0, ie. é* = 0. 


1 We use small Greek letters for elements of = and small Latin letters for elements 
of K. The same letters with an asterisk denote elements of Z* and K* respectively. 
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Asan immediate consequence we have the following: if a*,,-|- -,a*, are 
themselves linearly independent over A, then a*,é, +: `- +a*ié= 0 tf and 
Sit ert | | 

it clear 


that. formal addition and multiplication of the élements of 3*, 


| consideved as formal sums, is consistent with out identification rule. Hence 


Z* is a ring. 


Le 


— 0, at 
remains irreducible in the polynomial ring X[x] ; in other words. 
degrees! [A(8) : A], [3(8):3] are the same. ` 
-factor 
Since 
are in K*. 


ff(z) in 3[2]. Let 0® = 9, 09,- 


Consequently, a, - 


algebraic over K, they must also belong to the field. K’. Consequently . 

or"; |, 6m C A, whence (T) = f(x), q. e.d. | 
By means of this Lemma we now show that &* is an integral domain. 

(ie. 3* has no zero divisors). Let £*)* — 0, £* = a*i ++): o- atm, 

q* == Dim + ©: + Dam, and let g be the relative degree lof the field 

A (afit - *,@"m, O*:,--+,6*,) with respect to A. Let 9 bel a primitive 

element of this field, satisfying an irreducible equation F' (9) — 0, of degree g, 

with coefficients in A. By our identification rule we have: 

(1) Et = Go + 0 -H + 471801 — 4(6), 

COR Bot Bt SC TO 


mA 1. Let 0 be an element of K* and let f(8) = 69 + mo +. ay 
|< A, be the irreducible equation for 0 over A. The: polynomial f(x) 


the relative 


Proof. Let $(2) = 2+ oa! +--+ + om, of C3, be an irreducible 
-,6™) be the roots of d(x). 
C K* and since K* is a normal extension of A, all the r 


oots of f(a) 


- - um C K*, Since the ws are in ¥'and are 


and by the same rule, the relation #(@) -y(@) = 0 implies that te polynomial 


(zx) y(x) is divisible (in S[x]) by F(z). By Lemma 1, F(z) 


is irreducible 


in 3[z]. Hence, either ¢(x) or y(r) is identically zero, i. e. either é* = 0 or 


7* = 0, which.shows that 3* has no zero divisors. 


of 3* 
entire 


field 3(6). 


. Remark 1. We call attention to the important role which the 
in the 
which 
of 3 which can be imbedded in K*. We would get the same field 
. A as ground field instead of K. 

f particular importance is the special case K == K’ (i.e. K is 
algebraic” in X, or K is algebraically closed in 3). In this 
K = A for every normal extension of K. 


It now follows immediately that S* is a field. In fact, every element £* 
is of the form (1), for some 8 € K*, and, by Lemma 1, 3% contains the 


field A plays 


definition of the field 3*. It is this field, rather than the ground field K, 
really matters in our construction. By definition, Ais the largest subfield 
E* if we took 


“ maximally 


case we have 
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Remark 2. The fields 3 and K* are subfields of 3* ? and have at least the 
field K in common. It is not difficult to see that &* is the smallest field having 
this property, i. e. any field T with this property contains X*. Our hypothesis 
is to the effect that the field T contains two subfields 3, and K*, simply iso- 
morphic to 3 and K* respectively, and, moreover, that the field K, which 
corresponds to K in the isomorphism between K*, and K* is a subfield of 33. 
It is then clear that the intersection of 3, and K*, must be the field A, which 
corresponds to A in the isomorphism 3 = 3,. Using the reasoning of the 
proof of Lemma 1, it is immediately seen that the join (3%, K*,) of the two 
subfields 3, and K*, of T is abstractly isomorphic to the field %*, and that this 


isomorphism induces the given isomorphisms between $, and 5, and between 
K*, and K*. 


2. Let o be an arbitrary subring of %, subject to the only condition: 
K Co. Let o* == K*o be the extended ring in 3*, i. e. the ring whose elements 
are of the form a*,€, +: - - + a*ngén, a*; C K*, & Co. Let A be the inter- 
section of o with A. Since A is an algebraic extension of K and since K C o, 
it follows that A’ is a field. 

THeorem 1. If A’==A, then 0*%ao—Y for any o-ideal Y. In the 
general case the relation o*X a o == Y still holds true if À ts prime. 


Proof. Let é==a*,é,-+-- : Lat,é,, af C K*, & C Y, be an element of 


o*M% oo, and let 8 be a primitive element of A’(a*,,---,a*n)/A’. Since 
A’ C o, we can write é in the form: 
(2) + Ê g + mO+ + + ma, 


where n C Y and where g is the relative degree of A(a*,,: : :,a*,) with 
respect to A’. Under the hypothesis that A’ =A, the elements 1,0,- © <, 077 
are linearly independent over A, and hence the equation (2) implies that 
É = moy == © t = ng- = 0. Hence £=0(A). This shows that o*&% a o C A, 
and since Y C o*Q, it follows that of M9 o = Y. = 
In the general case and for a prime ideal A, we proceed as follows. 
Multiplying (2) by 1,6,- : -, 697 respectively, we get relations of the form: 


* More precisely: Z* contains two subfields abstractly isomorphic to 2 and K* 
respectively, consisting of the elements a*.E +... Ha" nën in which a*,, 12,07 CK 
or fp - -sén C K respectively. 

3 The following example illustrates the possibility: g*Q ago = Y, if A x4’. Let 
K be the feld of rational numbers, K*—K(V2®) and let Z={K*(m). If we 
regard K as the ground field then the extension K—K* does not affect 2, i.e. we 
have 2=2Z*. Let p= Kis, œ. V2], Y =p: Then 0° = K*[o], ož = p*. v and 
ofM oo =p- (s,s. V2) N. Here the fields A and A’ coincide with K* and K 
respectively. 
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é= no + m0 +: : SE 701007, 
#0 — 90) + MO: +: ae 


- 607 tro ra m TES de pee, 
where dll the nj‘ are in I. Hence | yP — 8P E | = 0, where 
4, and &() == 1. This equation is of the following form: 
D G+ BP + + By 0, 

where Rie=0(%), 11,2, ::,g. Hence # =0(X), and since 





_we con¢lude, as in the first part of the proof, that £==O0(X), q. e. d. 


ideal X* in o* is said to Ke over an ideal A in o if 
W* a p = À is satisfied. It has been proved by Krull‘ that over 





18,40 = 0 if 


is prime, 


the relation 
every prime 


ideal p in o there lies at least one prime ideal p* in o*, provided that o* be 
integrally dependent on o (i.e. that each element of o* be integrally dependent 
on elements of o). This provision is satisfied in our case, since of = K*o and 
since K*, as an algebraic extension field of K, is certainly integrally dependent 


on K (KC 0). 


and K*,. the residue class fields of p and p* respectively, ie. 
fields of the residue class rings 0/p and o*/p* respectively. Since 
= Kp may be regarded as a subfield of K*, Moreover, K andi 
regarded as subfields of Kp and K*,. respectively. 


We consider a prime o*-ideal p* which lies over p and we denote by K, 


the quotient 


ep*ao = þ, 
K* may be 


Lamma 2. K*p is the extension field of K, obtained by the extension 


K> K* of the. ground field K; in symbols: Kya K*-K,. 


K in common. Hence, by Remark 2 of the preceding secti 
K* D K*-K,. On the other hand, any element of o*/p* is 
at ee + a*ném, at C K*, ți Co/p, This shows that the 


We observe that K* and K, are subfields of K*, having at ine the field 
n, we have: 


of the form 
ring o*/p*, 


and hence also its quotient field K*,s, is contained in the field (K*, Kp). 


Hence K*ps == K*K,, as was asserted. 


`8. Unramified character of the maximal o-ideals. We k the fol- 


lowing assumption : 


The field A is a finite extension of K. This assumption ‘is always satisfied 


if, for instance, K’ (the ne closure of K in 3) is itself 
tension of K. 


a finite ex- 


Under this assumption we prove the following fundamental theorem : 


tW. Krull, “Zum Dimensionbegriff der Idealtheorie” (Beiträge zur Arithmetik 


kommutativer Integrititsbereiche, III), Mathematische Zeitschrift, vol. 42 (1937), 


p. 749. 








} 
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- THEOREM 2. If pisa maximal o-ideal® then o*p is the intersection of 
the prime o*-ideals which lie over p. ; 

“For polynomial rings o = K[a,-- -,2,] this theorem is due to van der 
Waerden.® It appears as 'å special case of a generalized discriminant theorem 
proved by Krull for any pair of integral domains o, o* (o*-integrally de- 
pendent on o) under the hypothesis that o is integrally closed in its quotient 
field.” If we assume, as. it is permissible to do, that X is the quotient field 
of o, then Krull’s hypothesis in our special case implies that K’ C o, whence 
A= 4. The special case when o* is obtained from o by a separable extension 
of the ground field has been treated separately by, Krull in his report “ Ideal- 
theorie” (p. 40). However, also this treatment is based on the tacit assump- ` 
tion that the fields A and A’ coincide. Namely, under this assumption it is 
permissible to take A as ground field, since A= A’ Co, i.e. we may put 
A = K, and then our assumption A = A’ becomes: K'a K* —K. It follows 
then, by Lemma 1, that the Galois groups of 3*/3 and of K*/K coincide 
(i.e. every relative automorphism of .K* over K can be extended to a relative 
automorphism %* over X; note that %* is at any rate.a normal extension of 
3). One defines then in a natural fashion the concepts of conjugate ideals 
and of invariant ideals in o*. The proof by van der Waerden and its gen- 
eralization by Krull are then applicable, leading to the following theorem: . . 


THEOREM 2’ (van der Waerden-Krull). If A A’, and if K* ts a 
separable ettension of A, then each invariant o*-tdeal A* is the extended 
ideal of tts contracted ideal in o : U* == o*- (A* ao), and for each prime 

o-tdeal p? it ts true that o*p is the intersection of the prime o*-ideals which 
lie over p. . : 

We shall make use of Theorem 2’ in order to prove our more “general 
theorem for maximal ideals. 

_ Let A be the least normal extension of K which contains the field A, 
i.e. A is the join of A and of its conjugate fields over K. By our assumption, 
A is a finite extension of K. We introduce the intermediate field $ — AS, 
a finite algebraic extension of ¥, and the intermediate ring 5 — Ao, so that 
SÍS", pZ o0*. The ground field extension K—>K* is thus de- 
composed into two successive normal extensions: K — A, Am» K*. We have 
clearly the relations: 3* = K*S, o* = K*5, 


5 An ideal is maximal (or divisorless) if it is not properly contained in any other 
ideal, different from the unit ideal. 

° B. L. van der Waerden, “ Eine Verallgemeinerung des Bezoutschen Theorems,” § 5, 
Mathematische Annalen, vol. 99 (1928). 

TW.Krull, “ Der allgemeine Discriminantensatz. Unverzweigte Ringerweiterungen ” 
(Beiträge zur Arithmetik kommutativer IntegritAtsbereiche, VI), Mathematische Zeit- 
schrift, vol. 45 (1938). 

8 Not necessarily maximal as in Theorem 2. 
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We assert that the relative algebraic closure A’ of A in 3 is the field AK’. 
'- To show this, we first observe that A= Aa K’, and hence, by Lemma 1, 
. [Ar A] = [3:3] =g. Let & be an element of $ which is algebraic over A, 

‘hence also algebraic over K’. The relative degree [K’(%):K’] cannot be 
greater than the relative degree [%(@) :%], and since [£(4): 3] = [3:3] =g, 
` it follows that [K’(«):K’]<g. This last inequality ‘holds true for any 
element « in A’, and consequently [A’:K’] Sg. On the other hand, A’ 
. contains the field AK’, and we have [AK’: K’] = [A:A] — g, in view of the 
relation A — À n K’ and of Lemma 1. ` Hence necessarily A’ = AK’, as was 
asserted. 


` We now prove the relation: 
(3) A= K# a AK’, 


Let a* be an element of K*n AK’. Since A= K* n K’, we have [K’(a*) : K’] 
- = [A(a*): A]. Now we have just proved that [AK’: K’] = g. Since a* C AK’, 
we conclude that [A(a*) : A] Æ g, for any element a* in K*a AK’. Hence 
this last field is of relative degree << g over A. Since on the other hand this 
field contains A, and since [A: A] = g, the relation (3) is established. 

. The relation (3) says that À is the intersection of K* with the algebraic 
closure of A in 3. The ground field extension À —> K* therefore satisfies the 
condition of Theorem 2’, We therefore know that every prime ideal f in 5 
is the intersection of the prime o*-ideals which lie over p. Let us assume that 
Theorem 2 has already been proved for the ground field extension K —> A and, 
moreover, let us assume that there is only a finite number of prime 6-ideals 
which lie over,a given maximal prime o-ideal p. We will have then: 


op — [Po Pa pe > Pal. 
The ideals f; are also maximal in 0,° hence are two by two free from common 
divisors. . Therefore their intersection coincides with their product: 


(4) op = pipe: © En 
By Theorem 2’ we have 


o*i = [D* sa Pees Selg 


where p*4; ^ 5 = pi.: Since (Pa, bj) ==; if (25, we have also (0*p;, 0*f;) == 0*. 
Hence the product of the ideals o*p; coincides with their intersection, and 
therefore, +7 (4), 


° Since p is maximal, the ring p/p is a field. The integral domain 9/), is in- 
tegrally dependent on its subfield p/p and hence is also a field. Consequently p, is 
maximal. | 
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o*p = o*f, Ba: +: 0% 6m — [oh : -, 0* Bn] 
= [ph ph, ce, Dé, cc], 
which proves Theorem 2. 

Thus, to complete the proof of Theorem 2, we have only to prove it for 
arbitrary finite ground field extension K — A, and we have also to show that 
the number of prime 5-ideals which lie over p is finite. This we shall do in 
the following section. 


4, Merely as a matter of notations, we may identify A’ with K, since 
A Co. Let p be a maximal o-ideal and Jet Ky (== 0/b) be the residue class 
field of p. Let Kio À = A, whence K S A, € A, and let A, = K(#), where 
Ÿ is a primitive element of A, over K. Let f(ÿ) — 0 be the irreducible equa- 
tion, say of degree m, which @ satisfies over K. Since 8 C K, and K p ™ o/p 
(by hypothesis: p is maximal !), there must exist in 0 an element w such that 


(5) f(e) =0(p). 
If þ is a prime 0-ideal which lies over þ, then 
(5) f(w) =0(b). 


Since A is normal over K and since one root, ® = #,, of the polynomial f(x) 
is in À. all its roots are in À. whence also in 0. Hence, by (5), we must 
have o = V; (P), where #4 is one of the roots Dı,’ : :, Un of f(r). Let, say 
o= p, (p). We assert that — 

(6) p= (õp, o—%,). 


Let @ be a primitive element of A over K(0,), and let [A: K(%)] =n. 
Every element & of b can be written in the form: 


G = Go + GO > + Gnat, 
where 
Gi = Bin + CRUA +: + Zi, m10", Qij Co. 


Since o = ù, (P), we have: ai = aio F ano tHe > + Armio (p). The 
right-hand side of this congruence is an element of o. Consequently, in the 
homomorphism 6 = 6/ the elements &o, &1, ``" , &n-ı ate mapped upon elements 
of Kp (—0/p). Since K(9,) = Ap == K, À, the elements 1, 8,: + :,8"1 are 
linearly independent not only over K(Ÿ:), but also over Kp (Lemma 1). 
Consequently à cannot belong to p, unless all the elements a, &,° * *, %i-1 
belong to p. We have: @ = aio + ano + * + aime (0* (0—0) ), 


1 By K; n À is meant the intersection of A (normal finite extension of K) with 
the relative algebraic closure K of K in Ky in the same sense as À was defined by 
the relation: A = K*n K’. See Section 1. 
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and if @==0(p), then io Fano +: Re mi0 1=0(p), since ye Dao. 
This shows that if @;=0(p), then 4; = 0 (õp, o—%), and consequently also 
== 0 (dp, o—%,), which proves the relation (6). 
From (6) it follows already that the number of prime 5-ideals which lie 
over p is finite, since it cannot be greater than m ı (= [As K]). Let these 
prime ideals be ,,: + - , Pa, and let l 


(6°) Pi (Dp, o — v), ($= 1,2,-° es 
Since the f; are also maximal, we have 

(1) ef h p T (#99). 

Let Le it, (w— #4) and let us consider the at (5p, E). We assert that 


it is the unit ideal. Namely, in the contrary case let p be a prime ideal 
divisor of (5p, ¢). Since pC p and p is maximal, p must lie over p. Hence 
_ P must be one of the ideals f,,---,fa, say P= :. Now since Ẹ== 0 (F), 

one of the factors o— Ù, t==h-+1,---,m, must belong to fı, say 
o — n = 0 (pı). Hence #,—%s,,==0(f,), and this is impossible, since 
4 >= D and since ẹ, — Ôr is an element of the subfield À of 6. 


It is therefore proved that (Op, Ẹ) = 5. Consequently TI (o —®:)=0 (0p), 
since $ Il (w— #4) — f (o) =0(p). Comparing with (7) we find: 


[Pao a) fa] = op, 
as was asserted. 


.5. It can be shown by examples that Theorem 2 is not generally true 
for non-maximal ideals. For arbitrary prime o-ideals some weaker result 


.Let K be the field of rational numbers, ahd let E = K (V2) (@, y) , where a, y 

are independent variables. We put K* = K (v2), o= KE, 4,s], where s = V2. oy. 
We have E*=K*E == and 9*=K(V2)[o,y]. Let po: (y°— 2, s — 20). ‘Ob- - 
serving that every element of o can be put in the form f(æ,y) +e.g(æ,y), where 
fe, y), g(a,y) C Ke, y], itis a straightforward matter to verify that b is prime. 
It is not maximal, since it is contained in the prime ideal p(w, g, y*— 2). We have 
| o*p = 0" (y —2, V2. æ (y — V2) = [p*, p*,], where 
=o*(y—V2), pro" (my + V2). 
- The ideal p* lies over p. In fact, any element f(m, y) + eg(x,y), reduced modulo p. 
gives a residue of the form A(s) + yB(x) (since y? = 2(p) and s = 20(p)). Here 
A(@) and B(w) are in K[w]. Should this residue belong to p* > it is necessary that 
A(@) + v2. B(w) be identically zero. Hence A(w) + y¥B(a) is also. identically zero, 
and this shows that p* no—p.. However, the ideal p*, lies over the prime ideal 
o: (æ,r,y*—2) which is a proper divisor of p It is remarkable that in this example 
0° possesses even an isolated component p*, different from př, since p* 34 0( po): 
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can be established by the usual artifice of quotient rings. Let p be an arbitrary 
prime o-ideal and let $—0, be the quotient ring of pi? Let 9* = K*%. 
It is well known that the prime ideals in X correspond in one to one fashion 
to p and the prime o-ideals which are contained in p. If p, and $, are 
corresponding prime ideals in o and X respectively, then Bi = S pu p= — 
The prime ideal P == X- p is maximal in 3. By Theorem 2 we have therefore: 


(8) D = [B*, P*a dE 
where P*,, B*,,: : - are the prime ideals in $* which lie over $. 

Similarly, it can be shown in a simple manner that the prime ideals in 
3* correspond, one to one, to the prime ideals in o* which lie over p or over’ 
‘ prime multiples of p. The correspondence is again the one of contracted and - 
extended ideals? Let P*; ^ 0% == p*; and put 

o*p pees m* 
[p*, P*a “es ‘] =m". 

We have evidently: Y*m* — 3*m*, = [P%, P*a: --]. Let us assume that 
Hilberts basis theorem holds in o*. From the relation X*m* = X*m*, follows 
that for every a* C m*, there exists an element a in o but not in p, such that 
aa C m*. By Hilbert’s basis theorem there exists then an element £ in 0, 
not in p, such that 8m*,=0(m*). This shows that p*,, p*3,: - - are isolated 
components of m*. Since o*p°^ o = p, by Theorem 1, it follows that the 
decomposition of o*p into primary components is of the form t. 


o*p == [p*s, p*a es qd, '*5,° i J | , 
where the prime ideals p’*,, p’*2,-- - to which q’*1, q'*a,: > belong, lie over 
proper prime divisors of b. i 


6. The following theorem, which we shall have occasion to use in the 
sequel, gives a sufficient condition that o*p be prime, where p is now an 
arbitrary prime o-ideal, maximal or not. 


#0, consists of all quotients a/B, 4,8 C p, 850 (ph). 

The elements of X? are all of the form a*/a, a* C o*, a C p, «Æ0 p). : Let 
př be a prime 9*-ideal which lies over a prime multiple of p, and let P= "ph". Let. 
a*/a, B*/B be two elements in g" whose product is in p“ Then a*8*/aß = Y*/Y; 
where y* = O(p*), and therefore ya*8* = 0 (p*)- Since p? ngo= O(p), it follows that 
Y540(p"), and hence either a* or 8* is in p*, i.e. either a*/a or 8°/8 is in K*. 
This shows that p* is prime. Let a Cp" agp”, a* = p/p, 8° = 0(p* }. Then 

a*ß = ot (p X and it follows by the same argument that a* is in p* This shows that 
ng 
P H p “aortiti ‘then let a be an element of 9 which bip but not: amp; 
Since a a a unit in g5, it follows that. TP == CF, 
If R* is an Arbitrary prime ideal in y , and if p° =P" n o*, then any element 
a* /a in §R* is‘such that a* is in p? , since a is a unit in &*. This shows that P* =3" 
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Tuzorem 8. A suficient condition that o*p be prime is that A’ be the 
intersection of the fields Ky and K*, — 


Proof. Let p* be a prime 0*-ideal over p. Every element a* of o* can be 
written in the form: ¢* = a, + a,0-+- +--+ a%.,671+, where a; C o and 8 is 
an element of K*, of degree g over A’. If a* == 0(p*), then passing to the 
residue field K*, we find the relation: Zo + a6-+:--+ &,,@-1<= 0, where 
a C Ky. Now if K* a K, = A’, then, by Lemma 1, the elements 1, 0,- + -, 69 
are linearly independent over Kp. Hence & = 0, i.e. a4==0(p). This shows 
that «* is contained in o*p, consequently o*p == p*, q. e. d. 

Let K, be the largest subfield of K, which is algebraic over K, i. e. Ko is 
the relative algebraic closure of K in Kp. Let K* be the least normal extension 
of K which contains Ky and let o* = K*o. If p* is any prime o*-ideal over p, 
then, by a result proved in Section 2, we have: K*~—K*- Kj. In view of our 
choice of K*, it follows that this field is algebraically closed in K*,.. Hence 
if K*, is any normal extension of the new ground field K*, the condition of 
Theorem 3 is satisfied and p* will remain prime when we pass from o* to the 
ring K*,o*. This is true, in particular, if we pass from K* to the algebraically 
closed field determined by K. In other words: the extension K — K* causes 
the maximal splitting of p into prime ideals., 


II. Algebraic varieties over arbitrary ground fields. 


_.7. Let & be a field of algebraic functions of r independent variables, 
over an arbitrary ground field K of characteristic zero.# We do not assume 
that K is algebraically closed in 3. Let &,- > -, & be a set of generators of à, 
ie. 3— K(E,: «+, é), and let o = K[é,: + <, é] be the ring consisting of 
those elements of % which can be expressed as polynomials in &,-° +, én 
With the elements & we associate an irreducible algebraic r-dimensional variety 
V, whose general point has coordinates &,---,é. A point P of V, shall be 
associated with a prime zero-dimensional ideal pẹ in o. The geometric terms: 
“variety,” “codrdinates,” “ point,” are so far purely formal and conventional 

.expressions. To confer upon these terms a geometric reality it is necessary to 
imbed our V, in an affine n-dimensional space 8,4 over some field A15 The 
field A may be either K itself or an algebraic extension of K. Now the residue 

. Class field Kp, (== 0/po) of the prime zero-dimensional ideal pọ may very well 

be a proper extension of K (necessarily algebraic). Hence, in general, there 


14 We assume, of course, that E is a finite extension of K (of degree of trans- 
cendency r). : 

1 By the symbol 8,4 we mean an affine n-space in which every point has coördi- 
nates Gase o +, Op 4, A. 
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will not exist elements c,,- + >, Cn in K such that & == ci (po). In such a case 
our point P is not represented by a geometric point of SA . On the other 
hand, if we take for A some normal extension of K, for instance the alge- 
braically closed field determined by K, then the results of the preceding 
sections show that P may be represented in 8,4 by a set of points ° 

More generally, we associate with a prime s-dimensional ideal pẹ in o 
an irreducible algebraic s-dimensional subvariety V, of V,. The codrdinates 
of the general point of this V, are the elements €,’ ' -,&, upon which the 
elements é, : -,&, are mapped in the homomorphism o = 0/p,. The residue 
class field K,, (i. e. the quotient field of o/p,) is the field of rational functions 
on Vs: Kp, == K(&,-- -,&s). Given two irreducible subvarieties V, and V'o 
of V,, defined by the prime ideals pa and p'o respectively, we say that V, 
belongs to V’o if P'o = 0 (pa). 


8. Let V, be an irreducible s-dimensional subvariety of V, and let 
p= ps be the corresponding prime s-dimension] o-ideal. We consider the 
quotient ring 3 = op. The ideal -p= $ is prime and maximal in % and 
we have H^ o= p. The quotient ring Sp is evidently % itself, and the 
residue class field of P (—S/B) coincides with Kp. te 


DEFINITION. V, is said to be a simple subvariety of Vr if there exist 


r — s elements m1,° ° `, nr- in X such that: 
(9) | S: (n° j “s Nr-a) = $. 
Elements such as m,’ °°, shall be referred to in the sequel as 


uniformizing parameters along Ve, or at Vs. 

We shall see later that if V, is simple, then the uniformizing parameters 
qi can already be found in the ring o. Now if m,° © -,r-. are in o, then (9) 
is equivalent to the condition that p itself occur among the maximal primary 
components of the ideal o` (m,° > `, r-e)" ie 


(9) O° (q +, ore) = [p 


where the right-hand side is a decomposition of o- (m1,° © *, mr) into maximal 
primary components. 


18 This set is finite since the relative algebraic closure of K in K,, is a finite 
extension of K. See the footnote +! and the considerations at the end of Section 6. 

17 Concerning the relationship between the prime ideals in 9 and in & see Section 
5. To that we add that, more generally, there is a (1,1) correspondence between the 
Syideal Qf and those p-ideals q which have the property that each maximal primary 
component of q is a multiple of p. If 9 and q are corresponding ideals, then 
W=J-a a=Weo. If q is an arbitrary ideal in g, then the ideal g Ga differs 
from q only by primary components which are not Bish Di of p (such primary 
components are missing in the decomposition of ga X- q). 
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Our purpose is to derive from the above definition a number of char- 
acteristic properties of simple subvarieties. The results will be on the main 
generalization of theorems proved by us elsewhere ** for simple points in the 
case of an algebraically closed ground field K. Practically all the rest of this 
paper deals with simple points. Once the results for simple points are 
established, the extension of these results to simple subvarieties of any dimen- 
sion is rapidly achieved by the usual artifice of a transcendental extension 
of the ground field. 

Dealing with a point P of V,, given by a prime zero-dimensional o-ideal 

p, we shall proceed in the following manner. The residue class field K, is a 
finite algebraic extension of K. Let K* be the least normal extension of K 
which contains K, We take K* as new ground field and we pass to the field 
3* = K* and to the ring 0* = K*o = K*[é,,---,é:]. Regarded as ele- 
` ments of X* the elements é,,- : -,é are the codrdinates of the general point 
„of an irreducible variety V*,. Let p*:,- - -,p*; be the prime o*-ideals which 
lie over p and let P*,,- : -, P*, be the corresponding points of V*.. We may 
aay that these points P*; correspond to the point P, and that P splits into 
the k points P*, of V*, By Lemma 2 (section 2), the residue class field 
K*,., coincides with K*K,, and since Kp € K*, it follows that K*ps, == K*, 
Thus on the new variety V *, we now are dealing with points P#, (im 1,2, + +, h) 
which have the property that for each of them the residue class field K*,», 
coincides with the ground field K*. If we now pass from K* to the alge- 
bratcally closed field determined by K, then each prime ideal p*, remains prime 
(section 6), and it stands to reason that the results valid in the case of 
algebraically closed ground fields ** can therefore be carried over to the points 
P*; of V*,. For this reason we study first the special case in which K, = K. 
When this special case has been settled, the only thing left to do in the general 
case will be to study the finite ground field extension K — K*, where K* is 
the least normal extension of K which contains Kp 


II. Simple points. Case K, = K. 


9. Let p be a prime zero-dimensional ideal in o (== K[é,' - +, &é]) and 
let the corresponding point P of V, be a simple point, with yı’ * >, mr as 
uniformizing parameters, and such that the residue class field K, (= 0/p, since 
p is divisorless) coincides with the ground field K. Let K* þe the algebraically 
closed field determined by K, and let 3* — K*X, o* = K*o == K*[&,- é 


D = KES, where $ — op As was pointed out in the preceding section, the 


18 “ Some results in the arithmetic theory of algebraic varieties,” American Journal 
of Mathematios, vol. 61 (April, 1939), no. 2, pp. 249-294. 
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ideal o*p = p* is prime and lies over p. It is maximal in o*, hence is zero- 
dimensional, and defines a point P* of the variety V*,. 

Lemma 3. NF = op. | 

Pioof. Since the relation $* € 0%. is trivial, in view of the relation 
p* ^ o= p, we have only to show that 0*,. is contained in %*. Let a*/p* 
be an element of o*p, a*, 8* C o*, B* s£0(p*). Since p* is maximal, the ` 
ideal (o*p, 8*) is the unit ideal, i.e. we have a relation of the form 
(10) | A*pt = 1-4 Et. 
Here & is an element of o*p and hence can be put in the form: 

gt = fp + éb + Vds + É7-10:97, | 

where == 0 (p) and 6, is an element of K* satisfying an irreducible equation 
of degree g over K. If 6:,- - -, 6, are the conjugates of 6, over K and if we 


ar . 

multiply (10) by IT (1 -++ &:), where éf; = fo + &6;-+° Le £5-10;9"; we: 

get a relation of the form: | | | 
B*p* =1 +n, a 

where 7=0(p) and B* C o*. Hence a*/8* — a*B*/(1 m 7), and since 1+7 

is an element of o, not'in p, it follows that a*/B* belongs to $*, q. e. d. 

Let PF B*p = BP — J*p*, where P—p-op, By (9), we have 
(11) NE (n° +s ar) = PF, 


and since by the preceding Lemma the quotient ring 0*,» coincides with %*, 
it follows that P* is a simple point of V*, and that m,’ +, yr are uni- | 
formizing parameters at P*. Since K* is algebraically closed, we are in posi- 
tion to apply the results of our paper.1® | 

‘Since Kp = K, every element w of Y satisfies a congruence of the form: 
w==2¢(%),¢CK. In particular, let & == c (P), Cu’, CK. The point 
P is therefore represented by an actual point (¢:,---,¢n) of the affine 
SA%. We shall assume from now on that P is the origin of codrdinates in 
La, whence & == 0 ($), i—1,2,-- +n. | 

By (9), every element w of $ can be put in the form: o = 43m +: ° 

-+ Arr, Ay C 3. Let Ay == ca ($). Then 


(12) ; o= nm t + + + eer (R7). 


A congruence such as (12) holds true for any element w in $. Since 
P*— SEP, it follows from (12) that œ = am +- > : + crr (VH). We have 
proved in * that in this last congruence the coefficients c1, - : +, Cr are uniquely 
determined. From this we conclude immediately with the following: 
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; THEOREM 4. The coefficients c,''>, Crin (12) are uniquely determined 


and belong to K. The elements m,: ` - ,m are linearly independent mod PH 
. over K*, Moreover, we have the following relation: 
(18) ` | peaga Pr 


By a similar argument and observing that P? == $ (m°, mime °°, nr"), 
we find that any element w in satisfies a congruence of the form: 


w == Cy Org F Cum? nfs Cine Ae ‘+b Grrr? ($), 


where the coefficients c,,- - -,¢, are the same as in (12). Proceeding in the 
same fashion, we find, more generally, that for any element w in X there exists 
a formal integral power series: a, 


Yo + da + + bm + | 

where y, is a form of degree i inm,’ ` +, nr, with coefficients in K, such that 
(14) = + a+ + ym(B), m-arbitrary. 
Here ÿo=0,. if and only if w=—0(%). From (14) it follows that 
w= Yo + pi H: e H Yn (P), and we know that in this congruence 
the polynomial po + yi +` +--+ Ym ts uniquely determined by m1 Hence, 
if o= 0 (P=) , ‘then this polynomial must be‘identically zero, and consequently 
(5) RAY =P", m-arbitrary, | | 
a generalization of (13). : i ; 

. The result to the effect that the uniformizing parameters at P are also 
uniformizing parameters at P*, can be inverted. We show namely that if P is 


a simple point and if r elements w:,: ` * ,w. in o are umiformizing parameters 
at P*, then they are. also uniformizing parameters at P, i.e. §*(o1,° * - ,w,) 
_ = P” implies Jwr ` +, or) =P. Let 
(16) a= cum + + Carnr (P), 


c CK. It has been proved (2%, Theorem 1) that the non-vanishing. of the 
determinant | ci | is a necessary and sufficient condition in order that 
. @1,°* *,er be uniformizing parameters at P*. Hence |c | 340, and since 

the cs; are in K we conclude from (16) and (13) that m:,° * -, yr satisfy con- 
. gruences of the form: 


m= duo +: ++ dirwr (87), (i= 1,2, :-,r) 
i dy C.K. | 
Hence, by (12), every element w in $ satisfies a congruence of the form: 


w= bw +- + ++ rw, (B*), e CK. Denote the ideal Y: (wn + -, ar) by 
A., The above relation implies the following relation: 


(It) (GB) HB. 
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Since Q*2 = 8*, it follows that A has no prime ideal divisors in Ÿ other 
than P. Consequently À is a primary ideal, with Ẹ as associated prime 
ideal, and this, in view of (17), implies that % coincides with P,” as was 
asserted. 

We now are in position to prove the following 


THEOREM 5. There exist untformizing parameters ©, ` - ,w- at P 
which are elements of o and are such that o is integrally dependent on the 
ring K [wn * -, or]. Such parameters are furnished, for instance, by linear 
forms in &,° + +, & with non special coefficients in K. 


. Proof. Since the original uniformizing parameters m,° °°, yr are poly- 
nomials in &,°- ', én, and since P= J: (éu' : : ,Én), we have relations of 


the form: me Di cyé (P), t= 1, 2,---, 7. The r forms > cijé; are linearly 
j=l r J=1 
independent mod $B? (Theorem 4). Hence if e=} eim; (W), then the n by r 


matrix (e44) must be of rank r. If then we put w; = = ue), i = 1,2,°°°,7, 


then for non special constants wij in K the r-row monte matrix (wis) (ej) 
will be non singular. The elements o,° - : ,w, will then be uniformizing 
parameters at P*, hence also at P. 

In addition, by a well known normalization theorem of E. Noether, for 
non special wij; the ring o will be integrally dependent on K[w:,° `, or]. 
This completes the proof of the theorem. 


10. We have seen in the preceding section that if the point P is simple 
for V., then P? is simple for V*,. It can be shown by examples that the 
converse is not generally true?! We prove, however, the following 


THEOREM 6. Under the hypothesis Kp = K, a necessary and sufficient 
condition that P be a simple point of Vr is that P* be a simple point of V*, 
and that K be maximally algebraic in $ (K algebraically closed tn 3). 


Proof. The condition is sufficient. For if K is maximally algebraic in %, 


19 Let P: be a prime ideal divisor of Q. There exists in S* at least one prime 
ideal, say P*,, which lies over H, (Krull). Since P7, is a divisor of XY, and since 
S“ (= pr) is maximal, necessarily B*, =", whence 9}, =. 

#Let p be the exponent of Qf, i.e. let Pe = 0 (Q), E Assuming 
p > 1, we multiply (17) by Jet, getting Pet = (Y - Pa, Pe) = 0(9{), a contradiction. 

We refer to the example given in thé footnote’. Let p= g- (a, aV3), 
= p* (œ). The point P* is simple, and œ is a uniformizing parameter at P*. The 
quotient field Kp is obviously the field K. However, P is not a simple point, since 
the ring p/p? is a K-module of rank 2, while, according to Theorem 4, the ring p/p* 
for a simple point must be of rank r. In the present case we have r= 1. 


+ 
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then the relation $*A  $ = Y holds true for any $-ideal A (Theorem 1). 
If, in addition, P* is simple for V*,, then from the proof of Theorem 6 it 
follows that we can find uniformizing parameters ,,: ' -, wr at P* which are 
elements of 3. We will have then the relation: 3*(w,- © <, or) = %*. Since 
P*a == PB and since, by Theorem 1, YF + (wnt, or) NS =: (w1,+°*, wr), 
it follows that S- (w:,: > +, or) = P, whence P is a simple point of Vy. 

The condition ts necessary. Let m,° - -,9r be uniformizing parameters 
et P and let 6 be an element of % which is algebraically dependent on K. Let 
0 == ¢/B, a, B C X, and let 


B = Jem’ g ",9r) + Yon: . *,7r) +: Up 0, 


be the power series expansion for 8 (see preceding section, especially con- 
gruence (14)). The coefficients of these power series are in K. Since a = 08 
and 6C K*, the element a will have the following power series expansion: 
a = Op + Op +--+. Since the coefficient of the form 6p must also be 
elements of K, it follows that 0 is an element of K. Hence K is algebraically 
closed in 3, q. e. d.?? 

Let m,:°°°,7 be r algebraically independent elements in o such that o is 
integrally dependent on the ring K[m,-+:,r]. Let w be an element of o and 
let G(m,°°*,mr32%) be the norm of z— o with respect to the field K(m, +++, mr). 
Let moreover == c, (p), c C K. In the case of an algebraically closed field 
K we have proved the following (1°, Theorem 4): a necessary and sufficient 
condition that P be a simple point and that m— ci’ > *,9r—Cr be unt 
formizing parameters at P, ts that there should exist an element w in o such 
that olm, © +> 9r3o) ¥£0(p). Using Theorem 6 we are now in position 
to extend this result to the case under consideration (K,— K). 

Assume that P is a simple point and that the elements m1 — ©,‘ , 9r — Cr 
are uniformizing parameters at P. The elements yı — C1; *** , 9r — cr are then 
also uniformizing parameters at P*, and o* is integrally dependent on the ring 
K” [nr]. By the quoted theorem, proved for the algebraically closed 
field K*, there exists in o* an element w such that Folyt, yr; ©) 0 (p*), 
where Fm," :","r;2) is the norm of z—w with respect to the field 
K* (m5: ©" ,7r). More specifically we have shown (15, p. 269) that we may 
put o = vé +--+ + Unn, vs C K*, provided the coefficients v, do not satisfy 
certain linear relations with coefficients in K*. Hence, we may choose the v, 
in K, and we may therefore assume that w ts an element of o. The relation 
FF’, 3£0(p*) implies at any rate that F'(m,- * -+,9r;2) is irreducible (over 


33 An immediate corollary of Theorem 6 is the following: if K is not mawimally 
algebraic in È then the residue class field Ky for any simple point P of V, is neceasarily 
a proper algebraio ewiension of K. This is a special case of a more general theorem 
(Theorem 9) proved in Section 14. 
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K*) and that o is a primitive element of 3*/K*(m,---,r). By Theorem 6, 
K is algebraically closed in 3. Hence, by Lemma 1, the relative degrees 
[3*:K*¥(m,° +, 9r)],[8:K(m,°--,2r)] are the same. Consequently 
F(m,°**,9r32) is also the norm of z — o with respect to the field K (m, tt, mr), 
and since #”,5£0(p*) implies F’.,5£0(p), it is thus proved that our condition 
is necessary. 

Conversely, assume @’a(m,°-*,9r3@) 0(p), and let Fm, "",7r32) 
be the norm of z— o with respect to the field K*(m,---,9-). Clearly F 
either coincides with G or is a proper factor of G, according as the relative 
degree [2*7 : K* (m, * -, yr) ] coincides with or is less than the relative degree 
[3: Km: © +, 9r)]. In either case the relation @’.£0(p) implies the rela- 
tion Fu £0(p*), and hence P* is a simple point, with qı — ©," © ‘,9r— Cr 
as uniformizing parameters. To prove that P is a simple point of Vr, we 
have only to show, according to Theorem 6, that K is maximally algebraic 
in 3. Let K’ be the relative algebraic closure of K in 3 and let [K’: K] =g. 
Let 8, be a primitive element’ of K’ over K, so that K’—K(6@,). Since the 
relative degrees [X: K’(m,°° -,yr)] and [3*:K*(m,- + -,7r)] are the same 
‘(Lemma 1), F(m,° * *,7r3%) is the also norm of z — w with respect to the 
field K’(m,° © ` yr). Hence F is a polynomial in m,---,9-,2 and 0, with 
coefficients in! K: F == Fm --,r3 2301). If 8a, > +, 6) are the conjugates 
of 6, over K, then 


(18) G (m1, ° g "3r; 2) = Pv ` "97325 643). 


Let w=c(p),cCK (since K, =K). If we reduce the equation F(m,° : >, 
"ryw; 01) = 0 modulo p*, we get Fc, --,c;c;6)—0. Hence also 
F(¢,,: :,cr;c;6)=m0, 11,8, :,g, ie. 

(18°) Fm: one; 0) =0(p*), (i—1,2,:::,g). 
Now if g were greater than 1, then it would follow from (18) and (18°) that 
Guo = 0(p*), ie. Qo == 0 (p), since 44 C 0, a contradiction. Hence g == 1, 
i.e. K’ ame K, q. e. d. f 


Remark. Let o: (m,° © +, 7r) == LP, 41 92,° > +] be the decomposition of 
the ideal 9+ (m,° © *,7r) into maximal primary components (see (9’)), where 
we assume that m,° °°, yr are uniformizing parameters at the simple point 
P and that o is integrally dependent on K[m,---,y,r]. This last condition 
implies that the above primary components are all zero-dimensional. Let 
w= c(p). For algebraically closed ground fields we have proved (**, Theorem 
4), that the elements w such that @’.540(p) are characterized by the condi- 
tion: wzéc(p4), i=1,2, <, where pi, Pat ' > are the prime ideals to 
which qi, q2,- © > belong respectively. It is clear that this result holds true 
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also in the present case where K, == K. It is sufficient to take into account the 
relations: o*(m,° - +, qr) == o*p:o*qi: o*qe: > - == [o*p, o*qi, 0%q2,° * *]. 


IV. Simple points. General case. 


11. Let P be a point of V,, p the corresponding prime sion 
ideal in o. Following the plan outlined in Section 8, we extend our ground 
field K as follows: we pass from K to the least normal extension K* which 
contains the residue class field K,. We put: 2*— K*3, o* = K*o, 3* == K*S, 
where $= op. Since K* is a finite extension of K, there is only a finite 


number of prime o*-ideals which lie over p, say p*.,- - +, pa, and we have, 
by Theorem 2: 
(19) o*p = [p*1,° a +, px] == př," 5 pa. 


We denote by P*, the point of V*, defined by the prime ideal p*;. For 
the residue class field K*, we have the relation (Lemma 2, Section 2): 
KE = K*- Kp = K*, since K, € K*. Hence the residue class field at each 
point P*; coincides with the new ground field K*. If then P*; is a simple | 
point, we have, as far as V*, is concerned, the situation studied in the preceding. 
Part III. 


We prove the relation * = o* pe, C 0% pe °° OF pe, 


Proof. It is clear that %* is contained in each of the quotient rings 
0*,. Hence to prove the above relation we have only to show that if * is 
an element which belongs to each quotient ring 0*,., then 7 C &*. We can 
write në = 04/84, = 1,2, -,h, where a*:, Br Cof, B*;: = 0(p*i). 
Since the př; are maximal ideals, we can find, for each i = 1,2, > -, h, an 
element w*, in o* satisfying the congruences: w*; = 1(p*;), w*; = 0 (p*;), 
jet If we put y* == got, + +++ Latour, Of = Brio, + °° af B#r0%», 
then „ë = y*/8* and 8* = B*,5£0(p*,). We have thus found for y* a quo- 
tient representation y*/8* in which the denominator 6* is not in any of the 
ideals p*;,1<=1,2,---,h. But then the ideal (o*p, o*5*) is the unit ideal, 
and the rest of the proof is the same as that of Lemma 3 (Section 9). 

It now follows immediately that $* has h and only A distinct prime zero- 
dimensional ideals $, Pa, ++, fa which correspond to the ideals p*,,--:,p# 
respectively. Namely, let $*:= 0*,-, and let 8*, be the prime zero-dimensional 
ideal of 3"; 


(20) | PF —= Ni pt, p*; = oF a Pr. 

Then 

(21) Pi = Prin pr 3 (t— 1, 2,- j ‘,h). 
23 Let $ be a prime zero-dimensional ideal in %*, and let $ no—=p". The prime 


ideal p* ig zero-dimensional and must coincide with one of the ideals pip: ` “apy 


ALGEBRAIC VARIETIES OVER GROUND FIELDS OF CHARACTERISTIC ZERO. 207 


The quotient ring $*§, contains the quotient ring X*4, since o* e P; = ph. 
On the other hand we have 3*g, S 3*;, since P*, at, Hence 


(22) D ge = Ve OF 
12. The relations (20-22) are true for any point P, simple or not. Now 
we assume that P is a simple point and that q’ :, are uniformizing 


parameters at P. We have then X- (m°, =B =X- p. Hence, by 
(19) and (21), 3*- (ms n) =: ‘Pa, and consequently, in view 
of (22), Sa (ms © conr) == Pa This shows that each of the points P*i 
is a simple point of V*, and that m1,° °°, r are uniformizing parameters at 
P*;. The following theorem is in a sense the converse: 


THEOREM 7%. If Pisa simple point of Vr and if the elements w,''', or 
of X are untformizing parameters at one of the points P*;, then they are also 
uniformizing parameters at P. 


Proof. For the proof, we first establish the following relation: 
(23) Bra S— PB", m— an arbitrary.integer = 0. 


Since P” = 0(38*;"), we have only to show that any element a of Prima 
is contained in P”. Let yı’ -,yr be uniformizing parameters at P. The 
element æ certainly belongs to P, and since P= 3: (m,° °°, 7), we have: 
a = Ain, +: + Arr, A CS. If A, o, Ar also belong to P, then we 
can put æ in thé form: a == 2 Amm; Continuing in this manner we will 


ultimately get for æ an expression of the form: a = ¢s(m,° °°, r), Where 
és is a form of degree s in m,° °°, Whose coefficients Aq) (= 45,...5,) 
are elements of X, and the following are the only two possibilities: (a) etther 
s= m, or (b) s < m and not all the elements Aq are in P. In the case (a) 
we have «==0(8"), as was asserted. We show that the case (b) leads to a 
contradiction. Let us denote by $s‘ (= $s (m,° °°, 7r)) the reduced form 
obtained from ¢s(m,°*°,r) by reducing the elements Ai modulo B*; to 
elements of K*.24 By hypothesis the form ¢,°° is not identically zero in 
m'te Since a= 0 (*;™) and since s < m, we have obviously the 
congruence : 
ba (nay * ea nr) = 0 (PR). 


because any element which is not in any one of the ideals pry: ay P*a is a unit in x. 
Let, say, $3 a9” = p*, If a*/B is an element of $ where a" Co", BC o bse 0(p), 
then a* = 0(p*,), since 8 is a unit in $*. Hence M =9" -p*, g contains at least 
h distinct prime zero-dimensional ideals, namely the ideals P, = Bry X3=1,2,...,h. 
They are distinct, because B no*=*;nag*=p*p Hence the 4 ideals Speo 
i=1,2,-..,h, are the only prime zero-dimensional ideals in gr 

“t We recall that K* coincides with the residue class field of M, 
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This congruence is in contradiction with the uniqueness of the polynomial 
Vo + di ++ satisfying the congruence (14) (Section 9) (with B | 
replaced by $*;). The uniqueness of this polynomial, for a given element o, 
shows namely that no form of degree s in m,’ ` ‘yr, with coefficients in K*, 
can belong to %*,"". The relation (23) is thus proved. 

Now let w’, be elements of o which are uniformizing para- 
meters at one of the points. P*;, say at P*,. We have then the relation: 
1+ (or, + +, or) = B*. Let a be any element of $ and let, by (12) 


(Section 9): 

(24) ge oF a; ++ + H tror (B), oF CK 
Let, in particular, i 
(25) qi E= CM yyy + + c*iror (B*2"), tu C K*, 


(t= 1,2,°--¢,7r). 
Since the y*,; are uniformizing parameters at P, we have: 
vi = Aug +: + + Air, Au CS. 
Let Ai; = d*4;($B*,). Since the Ay; are in X, the elements d*;; are not only 
in K* but also in Ky We have: 
(26) eee dah H + em (BA), (TD ne 
The matrix (c*;) in (25) is non-singular, since 1," °°, 7 are also uni- 
: formizing parameters at P*;. Comparing with (26) we see that the matrix _ 
(c*4;) is the inverse of the matrix (d*:;), and consequently, since d*i; C Kp, we 
conclude that the c*4, also belong to Kp. Now, let a = d*i +--+ > d*rmr(B*,*).- 
The same argument by which the d*,; have been proved to belong to K, shows 
that d*,,- - <, d*, belong to Kp. Since c*; = $ d*sct 54, it follows that also 
j=l 
the coefficients c*, in (24) belong to K,. Consequently there exist in $ ele- 
ments Án ''',Ar such that A,=c*,($8*,), and for such elements the 
relation (24) implies the following: 

‘ a = Aww: +: i -+ Ayo, (B*1?). | 
Since a— A,w;—- + : — Aror C X, this last congruence, in view of (23), 
implies that: f 

a = Ayo, +: - : + Aror (P), AC. 
Such a congruence holds for any element a in P. Consequently 
(3: (o, nov ",or), p3) = P, | : 
. and therefore, by an argument used before (footnotes 19 20), 
S (eo > +, or) =P, 


ie. w *, 0, are uniformizing parameters at P, q.e. d. 


` , 
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18. Let EF; -2 Uist, t= 1, 2, to ST, Uj Cc K*, and let & =v", (D), 
oes a 3 ; 3 


v*;CK*, By Theorem 5, the elements é*;— v*; are uniformizing para- ` 
meters at P*,, provided the ui; are “non special,’ and moreover, 0* is 
integrally dependent on K*[é*,,- - -}é*,]. Since the values of the wi; to be 
avoided are those which satisfy certain algebraic re we may choose the 
ty in K, and sale we may assume that E*,,- - -,&*, are elements of o. ` 
The constants v*,,---,v*, are algebraic over K. Let fi(v*;) —0 be the 
irreducible equation over K which is satisfied by v*;, and let us consider the 
elements w; = f;(é*,), t—1,2,---,r. These are elements in o and 0 is 
integrally dependent on the ring Klo, - + +, or], since é*; is integrally depen- 
dent on K[o4]. Moreover, o;==0(%), since fi(é*;) = fi(v*s)(%) and 
filo) = 0. Let vi = vti ve s U*4,9, be the conjugates of v*, over K. 


gi : - ' 
We have: w; == fi(£*;) = ME — o*u). Since &*, — vti 5£0(*1), if 
1# 1, the product fice. she is a unit in the quotient ring & Hence . 


Ir (or, + S 0r) = SF, (8 0,5, Er uty) = g 
since é", — v1," °°, ee are uniformizing parameters at P*.. ‘It fol- 
lows that also w’ '*,er are uniformizing parameters at P*,, and conse- 
quently they are aaa ener also at P (Theorem 7). We have 
thus proved the following | 

THEOREM 8. If P ts a simple point, uniformizing parameters w`, or 
` at P can be found in such a fashion as to satisfy the conditions: (a) «Co; 
(b) o is integrally dependent on K[o1,- + <, or]. _ 

This is an extension of Theorem 5, except for that part of Theorem 5 
which asserts that the uniformizing parameters may be chosen as linear forms 
in the &. This part of the theorem is not valid, of course, in the general case. 

14. In this and in the following sections we wish to prove the following 
important theorem: , ac 

TEOREM 9. The quotient ring 3(—0,) of a simple point P contains 
the relative algebraic closure of K in 3. 

We shall need several lemmas. Let Kr denote, as usual, the relative 
algebraic closure of K in 3. 

Lemma 4, -K’ is contained in the residue class field K,. 

Proof. By the assértion K’& K, we mean the following. We know that 
the residue class field K*,«, at each point P*, (i—41,2,---,h) coincides - 
with the ground field K*. Since P*; is a. simple point, it follows (Theorem 6) 
that K* is algebraically closed in X*, whence K’ € K*. In the homomorphie 
mapping of *ı (= 0*,+,) ee K* (== o*/p*,), the elements of Y are mapped 

14 
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upon a set.of elements which form a subfield K, @ of Ks, co isomorphic 
to K,. ‘The assertion of the lemma is to the effect that KS K,, for 
te=1,2,---,h. 

The te is similar to the second part of the proof of Theorem 6. Let 
CK, 6—=4/B, a, BOY. Let B= pm" * +97) +: + +, ¥p 0, be the 
expansion of B at the point P*,, in terms of the uniformizing parameters — 
my’ sm at P. We have B= 0 (8%), whence, by (23), 2=0(B?). We 
therefore can write 8 in the form: B= ZAu ebp ort, ha + ep, 

: + 


Aa CX. The coefficients cs, 4 of the form Yp(m, ``, nr) are obviously 
the K*-residues of the elements Aq) mod R*4. ` Since the Aux, are elements 
of 3, we conclude that the ca, belong to K,“. For the element @ we will 
. have the expansion: ‘# — Op +: - -, and by the.same argument we deduce 
‘that the coefficients fcq, belonging to K,#*. Since not all the coefficients ce 
are zero, it follows that 0 is an element of K,, as was assorted. 

Lemma 5. If 8 C3 and tf m,- -, nr are untformizing parameters at 
the simple point P, then the power series expansion 
(27). B= Yo + Wim nr) + 
of B at-P*4 has all its coefficients in K,%. 


Proof.?5 That Wo is an element of K,(% is trivial, since 8 = Wo(*:) 
and BC S. We therefore use induction. We assume namely, for every ele- 
ment B in X, that the coefficients of Yo, Y1,° °°, Ym- are in K,, and we . 

prove that also the coefficients of Ym are in K,(#. ‘ 

Let po Zn... gma, fa bee + jpemo, be an eee form 


„Oof degree o in m,° * *,9r, whose coefficients Cy) are in Ky, Let A... 
be an element of $ such that 45...,,2=0c7...3,(@%). If we put 
a BA.. jm "md", then the expansion ‘of a at P*, is of the form: ` 
ee aire 


a = po + terms of higher degree. 


Let Qmm Do + gout: *. The form ġo,» depends in an obvious manner 
on the terms of degree y of the expansion of the various elements Aq. By our 
induction, the coefficients of these terms are in K,“, if y&m— 1. Hence 
the coefficients of bou,” t, osmi peng to Ke. By the same argument 


we can find! ht elements &,° * > &m2 in $ such that: 

| = OPFOR + 

where the coeffidients of the forms $1!) ,+ - ED na are in K, and 
pou + T°: +69 0, . (jm 1,2, < -,m— 2): 


| #We point out that in the course of the proof of the preceding Lemma we have 
incidentally established the truth of Lemma 5 io the coefficients’ of the terms of 
lowest degree of the expansion of 8. 
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If we put y = a +- @i -+> > -+ Gms, then the expansion of y is of the form: 
Y = $o + Josm1 + terms of higher degree, where Josm-1 is a form-of degree 
c+ m—1 in m,: ++, with coefficients in K,“?. We now take succes- 
sively for po the forms yı, Yz °°, Yma of (27). We get then elements 
Yo Ya" "> Ym Buch that yı — yı + gm + terms of degree > m, yj = Wy + 
terms of degree > m, j—2?,: : :m—1. Here the coefficients of gm are ele- 
ments of K,(9,. Let yo = 6, C K,” and let 


o = B — (y+ ye + * + Ym). 
The element w has the following expansion at P*;: 
(28) w = b1 + (¥m—.gm) + terms of degree > m. 
Let f(6:) =0 be the irreducible equation, of degree g, which 6, satisfies 


over K, and let Bay’ - +50, be the conjugates of K. We have f(w) = 
(w— 01) (w — 83) + + + (o — 64), and from (28) it follows immediately that: 
f(a) = (Wm — gm) f (61) (mod PF). 

Now f(w) is an element of 3 and (Ym— gm)f (61) is the set of terms of 
lowest degree in the expansion of f(w) at P*;. Hence * the coefficients of 
the form (Ym— gm)f’ (61) are in K,“. “Since 6, C K”, also f’(8) belongs 
to K,“, and since the coefficients of gm are in K,“”, it follows that the 
coefficients of Wm belong to K,‘”, as was asserted. 


15. Our next lemma concerns arbitrary integral domains in which every 
ideal possesses a finite basis (Hilbert’s basis theorem). Let © be such an 
integral domain and let p be a maximal prime ideal in ©. We consider an 
arbitrary ideal M in © and its decomposition into maximal primary com- 
ponents. Let q:,q2,* ` *,Qm.be the primary components of A whose prime 
ideals pi, Pas’ * *, Pm are multiples of p. Let gi, q’2,: * ~ be the remaining 


primary components of A; p's, p’2,- © -,— their prime ideals. Thus we have: 
A — [q Qu °°, Gm3 Fo W'2,° 7] 
pi ==0(b), f (i == 1, 2,° ` ,m);. 
pj 0(p), G=12:-::) 


_ Let q denote a primary ideal belonging to p and let A(X, q) be the intersection 
of all the ideale (X, q) as q runs through the totality of all primary ideals 
belonging to p. a 


Lemma 6. ACH, q) = [dis G2, ° i “s Om]. $ 
Proof. If we assume that the lemma is true for primary y ideals A, then 


„the, lemma follows in general. Namely, we have: A(X, 0S A (di, q), and 
- hence, by your assumption : | 
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(9) © Aa) S odes + +, ae] 


‘We put M = [q +, Am], W, sos Ken fatc]. Since p is maximal and 


, W,540(8), we have (2’1,q)—=G, for any primary ideal q belonging to p. 


Consequently, we can write: W, == (MW, Mq) = 0 (9, q), whence: 
(2) Las +; an] =M E A (Mq). 
From (29) and (2%) our lemma follows. i 
Let now Y be a primary ideal, and let P be the associated prime ideal. 


We may assume $ = 0 (p), because in the contrary case the lemma is trivial. 
` We denote by 8 our ideal ACU, q) and we first establish the following relation: ` 


(80) . (êp, A) = 
Since x= 0(8), the relation ; 

` (81). - (8p, X) = 0 (8). 

. ig trivial. Let 
(82) (3p, A) = [4, das Ta] 


be the decomposition of the ideal (8p, X) into maximal primary components, 
where we assume that the prime ideal p’; associated with q’; is 4 p, i= 1, 2,- 
and that q is either the unit ideal or belongs to p. Since Y = 0 (q), we Lise, 
by definition of 8, that 8==0(q). On the other hand dp==0(q’;), and since 
bp 0(p) it follows-that 8==0(q/s). Hence 8S (8p, N), and (30) follows 
in view of (31). ei ae 5 E 
By means of the relation (30) the proof. of the lemma is readily com- 
pleted. Let d,,---,dp be a basis for the ideal 8. By (30) we have the 


; F À | 
following set of relations: di == >) pud; + ai, i = 1, 2,: > +, p, where the pay 
j=l. s 


are in p and a), ' ',@p are elements of M. Hence 
(33) 2 (64 —pu) d= 0(8), 12,0), 


where 84; == 0 or 1, according as t4 j or i= j. The determinant A = [55 — pis| 
` is of the form 1 + p, p==0(p). Hence As40(p), whence a fortiori Ax40(). 
Since Y is primary, with $ as associated prime ideal, we conclude from (33) 
that d;,-:-,dp belong to A. Hence 8 = 0 (X), and since We=0(8), it follows 
that A(X, q) =ô = M, q.e. d. i 


16. Now at last we are in position to prove Theorem 9. Let @ be an ` 
element of K’. Since 3 is the quotient field of X, we can write 0 = a/B, 
a BC We may assume 8 == 0(B), because otherwise there is nothing to 
prove. Consider one of the points P*,,:--,P*;, say P*,. By Lemma 4 there 
exists an element y. in Y such that y:5=0(%"1). Let y=0 Hp P ty, LE... 
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be the expansion of y: at P*;, in terms of the uniformizing parameters 
75°" ‘59r at P. Here yi is a form of degree i, and its coefficients are in 
K,™ (Lemma 5). In particular, the coefficients of the linear form ÿ:? are 
in K,™, and therefore we can find an element yı in &§ whose expansion is of 
the form: yi == — yp, +--+. IE we put ya = yı + Ya then yo = 0 + y ® 
+ terms of degree > 2, where y® is a quadratic form in m,' * -, yr with 
coefficients in K,. Let y. be an element of X whose expansion is of the 
form: y’;==—. + terms of degree > 2, and let ya = ye + y'o Then ys 
is an element of % whose expansion at P*, is of the form: y;=6-+¥y, 
+ terms of degree > 3, and again all the coefficients in this expansion belong 
to Kp (Lemma 6). Continuing in this manner, we can find for each positive 
integer à an element y; in X whose expansion is of the form: yi = 0 + yi‘ 
+ terms of degree >i. Here yi‘ is a form of degree i in m,° °°, with 
coefficients in K,™. Hence y, = 0 (H*,*), and since a = 68, it follows that 
q — yip = 0 (P*,+). Now a—vyf is an element of X and we have proved 
earlier that $*,4- 3 == Pt (congruence (23), Section 12). Hence 


(34) a — yB=0(P!), (t= 1, 2,° . ar 


Let © be an arbitrary primary ideal belonging to $ and let p be the exponent 
of O. From (34), for i = p, we deduce a = 0 (8, ©), whence 


(35) a S A(8,0). 
. D 


Since $ is maximal, we may apply Lemma 6, where we put A = 3- £, p = P. 
In applying this lemma we must take into account that every ideal in $ is a 
multiple of H, whence the primary components q’1,q’e:- : of Y are never 
present in the case under consideration. In other words: in the present case 
our lemma asserts that the intersection of the ideals (8,21) is the ideal X: 8 
itself. Hence, in view of (35), we conclude that a C 3° 8, whence 4/8, i. e. 8, 
is an element of 3. This completes the proof of Theorem 9. 

The following theorem is a generalization of Theorem 6: 


THEOREM 10. In order that P be a simple point of Ve it is necessary 
and sufficient that: (1) P*,,: > -, P*a be simple points of V*, and (2) that 
the quotient ring 3 (= 0,) of P contain the relative algebraic closure K” of 
K in 3. 


Proof. We have already proved that the conditions are necessary (see 
Section 12 and Theorem 9). We prove that they are sufficient. The uni- 
formizing parameters at the simple point P*, may be chosen in $ (Section 
13). Let m,° °°," be such uniformizing parameters. We have 


(36) Sim 7) = BAL, 
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where $*, = 0%. We now use condition (2). Since K’ C %, we can 
apply Theorem 2’ to the ring Š (put o = X, A=A’=—K’). Let X? = K*S 
and let $2,---, Bo, oh (see (21)), be the conjugates of the prime ideal B 
under the relative automorphisms of S* over X. The intersection LB", Pol 
is an invariant ideal and its contracted ideal in % is P. Hence, by Theorem 
2’, E +, Bo] = SP, and consequently o =h, i.e. the prime ideals 

vy +, Ba in NF which lie over $ form a complete set of conjugate ideals. 
From (22) and (36) it follows that 8, is a maximal isolated component of 
the ideal (mm, > *,7r). Since this last ideal is invariant, it follows that 
all the ideals Pı, i—1,2,:-:,A, are maximal isolated components of 
3* (1 +, 9r)2* Taking into account the fact that Pa- + -, Pa are the only 
maximal prime ideals in 3*, it follows that S* (m°, mr) =- Pa = VIP. 
Hence, by Theorem 1, 

S (m tnr) = 8, 

and this shows that P is a simple point, q. e. d. 


17. In Section 10 we have extended to the case K, = K the theorem on 
the different @’,, proved in our paper.*® We now propose to prove this theorem 
in the most general case now under consideration. 

Let P be a point of V,, p—the corresponding prime o-ideal. Let m1,°°*, Yr 
be algebraically independent elements of p such that o is integrally dependent on 
the ring K[m,°-+,7r]. Given an element w in o we denote by G (m't, nr, g) 
the norm of z— w with respect to the field K (q3 © +, mr). 


THEOREM 11. A necessary and suficient condition that P be a simple 
point and that m,° ` >, 9r be uniformizing parameters at P, ts that there exist 
an element w such that G’o(m,° © *;mr10) £0 (p). 


We first prove that the condition is necessary. If K’ is the relative 
algebraic closure of K in 3, then since P is a simple point, K€ Ky (Theorem 
9, or Lemma 4). Let l 


[Kp: K] =m, [KK] =m [K*:K’] = 9, 


whence [Kp: K] == mu. Since K’C 3, the ideal $*@ decomposes into at 
most m prime ideals $, i. e. we have À = m (see Section 4, where we should 
put K = K’ =A, A = K*, whence Ap == Ky, since Kp € K*). On the other 


7° Hence 7,- - -,7, are also uniformizing parameters at P°,, -..,P*, Without 
the condition K C & this is not always true. For instance, in the example given in 
footnote ™ let P be the point given by the ideal g. (æ,y*— 2,2). The corresponding points 
P*,P*, on V*, are given by the prime ideals 9* (a, y — v2), o*(@y+ v2) respectively. 
The elements 7, = y*— 2, n, = 2 + 2a (= VS. 0(y + V2)) are uniformizing parameters 
at P*, but not at P*,. 


ALGEBRAIO VARIRTIES OVER GROUND FIELDS OF CHARAOTERISTIC ZERO. 215 


hand, since K’ is algebraically closed in X, the prime ideals of $* form a set 
‘of conjugates under the relative automorphisms of %* over 3. These auto- ` 
morphisms are extensions of the relative automorphisms of K* over K’.- If we 
now take into account the relation (6), or (6’), of Section 4, we deduce that 
h == m and that | 


(37) SB — [R +, Bn] — Pi - Ba 
(37) i 0$p = [p*,, bene Dm] = pt o ptm. 
Let F(n,---,r32) denote the norm of z—o with respect to the field 
K*(m,° © *,7r), where w is an element of o*. Our theorem is true for each 


of the simple points P*,,- +: > P*m, in view of the fact that at each of these 
points the residue class field coincides with the ground field K*.‘ Thus, dealing 
with the point P*,, we may assert that there exists an element w in o* such that 


(38) Folq: * + 9r3@) Æ 0(p*). 


With the aid of the Remark at the end of Section 10 we proceed to make.a 
judicious choice of the element œ. Let 0 == 6, be a primitive element of 
K,™, with respect to K *7 and let 69, i= 1,2,-.- -,m, f==1,2,---,p, be 
the conjugates of 8 over K. We choose the notations in such a fashion. that 
6,9, OaD,- - +, Om‘ is a complete set of conjugates with respect to the inter- 
mediate field K’. Let f(@) —0 be the irreducible equation, of degree mu, 
which 6 satisfies over K. Since 8 C.K, , there exists an element £ in o such 
that 


(39) | t= A, (ph), 
whence 
(39) - f(t) = 0 (p). 


Let . | qu , 
f 0° (m: © canr) = [p, 4° * +5 qe] 


be the decomposition of the ideal 0: (m,° : *, 1r) into primary maximal com- 
ponents (see (9’), Section.8), and let p.,- ~-, Po be prime ideals associated 
with the primary ideals Qu, ©- , qo respectively. The ideals pi, + `, Po are 
zero-dimensional, since o is integrally dependent on the ring K[m,° © conr] - 
We show that there exists an element w in o such that 


(40) - f(e) =0 (p), 

(40) Flo) £0 (p1), (i= 1,2, -,0). 
*K,( is a subfield’ of K*, simply isomorphic to Kp and is contained in the 

residue class field of the point P*,. This field has been first introduced in the proof 


of Lemma 4 (Section 14). The fields K,@, K,),- Ses K,™ are conjugate fields 
over K’, not necessarily distinct. ` | 


K 
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-We put, namely, w = £ + or, where c CK and ~ is an element of p but 
not in þa t= 1,2, -,0o. Since w=={(p}), the eonginence io) =0(p) 
follows from (39’). On the other hand we have: 

Co) =F(E+ cr) =f (0) +e af (E) + + ome 

(we assume that the leading coefficient of f (6) is h This is a polynomial ` 
in c with coefficients not all == 0 (p:), i=—1,2,: - : ,o, since a”t s£ 0 (p). 
Hence we can find the constant c in K so as-to satisfy the relation f(w) £0(p1), 
for all 4==1,2,---,0. We assert that the element w of o, constructed: in this ` 
‘fashion, anes satisfies (38). . Namely, let p*4,, p*42,° © - be the prime 
ideals in ọ* which lie over p;. The ideals Pot e Pp m, pi are then the 
. prime ideals associated with the primary components of the ideal 0*(m1,---, mr). 
Since p*2,° © *, P*m are the conjugates of p*i, it follows, by (39), that 


i 


(397). oH (pi), 
whence 
(41) 0 56 0,0 (p*,), ` (i—2,:::,m). 


Moreover, by (40’), we have f(w) Æ O(p*4y), 7 — 1,2, > +, whence, in 
particular, l i | 
(41) wo SO, (p*45), . (iæ 1, 8; 7403 j=1, CAS À 
_ The relations (397), (41) and (41’) imply (38),'in view of the remark at the 
end of Section 10. se | 
Since K’ is algebraically closed in ¥, it follows, by an argument repeatedly 
used before and based on Lemma 1, that the norm of z— with respect to 


the field‘K*(»:,- - +, yr) is the same as the norm of z — w with respect to the - . 


field K’(m,°°‘,9r). Therefore the coefficients of Fm, : :,"r,2) ate in 
K’. If F,’ + -, Fp denote the conjugate polynomials of F with respect to K 
and if G(m,: --,9r32) is the norm of z—w wtih respect to K(m,°--,7r), ` 
` then obviously : Fu i 

(42) En 593%) = PFa > + Fpa 


If we pat F(0,---,0;2) = $ (2), then $ (0) ==0(p*,), since F(m, ++, 1750) 
.— 0 dnd y; == 0 (p°). Consequently $(z) is divisible by z — 8,() (397). But 
(38) implies that p(z) is not divisible by (z —@,™)?. Since F(m,- ++, nr, 2) 
is invariant under the relative automorphism of 3° over X (its coefficient . 
being in K’), it follows likewise that (z) is divisible by z — 6,(*, but is not 
divisible by (2 —0™)?, i= 1,2, > +, m: 

We can also show that (z) is not divisible by z— 0%, for all j1 
and t==-1,2,---,m. In fact, if say (z) was divisible by z — 0, then 
6, would have to be the value of w at some prime ideal p* belonging % the 
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A : 
ideal o* - (9, © +, yr) ie. o= 6,2) (p*). This ideal p* could not be any 


of the ideals p*,,---,p*m, since o==6,%(p*,), 4—1,2,:: :,m, and 
6,2) mare Hence p* would have to be one of the ideals p*, Dis” °°, 
i= 1,2, > - ,o, considered above. This, however, would be in contradiction 
.with (40). | 


{ 


The polynomial ¢(z) done dos as follows: 


o(2) =T(@—&) yl), 


where ¢(z) and f(z) have no common roots. Let ¢;(2)(—F;(0,---,0,2)), 
j=1,2,---,—1, be the conjugates of ¢(z) with respect to K. We will 
have likewise: 


(2) =Å e—a) yla GLB aD), 
where again w,(z) and f(s) have no common roots. By (42) we have: 
G(0,- + +052) = (2): (2) ' ` ` pua(2) =f (2) A(z), 
where A(z) (= yy: + Ypa) and f(z) are relatively prime. Since 7;==0(p), 
we have: a= f (w) -A(w) +f(w) + A (w) (p), ie. To= f (v) -A(w) (p), 
since f (w) = 0 (p) (40). Now we observe that f (w) £0 (p), since f (o) =0 (p) 


and f is an irreducible polynomial, and.we also note that A (w). =£0(p), since 
A(z) is not divisible by f(z). Hence G’.5£0(p), as was asserted. 


18. Continuation of the proof. The condition is sufficient. From (42) 
we deduce in the first place that G’.£0(p) implies the relations F’.<0(p*s), 
i—1,2,---+,h. Hence from the hypothesis 4,54 0(p) follows at any rate 
that the points P*,,---,P*, are simple (by the special case K, = K; see 
Section 10). It remains to prove that K’ CS (Theorem 10). We shall prove 
the following stronger result: Y is integrally closed in X. , Since KC J and 
K’ is an algebraic extension of K; the property of Ÿ being integrally closed 
will obviously imply that K’ is a subfield of 3. Now to show that 3 is 
integrally closed in X, we consider the complementary module e ?? of the ring 
Kins: tyro]. If y denotes the relative degree [X: K(m:,° - -,ar)], then 
it is well known that the elements’ 

= 1/ Fu of Fa To ot / Fa 
form a module basis for e with respect to the ring K[m,: - : , mr]. Since 
se 0(p), it follows Peers that e ts sonicne in > Since e contains 


28 This assertion has been. proved in the case of an algebraically lone ground 
field (%, p. 263, footnote 13) and therefore is obvionsly true for any ground field. 

2° The module € consists of those elements č of Z for which the trace e T(E. a) is in 
K[a,,- + -+,%,], for every clement a in Kin - .,7 a]. 
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the integral closure of the ring K[m,' > :,7r]-and since o is integrally depen- 

dent on this ring, we conclude that Y contains the integral closure 0 of o. 

Let P be the. contracted ideal of P in 5: b—%n5. We have then 

3 = Spy — 55. On the other hand 05 D op (since Ẹ ^ o =— p), whence 5 D Ÿ. 

Corisequently X = 55, and therefore Y- is integrally closed (since for the 
integrally closed ring D it is true that the quotient ring of any prime 0-ideal 

is integrally closed). This completes the proof of Theorem 11. 

As a corollary we have the following . 


THEOREM 12. The quotient ring X of a simple point is integrally dod 
in ils quotient field. 

‘Remark. If P isa simple pik with m,° °°, nr as uniformizing para- 
meters, and if o is integrally dependent on K[m," - “, nr], then the elements 
w of o such that @’.s40(p) are characterized by the relations (40), (40), 
where the irreducible polynomial f(w) must be of degree mp == [K,:K]. This 
follows from the first part of the proof of Theorem 11 (Section 17) and from 
the remark at the end of Section 10. | 

. In Theorem 11 we have assumed that the elements m,: ` +, yr are in p. 
We now drop this assumption, and we consider the uniquely determined 
irreducible polynomials f; (z) in K[z] (i= 1, 2,---, 1) such that fi (yi) =0(p). 
We can easily prove the following stronger theorem: 


THEOREM 11’. A necessary and sufficient condition that P be.a simple 
point and that fi(m)," °:,fr(nr) be uniformizing parameters at P ts that 
there exist an element w in o such that olnu’ `< ,7r50) £0(p). 

That the condition is necessary follows almost immediately from Theorem 
11. Namely, let us put Li fi(m) and let H(£:,:::,6r;2) denote the 
norm of z—w with respect to the field K(&,---,%-), while G(m, °°: amia) 

. denotes, as before, the norm of z — w with respect to the field K(m,° ++, mr). 
Since this last field contains the field K(&,- - -,¢,-), it is clear that we have ~ 
an identity (in z) of the form: 

A(s,° + abr) = G (no +5973 %) Alm,’ © anr), 
where A is a polynomial with coefficients in K. Since Em: -nr w) = 0 
we have 

Hollo: >, PIF = olm os “35 ) Alto i +, 9r5 0). 
Now, by hypothesis, &,- - `, ĉr are uniformizing parameters at P, and more- 
over, o depends integrally on the ring K[Z:,- - >, ĉr], since each element y: 
is integrally. dependent on K[£:]. Hence, by Theorem 11, we must have 
H’.540(p), for a suitable element w in 0. The above identity shows then 
that we must also have G’.50(p), as was asserted. ` | 


>$ 
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The proof-that the condition is sufficient is direct and follows the lines 
of the proof of sufficiency given in the special case of Theorem 11. . The rela- 
tion ŒuséO0(p) implies the relations F’a(m,: : :,mr;w) £0(p*:), for 
t—1,2,---,h.. Hence P*,,- : +, P*a are simple points and 7:—4,,- °°, 
nr — 6, are uniformizing parameters at P*,, where the 4” are elements ` 
of K* and nj = 0; (p*,) (Section 10). Since fi(n) =0(p) ==0(p*s), 
8, ® must be a root of f;(z). Since f;(z) has no repeated, roots, it follows 
that f;(;) differs from y; — 9;‘” by a factor which is a unit in the quotient 
ring. 3*4 of the point P*;. Hence film), © :,fr(mr), Le. °° +, êr are 
also uniformizing parameters at P*, (t = 1,2,---,h). It remains to prove 
that K’ CS, since from this it will follow that P is a simple point (‘Theorem 
10) and that {,: - >, €r are uniformizing parameters at P (Theorem 7). The 
rest of the proof is the same as before, namely it is shown that X is integrally 
closed in 3. 


COROLLARY. If V, is a linear space, 1. e. tf o is a poisoned ring, then 
every point of Vr is simple. 

In fact, if o— K[é,--:, é], where &,°**,é- are algebraically indepen- 
dent elements, then the norm G@ of z — w with respect to the field K(&,---, ér) 
is 2—w itself, whence G’,, = 1. 


V. Simple subvarieties. 


19. Let V, be an irreducible simple subvariety of V,, of dimension s. 
Let p be the corresponding s-dimensional prime ideal in o and let X = op. By 
definition, there exist r — s elements in X, say 71,° °°, r-a, such that 
(43) D (mu + 5 ares) = B= Bp. 
Let ¿i> © +,%2 be elements of $ which are algebraically independent mod p 
(with respect to the ground field K), but otherwise arbitrary. We take as 
new ground field the field Q — K(£,: -> , £.) and we put 0—Q:0, P = op. 
Since Q is a pure transcendental extension of K, p is prime. It is of dimension 
` zero, since the £’s are algebraically independent mod p. With Q as new ground 
field, the elements é` - -,é, define an (r—s)-dimensional variety V+. of 
which they are the codrdinates of the general point. The ideal p defines a 
point P on Fa- The quotient ring § — 55 coincides with X, since the ele- 
ments of Q are units in Y. Hence, by (43), 
(43°) 3° (mu a Nra) =% p. 
i.e. P is a simple point of Vr -3 


Conversely, assume that P is a simple point of Vi». There will exist 
then elements m,° © -, yrs in %, i. e. in X, satisfying (43’). The relation (43) 
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is equivalent to (43’). Hence V, is a simple subvariety of V.. Thus the 


assertions: V, isa simple subvariety of Vy; P is a simple point of V,s—are 
equivalent. 


THEOREM 13. The quotient ` ring V— Oy) of a simple subvariety ts - 
integrally closed in 3. 


The theorem is an immediate consequence of Theorem 12 and of the fact 
that J= $ — 55 


THEOREM 14. If Va is a simple subvariety of Vr, the untformizing 
parameters qı, °°, nr- along V, and the elements ĉi, > -,£s algebraically 
independent mod p, can be so chosen that they be elements of o and that o be 
integrally dependent on the ring K[,- © * ; mr-s3 Gus © 5 fe]: 


Proof. Let 6 — 0/p == K[E,: : <, Ex], whence 6 is of degree of tran- 
scendency s over K.' Subject to a preliminary linear homogeneous transforma- 
tion on é,,° © <, n, with coefficients 4, in K, we may assume that the following 
_ conditions are satisfied. 


(a) éno’ ++, & are integrally dependent on K[é,: = -,&]; 
(b) Eou- -E are integrally dependent on K[E,,: : -,&]; 


(e) fé" © >, és; én), t= 1,2, < :,r— 8, are uniformizing para- 
meters along V,, where f, is an irreducible polynomial in &,,- + +, ée 64; with 
coefficients in K. | 

The- possibility of satisfying conditions (a) and (b) is trivial. As to 
. condition (c), we observe that by (b) the elements é, - -, & are algebraically 
independent mod p. We put ii, += 1,2,---+,s. It then follows from 
the proof of Theorem 8 Sn 13) that for “non special” uz, certain poly- 
nomials f4(ésu), t= 1, 2,---,7—8, with coefficients in O(—K(é,,---, s)). 
will be uniformizing pavamatans at the point P of Vs, hence also along Va. : 
` The values of the wu; to be avoided are those which satisfy certain. algebraic 
relations with coefficients in Q. Since K contains infinitely many elements, 
the us; may be chosen-in K. We may assume that the leading coefficient 
of fe ig 1. If we put m = filé," e, éj éni), $= 1,2, < -,r—s, then 
Es ste (~ ént, £s) are algebraically independent mod p and 9, °°" ,7r-e 
are uniformizing parameters along Vs. Since m=0(b), we find, passing to 
the ring 3: fé," °°, &3 out) — 0. Since f; is irreducible, it follows by 
condition (b) that the coefficients. of f, are polynomials in £é, > +, é. Hence 
és is integrally dependent on K[&é.,- - -,é3],t—=1,2,---,r—s, whence 


ALGEBRAIC VARIETIES OVER GROUND FIELDS OF CHARACTERISTIC ZERO. 221 


f° © >, are integrally dependent on the ring K[f:,° - -,£e5 mo" nra 
This completes the proof of the theorem, in view of condition (a). 


20. Letno’ - +, be algebraically independent elements of 3 such that 
o is integrally dependent on the ring K[m:,: : -,9r]. The residual class ring 
0 == 0/p will depend integrally on the ring K[%,: - -, qr], and since 5 is of 
degree of transcendency s over K, s elements 7 have to be algebraically inde- 
pendent. Consequently s of the elements 7 are algebraically independent 
mod p. We assume that ,- - `, are algebraically independent mod p. Let 
fi (gy > 985 7244) == 0 (p) be the irreducible congruence which 7, satisfies over 
Kms '";%]. Let moreover, F(m:,°:+,9r3z) be the norm of z—a(wC 0), 
over the field K(m,' ++, 9r). As a generalization of Theorem 11’, we prove 
the following 


THEOREM 15. The existence of an element w in o such that Felo, 
qr; ) S20(p) ts a necessary and sufficient condition in order that V, be a 
simple subvariety of Vr and that fi(n, "me men)"; from", 783 7) 
be uniformizing parameters along Vs. 


The theorem is an immediate consequence of Theorem 11’. It is sufficient 
to observe that Fm, - +, 7r32) is also the norm of zo with respect to the 
field O (s41 * ‘5 9r), Where Q == K(m1,° * * , 75). Moreover, as was pointed 
out in the preceding section, the subvariety V4 of V, and the point P of Via 
(the elements yı, * * `, 9. now play the rôle of £,,- > >, €s) are both simple or 
not simple at the same time, and that uniformizing parameters along V, are 
also uniformizing parameters at P, and conversely (see (43) and (43’)). 

An immediate corollary of Theorem 15 is the following: tf V, contains 
a simple point P of Vr, then V, itself is a simple subvariety. In fact, if Po 
is the prime zero-dimensional o-ideal which cotresponds to the point P, then 
p= 0 (po), and G’4340(p.) (Theorem 11’) implies the relation u ##0(). 

We can invert this result. We show namely that a simple subvariety Va 
contains at least one simple point of V,. Using the notations of Theorem 15, 
if Va is simple, then G’.s£0(p), for some w in o. We can therefore find a 
prime zero-dimensional divisor pọ of p such that G’.40(po).2° If P is the 
point of V, defined by po, then P is simple (Theorem 11’) and lies on Vs 
(since p==0(po)). 


THE JOHNS HOPKINS UNIVERSITY. 


“If we pass to the ring 9/p, then our assertion is equivalent to the following: 
if a <0, then there exists a zero-dimensional prime ideal which does not contain a. 
The proof of this assertion is straightforward. 


ASSOCIATIVE MULTIPLICATIVE SYSTEMS.* 
By J. E. EATON. 


1. Introduction. Grouplike systems with non-unique multiplication 
were first studied by Marty in 1934.2 Others who have been interested in such 
systems are Wall, Kuntzmann, Ore, Griffiths, and Krasner.? In 1938 Dresher 
and Ore.* undertook an axiomatic investigation of the general properties of 
such systems which they called multigroups. 

Many of the results of Dresher and Ore were concerned with the relation 
of subsets of the multigroup to the multigroup itself. In this paper we shall 
extend some of these results to algebraic systems with a multivalued associative 
operation multiplication which however need satisfy no quotient law. As in 
multigroups, coset decompositions of the system are possible, and we may 
characterize completely the homomorphisms generated by such decompositions. 
Normal subsets of the system exist which have interesting structure properties. 
In particular for these subsets we may enunciate a Jordan-Hôlder theorem. 
The theorems derived in this paper in a few instances represent improvements 
in the known theorems of the theory of multigroups. 


2. Definitions. An associative multiplicative system is an algebraic 
‘system in which there is defined a single binary operation multiplication. We 
shall for brevity. throughout this paper refer to such a system as an m-sysiem, 
We shall denote by W an arbitrary m-system and by Mı, me,- * - the elements 


* Received April 26, 1939. 

1 F. Marty, (1) “Sur une généralisation de la notion de groupe,” Huitième congrès 
des mathématiciens scandinaves, Stockholm, 1934, pp. 45-49; (2) “Rôle de la notion 
d’hypergroupe dans l’étude des groupes non abéliens,” Oomptes rendus, vol. 201 (Paris, 
1935), pp. 636-638; (3) “Sur les groupes et hypergroupes attachés & une fraction 
rationelle,” Annales de Véoole normale, 3 sér., vol. 53 (1936), pp. 82-123. 

2H. S. Wall, “Hypergroups,” American Journal of Mathematios, vol. 59 (1937), 
pp. 77-98; J. Kuntzmann, (1) “Opérations multiformes. Hypergroupes,” Comptes 
rendus, vol. 204 (Paris, 1937), pp. 1787-1788; (2) “Homomorphie entre systèmes 
multiformes,” Comptes rendus, vol. 205 (Paris, 1937), pp. 208-210; (3) “ Systèmes 
multiformes et systèmes hypercomplexes,” Comptes rendus, vol. 208 (Paris, 1939), 
pp. 493-495; Oystein Ore, “Structures and group theory, I,” Duke Mathematical 
Journal, vol. 3 (1937), pp. 149-174; L. W. Griffiths, On hypergroups, multigroups, and 
product systems,” American Journal of Mathematics, vol. 60 (1938), pp. 345-354; 
M. Krasner, “Sur la primitivité des corps §R-adiques,” Mathematica, vol. 13 (1937), 
pp. 72-191. 

3 Dresher and Ore, “Theory of multigroups,” American Journal of Mathematios, 

vol. 60 (1938), pp. 705-733. We shall in the following cite this paper as D. and O. 
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of M. We make no assumption on the ‘finiteness of the number of elements 
‘of M but we shall suppose, that they are at least enumerable. The multiplica- 
tion which is defined in an m-system is subject to but two axioms: the existence 
of the- product of any two elements and the associative law. 


AXIOM 1. The Product. If m, and my are any two elements of Mt, then 

the product mamy is a non-void subset of Pt. 
mmy — {m’, x}. 

The existence of the product of any two elements of M permits us to 
give meaning to the notion of the product of any two subsets of M. HA 
and # are two non-void subsets of Yt with elements {a} and {b4} respectively, 
then an element m of Y is in WB if and only if m is contained in some product 
a;bx. The, relation of containing we shall symbolize in the usual manner. 
If N and B are any two subsets of M, MW D B shall mean that every element ` 
of B is an element of M. 

Axiom 2. The Associative Law. If mi, my, mx are any three elements 
of M, then 
lacie = m (mm) = mmmy. 


These products have meaning according to the definition of products of subsets. 
Kuntzmann * has noted several weaker forms of this associative law which 
‘are of interest in systems in which a non-unique multiplication: is defined. 
We shall have occasion to refer later to one of these. 
| -AXIOM XY. The Left Roue Law. If my, my, my are any three ele- 
ments of WM, then 
(mym;) m, =) mi(m;me) 7 


- A multigroup is defined by Dresher and Ore* to be an m-system satis-_ 
fying the following quotient law. | : 

Axiom 3. The Quotient Aviom. For any ordered pair of elements 
my there exist at least two elements x and y such that 
; zm, D mj miy D my.. 

3. Representation. Before we develop some of the properties of an 
m-system it would be well to derive a representation of such a system. Let M 
* be some multiplicative system satisfying Axiom 1 (we do not assume it is 
associative) and consisting of elements {mi}. To each m, associate a matrix 
M, defined in the following manner. If mm; D mr, then M; has e, the unit 
of a Boolean ring, as its k, j-th element. The remaining elements of M, are 


t Op. cit., 1, p. 1787. 
FD. and O., pp. 706-707. 


224 à Lie A E. EATON. 


zeros. We call the set of Ms the left regular matrix association® of M. 
‘ Similarly M, is an element of the right regular matrix association of M if 
when mym; D mx then i, has e in its k, j-th place. We say that a matrix 
association is a representation if when mom; — {mx}, then MM; = 3M, 
where multiplication and addition of the matrices is defined in the usual 
manner. We then have the following theorem. l 


THEOREN 1. The necessary and sufficient condition that a multiplicative 
' system W be associative is that both its left and right rogni matris associa- 
tions are representations. 


Proof. (i) Necessary., Let a, b be elements of Dt, ab = {c:, Cnt * ‘}, 
-and À, B, Ci the corresponding matrices of the left regular matrix association. 
Let there be e in the i, k-th place of AB. Then for some j there is e in the 
i, j-th place of A and in the j, k-th place of B. Hence there is an my such that 
am; D m; and bmx D my. Then abm D mi. Hence for some cs, ime D Ma 
. Therefore some C; has e in its &, k-th place. . Conversely let there be e in the. 
4, k-th place of some C. Then cm, mi. Hence abmx D mı. Then for some 
ms C bmx, amy D my. Hence A has e in the 4, j-th place and B has e in the 
j, k-th place. Therefore AB has e in the i, k-th place. 
(ii) Sufficient. Let the left regular matrix association of Yt be a repre- 
' sentation. Consider m, (mime) D mr. ‘Then for some ms C mymz, mim, Dmr. . 
Hence M; has e in the s, k-th place and M4 has e in the r,s-th place. Then 
MiM; has e in the r,k-th place. This implies that for some m C mim, 
mm: D m. Wo- then have mi(mjmxz) C (mamy)mx. The reverse inequality 
is obtained if the right regular matrix association is a representation. Hence 
we have proved the theorem. However the left regular matrix association 
being a representation does not imply the right is. This may be shown by: the 
system of two elements, a and b, whose eee scheme is: aa@—==@; 
ab = b; ba =a, b; bb =b. : 


. COROLLARY. The necessary and sufficient condition that a. multiplicative . 
system be left associative (satisfy Axiom 2’) is that its left regular matrix 
association is a representation. 


4, Coset decompositions. We shall say that any subset W of an m- 
system is a subsystem if Y D AH. We then have immediately that A itself 
is an m-system. The subsystems with which we shall be primarily concerned 
in this paper are the left reversible subsystems.” A subsystem A of an m- 
system is left reversible if when Am; D my, then Um; D mi Similarly À is. 


° Cf. Wall, op. oit., p. 86; D. and O., p. 709. 
1 The ds: of reversibility: was introduced in D. and O., p. 118. 
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right reversible if when m4 D mj, then mA D ma The set Am; we call a 
left coset of Y. The interesting feature of left reversible subsystems of an 
m-system is that their left cosets constitute a partition of the m-system in the 
precise sense stated in the following theorem.® 

THEOREM 2. Let be a left reversible subsystem of an m-system Wt. 
Then W has a unique left coset expansion Nt == Wm, + Am +- - - such that 

(i) any element tn a coset génerates the same coset; 

(ii) a coset contains its generating element; 

(iii) every element of M lies in some coset; 

(iv) the cosets are disjoint. 


Proof. (i) Consider any coset Ym; and any m; contained in it. Then 
there is an a; in À such that arm; D my. From left reversibility there is an 
aj in A such that ajm; D my. Since À is a subsystem, X D Wa for all a in À. 
Hence Um; D Warm, D Am; D Waym; D Amı Since the first and last ele- 
ments of the chain ate identical, we must have equality throughout. Thus 
Am; = Am; for any my; in Am. 

(i) m is in Wm, since m; is in Wm; and Wm; = Amy. 
(iii) From (ii) every m, in W lies in the coset Wm. 
(iv) Let mx be contained in both Um, and Wm;. Then Um, = Am, = Wm. 


COROLLARY. The theorem is true under the weaker assumption that M 
is a left associative multiplicative system. 

We may mention that Nt has a unique double coset expansion M — Am,% 
+ AmB + + + with respect to a left reversible subsystem % and a right 
reversible subsystem %.? 

5. Homomorphisms. Let us consider any partition of an m-system 
Nt into subsets Y,,X.,---, where the subsets are not necessarily disjoint. 
We do however suppose that every element of W lies in some #4 We have 
already noted what we mean by the product X;1’; in the element sense. We 
may define the set product XX, to be the totality of those subsets‘, which 
have at least one of their elements in the element product X;X;. The %78 
then obviously form an m-system which is homomorphic to the original m- 
system. By a homomorphism of an m-system W? to an m-system M* we mean 
the usual many-one correspondence between the elements of M and the ele- 
ments of Mt which preserves multiplication. That is, every element of Ut 
corresponds to a unique image element of M* and every element of M* is 
the image of at least one element of Mt. Furthermore if mi, mj, nw have the 


8 This is a direct analogue for m-systems of Theorems 8 and 9, D. and O., p. 717. 
e Cf. D. and O., p. 718. 
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respective images m*,, m*j, me and if ee ors then m*ym*; D my. 
Also if mms D m"; Fi for some:.mi, mj, mx corresponding ` to them 
mim; D my. However, in the homomorphisms which we shall consider we 
shall find it convenient to place certain restrictions on the inverse. corre- 
` spondence from. M* to M. A homomorphism from M to M* is a strong 
left unit homomorphism if M* contains left scalar units and if for any m 

and m’ with the same image there is an element a in Wè iii eck to some 
left scalar unit of Yt* such that am D m. : 

An element .e of an m-system W is a-left scalar unit 20 if em =m for 
every m in M. An element which is both a left scalar unit and a right scalar 
unit (me = m) is called an absolute unit. Obviously there can be at most one 
absolute unit in any m-system. A set of left scalar units of an m-system M*. 
form a system of fundamental units with respect to a strong left unit homo- 
morphism of M to WM* if for any one of them there is an a in Mt corresponding 
to it such that for some m and m with the same ‘image am D nm’ and if for 

‘any m and m with the same image there is an a corresponding to one of them 
such that am D m’. We are now in a position to characterize completely the 
homomorphisms seat by the left coset a ete of left reversible 
subsystems.™* 


THEOREM 3. Let W be a left reversible subsystem of an m-system M. 
Then M is strongly left unit homomorphic to the m-system of the left coset 
expansion of Ut with respect to A, M/N, and the homomorphism has as a set 
of fundamental units those cosets containing elements of W. Conversely by 
any strong left unit homomorphism WM -> M" those elements of M which 
correspond to any set of fundamental units of M* form a left reversible sub- 
system A of M such that the Fe of the a coset expansion RAS ts 
isomorphic to M*.- 


Proof. Since M is a subsystem, X D Wa for any a in À. This implies 
AY D Wa. Furthermore XD HA. Combining we have AD Wa and 
Am D Yam for any m in M. However, since there is but one coset on the 
left, we must have equality. Thus the cosets containing elements of M are 
left scalar units. From Theorem 2 if m and m lie in the same coset there 
is an a in A such that am D m’. Hence the homomorphism is a strong left — 
unit homomorphism which has as fundamental units those cosets containing 
elements of M. . 

Conversely, let {e*; } be a set of fundamental units of M* and À the 
totality of elements of Nt corresponding to them. Suppose AA 8, where 


"3° For a complete discussion of units cf. D. and O., pp. 710-714. 
11 This is an improvement of D. and O., Theorem 13, p. 721. 


i à myn, A ` A 
"ASSOCIATIVE MULITPLICATIVE SYSTEMS. - . 827 


b is not in W. Then {e*,}{e*,} D be, where b* is not in {e*i}. But: 
| {e*i}{e*;} = {e*;}, which yields a ` gontradiction. Thus WY is a subsystem. 
If for some a in M am D m’, then m and m’ have the same image m*. Since 
the homomorphism is a strong left unit homomorphism, there is an a’ in W - 
such that am Dm. Hence Y is left reversible. ` But the same argument 
shows that two elements lying in the same coset of Y have the same image and 
two elements with the same image lie in the same coset. Therefore M* is 
isomorphic to the m-system of the left coset expansion M/A. 

COROLLARY. If WM is a multigroup then M* is a multigroup and A is a 
right submultigroup (that is, in M the second relation of Axiom 3 is satisfied). 

. Let us now introduce an extremely important class of subsystems, those 
which we shall call left normal subsystems. A subsystem % of an m-system Yt ` 
is left normal in M if for any m in Pt we have Um D mA? With this type 
of subsystem we may associate a homomorphism in which we impose a more 
rigid condition on the inverse correspondence than we assumed in the previous 
theorem. We say that a homomorphism % to M* is a strong left homo- : 
morphism if whenever in M* m*ım*; D m*; there is for every m; and mx in 
M corresponding to m*; and m*x respectively some m, corresponding to m*, 
such that mim; D my.1® We then have the following theorem.** 

Taxorem 4. Let A be a left reversible, left normal subsystem of an 
m-system M. Then W is strongly left homomorphic to the m-system of the 
left coset expansion M/A and M/A has as an absolute unit A. Conversely 
by any strong left homomorphism M —> M* wherein M* contains an absolute 
unit e*, those elements of Ÿ which correspond to e* form a left reversible, 
left normal subsystem A of M such that the m-system of the left coset 
expansion M/A is isomorphic to M*. 

| Proof. Let Am: Am: D Ams. If rC Wms, s CAm, we have to find an 
z in Am, such that er Ds. Since any element in a coset generates the coset 
we may write Am, Xr Ds. From left normality, WWm,r Ds. Therefore 
Amr s. Hence for some a in À, amyr D s and for some 7v in amy, Tr D 5: 
We thus have a strong left homomorphism. From the preceding theorem 
we know that any coset containing an element of Y is a left scalar unit. But 
for any m in M and any a in A we have Um D AUm D AmA D UmAa. 
Since there is but one coset on the left we must have equality and Wa is a 


11I have learned by correspondence that M. Krasner has also used this type of 
normality. 

13 This type of homomorphism was introduced in D. anā O., p. 721- 

u The same theorem for normal, reversible er is proved in D. and O., 
Theorem 1, p. 724. es 
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right scalar unit. It is thus an absolute unit and since there can be but one 
absolute unit in any m-system, Ya == M. - 

To show the converse we need only, by virtue of the preceding He 
prove that is left normal. Consider any r C ma. We know r and m have. 
the same image since e* is an absolute unit. From the strong left homo- 
morphism there exists an a’ such that a'm D r and hence Y is left normal. 

COROLLARY. If Ÿ is a multigroup, then Y is a submultigroup. 

6. Conjugate subsystems. In seeking an analogue to strong normality . 
in multigroups*® we are'led to the following notion. ‘A subsystem of an 
m-system M is left scalic if for any m; and my in M there exists an m'y such 
that Wm’; D m,Wm;. We cannot show, as in the case of multigroups, that a 
left scalic subsystem satisfies a normality condition. However we may show 
that if it is left reversible its cosets form a scalar m-system, where by scalar 
_m-system we mean an m-system in which multiplication is unique; that is, 
the product of any two elements of the system is a single element. 

THEOREM 5. The necessary and sufficient condition that an m-system M 
‘be strongly left unit homomorphic to a scalar m-system M* is that the m- 
system of the coset decomposition of Dt with respect to some left TRS 
left scalic subsystem A is isomorphic to M*.. 

Proof. ‘In view of Theorem 3, it is only necessary to show that if Wis 
left scalic, M/M is a scalar m-system; and if M* is a scalar m-system, À is 
left scalic. Since for any m4 and my there is an m’; such that Um’, D mi Am, 
we must have Um’; D Am:;Am;. As there is but a single coset on the left, 
we then have equality, and hence the product of any two cosets is a single. 
coset. Conversely if for any Am, and Wm, we have Wm’; — Umm, we then 
must have Wm’; D m;Am; since Y is left reversible. Thus Y is left scalic. 

The concept of left scalic subsystems leads naturally to the notion of left 
_ conjugate subsystems. Two subsystems A, B are left conjugates if there exist 
some m and. some m’ such that for any m; there exist m,’ and ms” which 
satisfy the following relations: _ 

Wm; mBm, BmyOm'Um/ Bm” D mm, Wm; D mBm;”’. 

It is obvious that if the m-system considered is a group then the above con- ` 
struct is the ordinary conjugate of a group. | 
_ THEOREM 6. Left conjugate is a symmetric, reflexive, transitive relation. 

Proof: The symmetry is obvious. The reflexiveness follows from the fact 
that Y is a subsystem, for we may choose m and m’ as elements in M. Then 
if we choose m,’ and m,” equal to m; we find that we may take $ — Y in the 





15 À strongly normal submultigroup is one whose coset expansion forms a group; 
_ cf. D. and O., p. 728 ff. 
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definition of left conjugate. To establish the transitivity assume that we have 
given subsystems M, 8, and © satisfying 
(1) Am? DmBm; Bm D mAm/ Bn DnEm En D n’Bni/ 
(2) Bm” DmAm;, Am D mBmj"” En’ D n'Bns Bn D nEn”. 
Select m; =— nj’ in (1). Then 
Um; D mBns D mnEn D TEn, rC mn; 
Crni D n'Bns D ww Am’ Dr Ami, 9 C nm. 

Select n; = mj” in (2). Then 
Cni” D Bm" D n'm/Umy D Amy: Am D mBmj" D mEn” D En”. 
Since m; is arbitrary we may replace it by the n; of the preceding relation. 
We then see that NX and € are left conjugates, for note that + and 1” are inde- 
pendent of m; and n. 

THEOREM 7. A left revérsible, left scalic submultigroup * is its own only 
left reversible left conjugate. 

Proof. Suppose Y is left reversible and left scalic and there exists a left 
reversible subsystem B such that 

(1) Um; D mBm;, (2) Gm; D m'Amy 
(1) Bm,” D m'Am; (2) Um; D mBm,”. 

It is easy to show that 2 is a normal submultigroup.! But this, together 
with (2), yields Bm, > Ur for any m; and some r depending on mj. Choose 
mj==b. Then 8 D 6. If we multiply through by 8 on the right, we obtain 
BDAB — BA W. We now establish the reverse inequality. From (2’) 
and (1’) we have Am; Wmm’Am;. As there is but one coset on the left 
we must have equality and mm’ is in Y since my is arbitrary and M is the only 
left scalar unit. In (2) choose m; == mj,” and multiply through by m on the 
left. Then from (2’) Am; D Am;’. Hence there are my in (2) such that 
my’ is arbitrary. Choose s so that sm D b for some b in 8. From (1), multi- 
plying through by s on the left, we have sAm; D Bm;. Hence AsUm D WBmy. 
Since Y is left scalic the left side is a single arbitrary coset. We may choose 
that coset equal to W. Then A D Bm;A for some m;. But in a two sided 
coset expansion any coset contains its generating element,!® and so my is in W. 
Hence A > BY == AB D PB. Since we have already established the reverse 
inequality, W == $. 

We define the left normalizer of a left reversible subsystem % of an 


10 We may readily show that such a submultigroup is strongly normal in the sense 
defined in the paper by Eaton and Ore, “ Remarks on multigroups,” American Journal 
of Mathematics, this number, pp. 67-71. 

11 Cf. Eaton and Ore, op. oit. 

Cf. D. and O. Theorem 10, p. 718. 
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m-system M to be the totality of elements n4 in Mt such that for each mu there, 


is an ni’ for which for any m in M there is an m’ and an m” which satisfy 
the following relations: 
Am D mm Um D ny Um! Mm” D nm Am D nm”. 


THEOREM 8. The left normalizer St of a left revertsble subsystem M. is a 


left reversible subsystem in which Y is left scalic. 


Proof. Let n, and nz be two elements of N. Then from the symmetry of ` 


the definition of left normalizer, n, and ne’ are in N. Let u be any element 
in the product nn: and select v as some element in the product nyn’. In the 
definition of left normalizer let m, ‘(the arbitrary m associated with n,) equal 
m. We then obtain Um,’ D ulm, and Am, D Am’. If we now select 
M: = m,” we find Um,” © vm, and Wm, D ulm,”. Since in the derived 
relations both m, and mz are arbitrary, u is in N and N isa subsystem. Suppose 
nr Ds for some n in N. Take m—r in the definition. Since W is left 


reversible we may take m’ = s. Then we have Ur D v/s. ‘If we multiply 


through by M on the left and observe that the left hand side is.a single coset 
we have Ur = Yn/Ws. Then for some a, ans Dr. But all the elements of A 
are in Jt and hence every element in the product an’ is in J. But for some 
n in this product we must have ms Dr. Therefore M is left reversible. 
That is left scalic in N follows immediately from the definition of left scalic. 


7. Structure properties of normal subsystems. Let us in this section © 
derive certain structure theorems for left normal, left reversible subsystems . 


- of an m-system which will permit us to formulate a Jor ordan-Hölder theorem 
‘ for these subsystems.!? 
| THEOREM 9. Let M be an m-system and À and ® left reversible, left 
normal subsystems. Then the crosscut (X, B) is a non-void Fi which 
is left reversible and left normal in both X and B. 
Proof. P= (A, B) is non-void since, from left normality, AB BA > a’ 
"Hence. there is an a and a b such that ab Da’. From left reversibility: there 
is an a” such that aa’ Db. Since Y is a subsystem b lies in A and hence in D. 
D is a subsystem since DD lies in both A and B and hence in D. Suppose for 
some d, ad, and & we have da, D as. Then for some bv’, b’a, Da. But 


Ub’ DEA. This implies that for some «” in A, a'b’ D a,. From left reversi- 


- bility aa, > b’ and as before D’ lies in: Y and hence in D. Thus D is left 


reversible in Ÿ and similarly in 8. To show left normality, consider any 
a, aD. Then for some b, ab D a. Since Ba D aW there is a b’ such that 


1° All the results contained in this section have been previously proved for normal, 
reversible submultigroups in D. and O., pp. 725-727. Our extension to left normal, left 
reversible subsystems -is,-of course, equally valid when the m-system is a multigroup. 


ey 
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b’a Da,. The left normality of D in A follows immediately if we can show ` 


that b” lies in M. We know Ab’ D WA and thus there is an a’ such that 
ab’ Da. Then for some a”, a'a, D b’ and as before b’ lies in W and hence 
in ©. Similarly D is left normal in 8. i | 


THEOREM 10. Let M be an m-system and N and B left reversible, left - 


normal. subsystems. C h the union [A,B] a left reversible, left normal 
subsystem and [N, B] — E 

Proof. UB — ae - AMD. Hence AB is a subsystem. But AB D P. 
Also AB D BY DH. Since [M, B] is the least subsystem containing both 
A and B we must have [W, B] — UB. Furthermore YB is left reversible, for 
suppose rm D m where ab Dr. Then abm D m’. Hence there is an s C bm 
such that ar D m’. There is then an a’ and a b’ such that a’m’ D x and 


b's > m. Hence there exists an s C b'a’ such that sm’ Dm. But AB — BY 


since they are both equal to [%, B]. Thus s is in ABY and WB is left reversible. 


‘Finally AB is left normal since ABm D AmB D mAB. 


Tasorex 11. The Dedekind Relation. Let Mt be an m-system and X, 


.B, and t left reversible, left normal subsystems such that CDM. Then: | 


(C, [M, B]) = [X (€, B)]. 


. Proof. Let s be in [X, (€, B)]. Thén s is in AE and hence in € and 
also in YB. Thus s is in (C, [X,B]). Any element in (C, [A,B]) is in ©. | 
Let c be such an element. We then-have for some a and some b, ab D c. From _ 


left reversibility there is an a’ such that a'c D b. - Since a’ is in © so also is b. 
‘Thus c is in both YB and WE and is consequently in [A, (€, B)]. 
Tarore 12. The Isomorphism Theorem. Det M be an m-system and 
A and B left reversible, left normal subsystems. Let X/Y denote the m-system 
of the left coset expansion of X with respect to Y. Then 
[A, B]/A = B/ (A, B). 
Proof. Let (A,B) =D and [N, B] = AB.—N. Let 


(1) > ` B=DHDb H HDb 


be the left coset decomposition of 8 with respect to D. Then 

(2) fe | N = A H Ab +: - +A +: 

is the left coset decomposition of Jt with respect to for firstly every element 
of Jt lies in some coset of (2) since every element of 8 les in some coset of (1) 
and A = AD, Further the cosets of (2) are distinct since if ab, D by then 
from left normality b'a D b; and from left reversibility a is in 8 and hence 
in D. But we would then have a contradiction of the assumption that (1) is 
a left coset decomposition. Now let %b,%b; D Abr. Then from left normality 
bib; D Wh, and for some a, ab;b; D by. But as before we must then have 


` 
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‘à lieb. in B and hence in D. “This Ste DB, > Dh. Conversely if 
Db:Db; D Dbz then ‘for, somé 4 ‘and a, abab; D bk and Ab Ab; D Ubi. The 
isomorphism is thus established. . 
The preceding theorems are sufficient to prove an analogue of the Jordan- 
Holder theorem for m-systems. A chain of subsystems A DH D M: D- -D My 
—8 isa left composition series between A and B when each’ Wy is a left 
reversible, left normal subsystem in the preceding and when no ether, terms : 
can be intercalated in the chain. We then have: 
TauOREA 13. Any two left composition series between A and B have 
. the same length and the m-systems of the left coset expansion of consecutive 
terms in one series are isomorphic in some order to those in. the other series. 

The proof of the theorem is by induction on the length of the chain and l 
- is entirely similar to the proof of the corresponding theorem in group theory. 
| We may mention that we may define, as in group theory,” a left quasi- 
normal subsystem Ÿ of an m-system Yt to be such that AB D BU for every 
subsystem 8 of WM. It is then possible to formulate a J ordan-Hôlder theorem 
. involving strong structure isomorphism for the left quasi-normal, reversible 
subsystems. Ea 

8. Coset dicotgosition of groups. We may use the preceding results ~ 

to characterize completely the coset decomposition of a group with ee to 
`. any subgroup. . 
, THEOREM 14. The necessary and’ sufficient condition that a partition of 
_ a group © into disjoint subsets be the left coset decomposition with respect to 
a subgroup © is that the m-system M of the partition be such. that ` 
(i) M contains a left scalar unit E; 
_ (ii) XA D A implies X — E. 


Proof. Necessary. § is obviously a left scalar unit of M. Suppose 
$9:9; D Hg. Then for some h and h’ in §, hgih’g; = 9;. This IDÉES 
gs is in $ and Sg: = Š. 

Sufficient. If G is finite the elements corresponding to F obviously form 
a subgroup. If © is not finite we still must have that e, the unit of ©, 
corresponds to # since # is the only left unit. Furthermore if the es 
of some element corresponding to E corresponds to X, we must have XE D E 
and hence X = E. Now let r and r be two elements of @ with the same 
image À. We may determine v in G such that sr = 7. Then XRD R and 
== E. Thus the homomorphism of @ to M is a strong left unit homo- 
morphism and by Theorem 3 Wè is isomorphic to the m-system of the left coset 
expansion of & with respect to the er corresponding to F. f 
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By Gustav A. HEDLUND. 


1. Introduction. Two distinct methods have been used to prove that : 
the flows defined by the geodesics on suitably restricted surfaces of constant 
negative curvature are metrically transitive. The first-of these [2,3] + involves 
the use of symbolism to characterize the geodesics and the proof is restricted 
to those surfaces for which a suitable symbolism has been devised. The second 
of these methods [6, 7, 8] makes use of the theory of harmonie functions and 
is valid for all complete surfaces of constant negative curvature and of finite 
area. It is the opinion of the author that both of these methods involve 
excessive machinery and it would seem desirable to derive a more simple and 
straightforward proof of the result under discussion. ; 

The present paper gives a new method of proof of the metric transitivity 
of the flow defined by the geodesics on any closed orientable surface of con- 
stant negative curvature. It seems to the writer-that the present proof is 
considerably simpler than any previously given. The method extends readily ` 
to the general class of complete surfaces of constant negative ARVALS and 
of finite area. 


"2. Two-dimensional manifolds of constant negative curvature.. Let J 
¥ be the interior of the unit circle U, £? + y? — 1. To © we assign the metric 
2 4(de + dy?) 
c(1— 2? — 7)? 
the Gaussian curvature of which is —c. The metric (2.1) assigns a length - 
-to curves in ¥ and this length is termed hyperbolic length or H-length. Angle 
is euclidean angle and the element, of (hyperbolic) area is 

4dxdy 
(ie) 

The geodesics defined by (2.1) are arcs of circles orthogonal to U and 
are called hyperbolic lines or H-lines. -An H-line is uniquely determined by 
two points of U and these points are the points at infinity of the H-line. The 
hyperbolic distance or H-distance between two points of ¥ is defined to be the 
H-length of the unique H-line Lo joining the points. 


(2.1) | ~ dg? om e>0, 


* Received November 17, 1939. š | 
+The numbers in brackets refer to the bibliography. hg 
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wre annie is.a cutideagtircte internally tangent to U. The salt A 
of contact of the horocycle with U is the point at infinity of the horocycle. 
Let # denote thé minimtim H-distance from the origin O to an arbitrary point 
of the horocycle, and let + == + # or —# according as O is interior or exterior 
to the horocycle.- The- point at infinity A and the constant r uniquely deter- 
mine a horocycle and this horocycle will be denoted by C(A,r). The horocycle 


O(A, r) is an a trajectory of the set of H-lines having A as one point 


at infinity. 


The metric (2.1) is inyariant under linear fractional transformations 


which take ¥ into ¥, so that under such transformations, hyperbolic distance, 
angle and area are invariant. 

© Let F be a Fuchsian group with U as principal circle (cf. [1], Ch. III). 
Such a group has a normal fundamental region containing the origin and 


bounded by H-lines or H-line segments which are congruent in pairs. If to: 


‘ this domain is added a suitably chosen subset of the vertices and a suitably 
chosen subset of the sides, the resulting region R is such that no two points 
of it are congruent and any point of ¥ is congruent to some point of R. We 
assume that F is such that the closure of R contains no points of U. It follows 
that F is of the first kind and has a finite set of generators, while the hs 
R has a finite set of sides. 

If points which are congruent under F are considered identical : there is 
defined a closed two-dimensional manifold M (F,c) of constant negative curva- 
`. ture. In the case in which M(F,c) contains no singular points, all the 

transformations of F are hyperbolic. The genus of M (F, c) is then necessarily 
greater than one. 

An element e in © is a point P of ¥ together with a direction at that 
point and can be specified by three coördinates (x, y, $), where æ and y are 
the coördinates of P and #, ÒS p < r, is an angular coôrdinate measured 

. positively in the counterclockwise sense from the directed H-ray which has È 
as initial point and is part of the directed H-line which passes through P and 
has (1,0) as initial point at infinity. The point P is the point bearing the 
element e. A neighborhood of the element (21, 4:,¢1) is the set (x, y, $) 
such that 

-H(P, Pi) <8, | ¢—¢ | <8, 


where P is the point (z,y), Pı is the point (2,,4,), H(P, P;) denotes the 
H-distance between P and Pı, || ¢— ¢: || denotes the least value of the set 
| p— pi +?nr|, (n—0,+1,+2,- “:), and 8>0. Let € denote the 
space of elements in Ÿ with neighborhoods thus defined. 


A transformation of F carries an element into a congruent element. The 
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l space Q of elements on M (F, c) is the space obtained by ibn net 
elements of €. If neighborhoods in Q are defined. by correspondence with the, 
neighborhoods defined in € (cf. [9], p. 32), Q is a* Hausdoïff space.. It is 
easily shown that Q is separable and regular and it follows that a metric 
yielding an equivalent topology can be assigned to Q. With such a metric 
assigned, the diameter of a subset of Q is defined. 

An element (z, y, p) determines a unique pois of Q, and this os will 
be called either the point of Q determined by (x, ÿ +) or simply the point 
(z, y, $) of Q. 


If measure is defined in € by means of the volume element 


dodé, 


where do is given by (2.1), congruent measurable sets of € have the same 
measure and this measure serves to define measure in Q (cf., e. g., [4]). The 
hyperbolic lines or geodesics define a geodesic flow G, in Q (cf. [4]) and G, 
is a measure preserving transformation of Q into itself which is defined for 
each real s. 


3. The tubular property of invariant sets. Let A be a point of U 
and let a be the smallest positive angle which the radius OA forms with the 
positive x-axis. The points of U are then in 1—1 correspondence with 
the interval 0: a < 2r and the horocycle C (A,r) can be denoted by C (a,r). 

Let C(a,r) be a horocycle, some point of which is in the region R. The 
points of U(a,r) in R form one or more connected arcs of O(a, r). Let this 
set of arcs be denoted by a(a,r) and let the number of arcs in this set be 
N(a,r). Since a horocycle can cross an H-line in at most two points, the 
number N (a,r) is not greater than twice the number of sides of R. Since R . 
is determined by F, and since, under the assumptions we have made, R has a 
finite number of sides, there exists a uniform upper bound Na for the integers 
N(a,r), where this upper bound is determined by F. Let L(a,r) denote the 
H-length of the shortest arc of C(a,r) which contains all the arcs a(a,r). 
Since the closure of À contains no points of U, there is a uniform upper bound 
La for the numbers L(a,r) and again Lo is completely determined by F. 

Consider the set of arcs a(a,r) and the elements externally normal to 
C(a,r) at the points of a(a,r). The points of Q which correspond to these 
elements form a set of arcs &(a,r) of Q and the totality of arcs a.(a,r) 
obtained by considering all admissible values of a and r form a set which is 
identical with ©. -By considering elements internally normal to C(a,r) we 
obtain similarly a division of Q into arcs ai(a, r). 

Since the closure of ca lies interior to U, the values of + assumed in the 
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` determination of the arcs a(a,r) or ai(a, r) lie between two suitably chosen 
constants 1, and ra such that rı < 0 < T}. It follows that the values of (a,r) - 

determining ares e(a, T) {ai(a,r)} form a set Œe{Œi} of the rectangle &, - 

OSa < m, ri LTL Tg 
~ Let the rectangle R be divided into a net À, of n? rectangles by n lines 

parallel to the a-axis and # lines parallel to the 1-axis. We assume that the 

sequence of nets Aj, Az,’ : - has been so determined that the maximum dia- 
` meter (measured in terms of euclidean distance in R) of the rectangles of 
the net A, approaches zero as n becomes infinite. It is to be understood that 
any one of the rectangles of the net A, includes just one vertex, namely the 
lower left corner, and two open sides, the lower horizontal and the left vertical. 

. The division of R into the net A, effects a division of the set Ge{ 1} into 
subsets, a subset being the points of (.{@Zi} in a rectangle of the net A4. 
The totality of arcs & (a, r){&(a,r)} of Q corresponding to the points of. 
C.{@;} in a single rectangle of A, will be called a tube of Q. Thus, corre- 
sponding to the net A, there is a division of Q into two sets of tubes and we 


<- will denote these sets by Te(n) and Ti(n) respectively. A tube is evidently 


a measurable set and for each positive integer n, Q is the sum of the tubes of 
To(n){Ti(n)). | 

Let # and G be’ measurable seta of Q and let G be of positive measure. 
The relative density of the set Æ in the set G is definèd to be the number © 
m(E£ - @)/mG@. . 


THEOREM 3.1. Let Æ be a mainile invariant set of Q. Then, 
given « > 0, there exists a positive integer N such that if n >N, the set E, 
except possibly for a set of measure less than e, lies in tubes of To(n){Ti(n)} 
tn which the relative density of the set F ts at least 1—e. . | 


The proof will be restricted to the case of tubes of EAM): An analogous 
proof applies to the other case. 

Under the measure preserving els Ga of Q into itself, the set 
ae(a,r) is transformed into a set of Q determined by the elements externally 
normal to a-set of segments o,(a,r) of the horocycle C(a,r+s). Let L.(a,r) _ 
be the H-length of the shortest segment of C(a,r+s) containing the set 
os(a,7). As s—>— oo, L,(a,r) —0.: Since the H-lengths Lola, r) = L(a, r) 
are uniformly bounded by the constant La, the numbers L,(a,r) approach 
zero uniformly as s—>— œ. It follows that the diameter of the set of Q 
determined by the elements externally normal to C (a,r +s) along os(a,r), 
and hence the diameter of the set Gs(ae(a,1r)), approaches zero as 8—>— œ. 

Let te(n) denote an arbitrary tube of the set T.(n). As 8—>— œ, the 
* diameter of the seb G,[te(n)] does-not in general approach zero. But if n is 
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_ large, the tube te(n) consists of sien eater near the elements of a - 
set a(a,r) and iff <0 is properly chosen, the diameter of the set Gs[te(n) ] 
will be small. Let sn denote a value of s for which the maximum diameter 
cf the sets G.[t. (n)] is a minimum, $ ranging over the interval — œ < $ < 0, 
while t(n) ranges over all tubes of the set 7,.(n), and let this minimax ` 
diameter be d(n). It is then geometrically evident that lim d(n) — 0. 
: n700 


Since, for any given positive integer n, the tubes f.(n) of Te(n) form a 
division of Q into non-overlapping measurable subsets whose sum is Q, the 
same is true of the sets G.,[t.(n) |]. Since G,, is a measure preserving trans- 
formation of Q into itself and F is an invariant set of Q, it follows that 


m{Gs,[te(n)]} = m[te(n) ], 
and : ca 
m{Ge,[te(n)] E} = m[to(n).- E]. 


Hence the relative density (if it exists) of the set F in te(n) is identical with 
the relative density of E in the set G.,[t.()]. To prove the stated theorem 
it suffices to prove the following lemma. 


Lemma 8.1. Let E be a measurable set of Q and let An, n—1,2,:::, 
denote a division of Q into a set of subsets called cells such that: (1), the 
number of cells in An ts finite; (2) the sum of the cells in A, is Q; (3),.each 
of the cells forming An ts measurable; and (4), if. da denotes the maximum 
diameter of the cells of An, lim da = 0. Then given « > 0, there exists a. 


n> 


positive integer N such that if n > N, the set E, with the exception of a set ‘ 
of measure less than ¢, lies tw w cells of A, in which the relative nie. of E 
is at least 1—e. 


Since E is a measurable set and mQ <.0, corresponding > € > 0, there 
exists an open set Fo of Q such that Es 2 E and m(E — E) <&/2. 

Let A*, denote the subset of cells of An lying in Ey. The set A*, con- 
tains any point of Hy which is the center of an open sphere of radius d, made: 
up of points of Eo. It follows that any point of Hy lies in the set A*a for n 
sufficiently large and hence, corresponding to e> 0, there exists a positive 
integer N. such that m(H)—A*n). < </2, provided n >N. The following 
inequalities evidently hold. | | 


(3.1) m(A*, — E+ A*,) <m(Eo—E#) <é/2, n>N. 
(3.2) ©  m(E—E-A*,) S m(Eo— A*a) <</2, n>N. 


Let A*, denote the set of cells of A*, in which the relative density of 
the set F is less than 1—<. It follows that 
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emA*, S m(A*, —A*,-E), n>N, 
and since | 
At, — AY, EC Ey — E, n>N, 


we infer with the aid of (3.1) that 
(8.3) TEE mA*, <</2, n>N. 


Except possibly for the sum of the sets E— E- A*, and H-A*,, the set 
E lies in cells of A*, in which the relative density of E is at least 1—e. 
But the measure of each of the sets E — E + A*, and E - A*, is, according to 
(3.2) and (3.3), less than «/2 if n > N, and thus we can infer the truth 
of the stated lemma. 

The proof of Theorem 3.1 is complete. 

The following evident extension of Theorem 3.1 will be useful.’ 


THEOREM 3.2. Let T*,(n){T*,(n)} denote a division of each of the 
tubes of Te(n){Ti(n)} into measurable non-overlapping subsets such that 
for each n the number of the sets in T*,(n){T*, (1) } is finite and their sum 
is Q. Let E be a measurable invariant set of Q. Then given «> 0 there 
exists a positive integer N such that if n > N, the set E, except for a set of 
measure less than €, lies in sets of T*,(n){T*i(n)} in which the relative 
density of the set E ts at least 1 —<. - 


4, Metric transitivity. Let e be the element (x,y, $). This element 
can also be specified, by three codrdinates (a,r, h), where @ and r are the 
numbers determining the horocycle C(a,r) which passes through (z, y) and 
‘has e as an exterior normal element, while À is the oriented hyperbolic arc- 
length on O («,r), measured positively in the clockwise sense on O (a,r) from 
the point of C (a,r) which is nearest the origin. The transformation from 
(z,y,¢) to (a,r,h) is analytic with non-vanishing Jacobian in the set 
z? Hy <1, 0< o<. 

Let Q* denote the subset of Q determined by the elements (x, Y, p) such 
that (x, y) is in the interior of the region R and 0 < D <r. It is evident 
that m(Q—Q*) —0. If F is a measurable subset of Q, the measure of the 
set E -Q* coincides with that of Æ and by the transformation from (z, y, $) 
to (a,r, h) defined above, the-set E:Q* can be represented in the (a,r, h) 
space by a measurable bounded set, the measure of which is defined to be that 
of E-Q. If the metric density of E at any point of * is defined by means 
of cubes in the (a,r, h) space, it is well known that the metric density of E 
is 1 at almost all points of E and 0.at almost all points of Q— E. 
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Lemma 4.1. If e(@ tu hı) and ec, Ti, he), hı < he, are elements 
such that the points bearing all the elements 


(41,71, h), h Sh S h, 


‘are interior to R and if the metric density of the measurable invariant set E 
is 1 at 6, then the metric density of E at ez ts also 1. 


_ We assume in the proof of this theorem that a: 40. If a, were zero, 
a slight rotation of the rogton R would permit the application of the given 
proof. : 
Under this condition the constant 8 > 0 can be chosen so small that all 
the points (a,r, h) satisfying the. condition 


` 


ļa—a] <8 |r—n|<8 h—8ShASh. +8, 
are in Q*. We denote this P by Bs. The subsets of Bs determined by the 
inequalities | 
esalea renlet |h-—ù|<s, (i= 1,2), 
will be denoted by Os and Ds, respectively. If we let 


m(E : Cs) = 
mC3 


it follows from the hypotheses of the lemma that lim às = 1. For the 


moment we hold & fast. 
Let Te(n) be a division of Q inte tubes as defined in § 3, and let the sets 
T$e(n) be determined as follows. If a tube of 7.(n) contains no points.of 
Bs, the tube is a set of T*,(n), while if a tube t(n) of Te(n) contains a 
point of Bs we divide te(n) into two sets t.(n)- Bs and te(n) — t(n): Ba, 
both of which are sets of T*,(n). The sets T*,(n) then fulfill the conditions 
imposed in Theorem 3.2 and given € > 0, there exists a positive integer N 
such that if n >N, the set E, except possibly for a set of measure less than e, 
lies in sets of 7*,(n) in which the relative density of E is at least 1 — «.. 
Let the set obtained by excluding the exceptional set from Æ be denoted by 
E*,. Then m(# — E*,) < if n> N, and we assume that n is so chosen.. 
The subsets of 7*,(1) containing points of Bs form a set of rectangular: 
parallelepipeds (in (a,r, h) space) 2 
(4.1) > à t”, l (i= 1,2,; --,v(n))- 


The sets 
(4.2) hd: Ca, | (t= 1, 2,- "+, ¥(n)), 


À, 
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. form a division of C3 into non- overlapping subsets acne parallelpipeds) . 
and let 


(4. 3) ti," Ca, (EA; 2,-- -,u(n)), 
denote those sets of (4.2) which contain points of E*„. It follows that . 
(4.4) mS ta C5) = m(E*, Ca). - 
Since .c | 
m(E Cs) = Asma, | 

“it follows from the condition m(E— E*,) < e and (4.4) that 
(45) nur. 0) Z dam —e 
But the transformation 7 

| a =&, r=r, h= h + const. 
is measure preserving in Q (cf. [5]) and thus 
(46) ms ms; mt. 0s) = m(t": Ds), (i= 1,2, -,v(n)). 
We infer from (4.6) and (4.6) that 


k . uin) | 
(4.7) m(Z ti”: Ds) = AsmDs — e. 
=1 N 
Since í M i ; 
m(t": E*n) = (1 —«) mtn”, KA (k —1,2,:::,n(n)), 
it follows that : 2 , 
(4. 8) m{E*, > (ti,"* Ds) } = mit - Ds} — emin", 


(k= 1,2,- j ",m(n)). : 
Summing over k = 1,2,- - -, y(n), we obtain | 


m(E*, : Ds) = nS (ti” Di) — em CE ta”). 
From this n and (4.7) we infer that 
| ris(H*,- Ds) = AgmDs—«—emB, 








` whence 
| m(E - Ds) m(E*, Dè) > = e Bs 
(a . MmDs = mDa =a aD; € mD” 
Since, for a given § > 0, e can be chosen arbitrarily small, we infer that . 
m(E- Ds) SX 


mDs 
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This implies that the lower metric density of E at (æ, rı, ha) is at least as 
great as the metric density of E at the point (a, 1, hy). But the latter was 
assumed to be 1, and-the statement of the lemma is proved. 

The element obtained by rotating a given element through 180° is termed 
` the element opposite the given element. The proof of the following lemma is 
closely analogous to that of Lemma 4. 1, and will be omitted. 


LEMMA 4.2. If ea, hı) and es (1,11, he), hi < he, are elements 
such that the points bearing all the elements 


(ca, 1, h), hks ha, 


are interior to R, and if the metric density of the measurable invariant set E 
ts 1 at the element opposite e,, then the metric density of E at the elemani 
opposite 6, 18 also 1. 


THEOREM 4.1. (Metrical Transitivity.) If E isa measurable invariant 
set of Q, either mE = 0 or m(Q — E) = 0. 


It is sufficient to show that if mE > 0, then mE = mQ. If mE > 0, 
E contains a point p(z,y,¢), (x, y) interior to R, such that the metric 
density of E at p is 1. Since Æ is invariant under the measure preserving 
geodesic flow G., each element on the directed geodesic of which p is an ele- 
ment will be a point of # at which the metric density of Ẹ is 1. Since (x, y) 
is interior to À, there is a connected arc a, of the horocycle C, of which p is 
a normal exterior element such that a contains (v, y) and lies in the interior 
of R. According to Lemma 4.1, each element which is externally normal to 
C, at a point of & is a point at which the metric density of E is unity. 

Similarly, there is an arc & of the horocycle Cz of which p is a normal 
interior element such that a; contains (x,y) and lies in the interior of À. 
According to Lemma 4. 2, each element which ig internally normal to C» at a 
point of a; is a point at which the metric density of E is 1. 

Thus there are three possible transformations of an element such that if 
the metric density of Æ at the element is initially 1, it is 1 after the trans- 
formation. It is a simple geometrical problem to show that given any # 
such that 0S ¢’ < 2r, it is possible to transform (z,y,¢) into (x, y, ¢’) 
by means of these transformations without passing out of a small neighbor- 
hood of the point (z, y). Thus, if (z, y,#) is an element at which the metric 
density of E is 1, then the metric density of Æ is 1 at all the points (x, y, 4”), 
OS p < 2x. If ¢’ is chosen properly, the directed geodesic determined by 
(x, y, ’) will pass through the origin, and thus some point (0,0,¢) of Q is 
a point at which the metric density of # is 1. But then all the points (0,0,¢), 
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0 ¢ < 2, are points at which the metric density of E is 1. By reversing 
this process we infer that every (x, y, $), (2, y) interior to R, OS $ < 2r, 
is a point at which the metric density of E is 1. It follows that mE == mū. 
and the proof of the theorem is complete. i 


UNIVERSITY OF VIRGINIA, 
. CHARLOTTESVILLE, VA. 
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THE EULER NUMBER OF A RIEMANN MANIFOLD.* 


By CARL B. ALLENDOERFER. 


1, Introduction. One of the chief links between the differential 
geometry and the topology of two dimensions is the corollary to the Gauss- 
Bonnet theorem which states: The integral of the total curvature of a two 
dimensional closed surface over the-surface is equal to 2rW, where W is the 
Euler number of the surface. Since the Gauss-Bonnet theorem is of intrinsic 
character, this theorem does not require the surface to be a subspace of any 
Euclidean space. 

An alternative, but less inclusive, proof of this theorem can be given 
which avoids the Gauss-Bonnet theorem and uses instead the property that 
the surface lies in a three dimensional Euclidean space. This proof can be 
generalized to a closed Riemann space En of even dimension which is a sub- 
space of an n + 1 dimensional Euclidean space, i.e., a hypersurface. In this 
case the theorem takes the form: 


(1.1) f KdO mN 
Ra 2 


where K is the total curvature of Rn, o is the area of an n-sphere (a sphere 
whose surface is n dimensional), and N is the Euler number of Ra. We recall 
here that K is defined for a hypersurface as the product of the n principal: 
curvatures, and that it can be expressed as a polynomial in the Rapys of Rn 
divided by the determinant of the gag. We shall not consider the case n odd, 
for it has been shown that (1.1) does not hold under these circumstances.* 

The problem at hand is to extend (1.1) to spaces which are not hyper- 
surfaces. If no imbedding is to be assumed this requires a generalization of 
the Gauss-Bonnet theorem to more than two dimensions, and so far this has 
not been accomplished. Progress, however, can be made by assuming that Ry 
lies in a Euclidean space of n + q dimensions, and on'this basis we shall 
prove the following 


THEOREM. If a closed Riemann manifold of even dimension can be 
made a subspace of a Euclidean space Eng, then 


| Í. Kd0 = kon, 


* Received October 12, 1939. 
1 For the case of a hypersurface see H. Hopf, “ ther die Curvatura integra ah 
sener Hyperflachen,” Mathematische Annalen, vol. 25 (1925), pp. 340-367. 
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where 


K e= Raabipa LA Bansanbniba dasi aneh, Fete 
n12 | gap | 





The term “closed Riemann manifold” is used in the sense defined by 
Hopf in a paper in which the background of this problem is discussed and 
the present investigation suggested.* 

The chief difficulty in the preparation of this paper was the definition 
of K since no theory of principal curvatures, etc. exists for spaces other than 
hypersurfaces. Instead K is defined indirectly by the use of the theory of 
tubes recently developed by H. Weyl.’ ‘Once this is accomplished our theorem 
is an immediate application of Kronecker’s index theorem * and of Weyl’s 
results, 


2. Kronecker’s index.* An important tool in the proof is an integral 
theorem due to Kronecker, the proof of which is here summarized from the 
present point of view. Let S be an n dimensional closed Riemann manifold 
on which is defined a set of n + 1 functions of class C°, V4(x), which satisfy 
ViVi== 1. By means of this set of functions we can consider a continuous 
mapping of S upon the unit n-sphere, X, whose equation is V#Vt=1. The 
orientations on S and X are those imposed by a fixed orientation in the 
arithmetic space of the parameters 2%. This mapping is of a definite degree 
d, where & is an integer, positive, negative, or zero. We seek an analytic 
expression for d. Consider the determinant: l 


yt... ym 

A dise 

ðr! Oat 
(2.1) D = 

ay: | | gym 

ðr” dx” 
It is easy to show that 

A ayt ave 
(2.2) D = m. f 








2H, Hopf, “ Differentialgeometrie und Topologische Gestalt,” Jahresbericht der 
Deutsoher Math. Vereinigung, vol. 41 (1932), pp. 209-229. Also see Hopf und Rinow, 
“{ber den Begriff der vollstandigen differentialgeometrischen Flache,” Comm. Math.” 
Helvet., vol. 3 (1931), pp. 209-225. f 

SH. Weyl, “On the volume of tubes,” American Journal of Mathematics, vol, 61 
(1989), pp. 461-472. Readers should note that in Weyl’s paper w, refers to the surface 
area of a sphere which incloses a volume of n dimensions. We put w, equal to the 
area of a sphere whose inclosed volume is # + 1 dimensional. 

i For a full treatment see J. Tannery, Introduction à la Théorie des Functions, 
Note by J. Hadamard, vol. 2, pp. 437-477. 


THE EULER NUMBER OF A RIEMANN MANIFOLD. -245 


which is recognized as the determinant of the metric tensor of the n-sphere 
if V‘ are taken as Euclidean coördinates. The area, wn, of the sphere is thus 
given by: 


f E E a ee 
S 


integrated over 8 provided the sign of the radical is chosen from point to point 
to allow for overlapping of the covering. This is accomplished at once by 


using Í, Ddzx. For D is numerically equal to VD? and has a positive sign 
for elements on the sphere of positive orientation and a negative sign in the 
opposite case. Therefore f. Ddr = d: wn. 

5 


In the special case where S is a hypersurface and where V* is the normal 
vector ét, we arrive at the total curvature of S. For 





i E L Buggi PE 
(2.3) gaa Vasg ae 
where bap are the negatives of the coefficients of the second fundamental form 
: ay? byt , 
of S. From the fact that Ge Qa "Jen We have that: 
(2.4) D? = | bapg? ba |; 
or that 
2.5 D =e | bap ; 
~ [gesi 


By considering the special case _ 
i 
E= (1,0, 0); = (050,10, : -,0) 
with 1 in the (æ + 1)-th place, bag == bap; Jag — Sag; it is shown that e is 


definitely + 1, since (2.5) is an identity. But since K, the total curvature, 
equals | Bag |/ | gag |, this shows that: ' 


(2.6) f KAO = d` wn, 
8 


where dO = V | gag | dat: > - da”. 
Since n is even, it is necessary that N, the Euler number of 5, be equal to 2d. 
Hence i 


(2.7) f, Kd0 = FuN 


which is the required theorem for this special case. 


3. Fundamental equations on tubes. Let y'==#yf{u) be the para- 
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metric equations of Ra in Eu: for a neighborhood of Ry. Since it may be 
necessary to consider a number of sets of such parameters in order to cover 
Rn completely, we shall let u* be a typical set. There now exist g mutually 
orthogonal unit vectors st(o ==1- +q) which are normal to Ry. Let 
these be chosen as functions of class C* and such that the determinant 








| Ect, oe > 0. The parametric equations of a tube of unit radius may then 
‘be written: 

(3.1) af yi(u) + (0) bot 

where ù : | 

(3.2) | tte — 1 


and v4 (A—1::-q—1) are parameters on a (g—1) sphere. Again sev- 
eral sets of v’s will be needed to cover r the sphere, The tangent vectors of this 
tube are then: * 


dat 1y 

= as za T E apg > y ui T VPo/a épt} 
(8. 8) 8x ae Et. 

804 dvd 


The tube is moreover a hypersurface of Eng and its normal mile is then 
tot. This follows from (3.8) and the ais 


ot y: == 0; ÉctÉpt == Bap; 3 pl a = 0; VpO Ja "= — Vop/a- 
We then observe that: ` 


| | 
(3.4) pa (Eo!) = — Payag + Eyoo/abot; 
3. 4 
a ate 
aon (bot) = a bot. 


Hence for the tube the D of Kronecker’s index is 


ÉTÉ a! 1 row 
: gi | 
(3. 5) D=— ua oO 4 q— 1 rows 


10048198 + Évypo/aëpt | n rows 


where ay = — And at once it follows that 





dés gte 
(3. 6) Dt = | Pragtig | X DA O98 
whose square root gives: 
10%, PT a 
(3.7) pg) ree 
f | gae | 


£ For instance see L. P. Eisenhart, Riemannian Geometry, p. 189, equations (56. 3). 


THE EULER NUMBER OF A RIEMANN MANIFOLD. “ 247 
a a N We note that V¢# is the surface element of a q— 1, 
sphere, 3. The value of e here depends essentially on the orientation chosen 
on À, and on the sphere. In order to decide this matter consider the special 
case where: fo! == (0,---,0,1,0,---+,0) with 1 in the o-th place only; 
Yat = (0,--+,0,1,0,- > -,0) with 1 in the a + g-th place only; Map — Sab; 
us = 0 for o1; gY? == 8%; vpoja =Ò. Then (3.7) becomes term by term: 





where t = | 





th ee ote qa 4 
oe ott 
Out êvt o: 
$ = es 

ata =. VE vi 
Bai Spori : 

| ip 

o gp 

0 ' 


tt 
The value of the upper left-hand minor is + Vt. We choose the positive 
orientation on X so that the sign of the radical is positive at all times. This 
shows that e = -+ 1, since (3.7) is an identity. Remembering to perform 
all integrations in the thus determined positive sense, we have that l 


f LB | 7 yg ar- . . dyttdy- - Mites aa 
Re È | gap | a 


where Ñ is the Euler number of the tube, provided that n + q—1 is even. 
We have assumed that n is even, and hence require q to be odd. The'case g 
even will be handled presently. 


4, Final results. Here we follow closely the results of Weyl’s recent 
paper in evaluating the integral on the left. , Since 


(4. 1) r= [sel Vra «ave 
| gap | 
is an orthogonal invariant with respect to the index o, and since n is even, 
I is expressible as a polynomial in {.g0%s== Eagyy. eee Weyl we 
have that: 
eS Tes (n—1)(n—3)---3-1 

4,2 I= + KE 
a 9 (q +2) - =< (w+ q—2) 
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where Fe o 
. g — l Barres" ` Banpns/anb t Te r 
nl “| gap 


Because of the relation: Ragy = Euyygs — Easy, we may write: 








14. 1 Rampe: : Robe Men En 
(4:8) =a = i 


Thus: ` z í 
. —— | (n—1):::3:1. 
4. 4 K dut : - - du™- wg : 
AD, fpr VTT dat + dur oes DST nm 
ace : , — $ Nowa: 
Recalling that q is odd we have that 








ONS (ar) - 
eee T+ 2): (nF g— 2) 
t 2 - (2r)"/3 


ot (n—1)(n— 8) Bed 

and hence that , ; 

Se oe (n—1){n—3): S 
RS deu {ame fe 

© Combination of (4.4) and (4.5) gives: 

Era | eee 


: i EE de ae 
(4.6) LE Video| au du" PAM 


Now we know from topology that Ñ == 2N where N is the Euler number 
of Ra. For the tube is topologically the product of Rn with a q — 1. sphere 
where g — 1 is even. This leads at once to the above relation between their .. 
Euler numbers. Thus we have that for n even and q odd: 


TE l N 
leaa’ R a == — Oy, 
(4.7) Sf, EK V | gap | du du = f KdO z 


- This result extends immediately to the case q even. For if q is originally 
even, imbed the n + q dimensional Euclidean space in a similar space of 
n-+q-+1 dimensions so that the parametric equations of Ra are yt == yt (u), 
t= 1: ng; y” constant. Now the proof proceeds as above, yield- 
ing the desired result, since q does not appear in the final formula whatsoever. 
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THE INDEX THEOREM FOR A CALCULUS OF VARIATIONS 
PROBLEM IN WHICH THE INTEGRAND ` 
IS DISCONTINUOUS.* 


By Nancy Core. 


Introduction. The purpose of this paper is to establish Morse’s Index 
Theorem +. for a problem in euclidean m-space in which the integrand is dis- 
continuous and in which the basic curve g is a broken extremal with a finite 
number of corners. We assume that at each corner g is cut across by a regular 
(m — 1)-manifold of class C?, which is not tangent to either arc of g at the 
corner, and that at each corner g satisfies a set of.“ primary incidence rela- 
tions.” Our integral J along g will be of the form 


ym f F(a, %)dt-L>- +f Peia 


where g1,* * ', gx indicate the extremal arcs of which g is composed. Mason 
and Bliss (see Bliss 1) discussed the minimizing properties of a broken extremal 
in a problem with a discontinuous integrand in 2-space. Miles (1) extended 
their results to 3-space. In order to keep the notation as simple as possible, 
we treat below the case k = 2 in m-space. 


1. General hypotheses. Let R be an open region in the space of the 
variables (x) == (z',---,2"). Let g be a simple continuous curve lying in 
R and composed of two successive regular arcs g, and gz, each of class C?. 
We shall represent g in the form 


(1.1) zt = yt (t) (i=1,::-,m) 
where ¢ is the arc length and increases from ?’ to i” inclusive. 

Let c, where # < c < t”, represent the value of the parameter ¢ at the 
point of intersection of g, and ge. We suppose that at the corner t == €, g is 


cut across by a regular (m—1)-manifold M which is not tangent to either 
arc of g at t==c. We assume that M is representable in the form 


(1. 2) gst = zt (at, + +, at) = zt (a) (t=1,: : -m;n =m — 1) 


where the functions 2‘(a) are of class C? for («) near (0), and for (a) == (0) 
give the point t = c on g. We term M the deflecting manifold. 


* Received June 22, 1939. 
1 See Morse 2. Numerals following the name of an author refer to the bibliography 
at the end. 
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By a function of class C° in a domain § which is not entirely open, we : 
shall mean a function of class O? in an open region which contains the domain 
8 in its interior. 

In the space of the variables (x) let R, (Rz) be a domain of points (x) 
near gı (gz) excepting those points (z) not on M which lie on the same side 
of M as ga (gi). Let 


F*(a,r) = F(t. ++, a", rt + 7) 
be a function of class C° for (x) in R, and (r) any set not (0), and let 
FT) = ied Copa ae cer ee | 


be a function of class C? for (x) in Re and (r) any set not (0). We assume 
‘that each function F(x,r), k== 1 or 2, is positive homogeneous of order 1 
in the variables (r); that is 


(1. 3) Fe(s; kr) = kF" (a, r) 


for all numbers k > 0 and (r)=4 (0). We assume also that the problem is 
pe regular along g; that is ; 


Fu (ar) DO, u T. (ÿjæl,.-:,m) 


(i. 4) Fere (T, 7) MAI > 0 


for (æ,r) = (y, 7): respectively on gı and g, and for (A) any set not (0) 
and not proportional to (y) on gı and gə respectively. 
A curve of class D neighboring g will be termed admissible if it joins 
the initial point # of g to a point (z) on M and that point (z) on M to the 
- final end point ¢ of g, and crosses M just once. 
For our problem the familiar integral J defined along an admissible curve 
y of class D? neighboring g will be of the form 


n= f F(a, idt f F(z, à)dt 


where &* stands for the derivative of xt with respect to the parameter ¢ and 
where yı and, ya denote the arcs of y lying in R, and Re» respectively. 

We assume that the Euler equations hold along g; that is the Faler 
equations 


one ri am) 


SP Ft —0 


(1.5) 


hold along the arcs gı and gs respectively. A simple continuous curve which 
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les in Æ, or À: or both and which is composed of a finite succession of regular 
ares of class C? along which the Euler equations hold will be termed a broken 
extremal, te 

Tf hf(4) is any function of t defined for t near ¿=c on g, we shall 
represent the left and right limits of h*(t) at t= c, provided they exist, 
by hf and h* respectively. 

If the point {— c on g is denoted by P, by points near P on the negative 
(positive) side of M we shall mean points lying in R, (Re) near P. 


2. The primary incidence relations. It is easy to prove that a neces- 
sary condition that g afford a weak minimum to J relative to neighboring 
admissible curves of class D? is that the directions of g at t= c satisfy the 
following conditions 
(2. 1) LP) — Fr (yt, yt) ] ant (0) = 0, - 

(tom1,---,m;h=1,---+,n) 


where the subscript À indicates differentiation with respect to a*. From now 
on we shall assume that the directions of g at t == c satisfy (2.1). 
Consider the condition 


(2. 2) LP, (2, 17) — F?,*(2, rt) ] dat =0, - (i=1,:::,m) 


where (z) is given by (1.2), where (17) and (1+) denote directions near 
(y) and (y) respectively, where the differentials dz* are to be expressed in 
terms of the differentials da* using (1.2), and where (2.2) is to be regarded 
as an identity in these differentials for (a) near (a) = (0). 

For a point (z) on M near (a) — (0), the condition (2.2) may be 
written in the form 


(2. 8)’ {Fiale(a), 17] — F?,'[2(a), 1] }zat(a) == 0, 
(t=1,---,m;h—1,---,n) 


where the subscript h indicates differentiation with respect to a". The con- 
ditions (2. 3)’ will be termed the primary incidence relations at a point (z) 
on the deflecting manifold M. 

A broken extremal which is composed of two successive extremal arcs 
lying in Æ, and À: respectively will be termed an extremalotd if its directions 
at the corner (z) on M satisfy the primary incidence relations (2.3)’. We 
note that g is an extremaloid which satisfies the primary incidence relations 
with («) therein equal to (0). 

If to the conditions (2. 3)’ we adjoin the condition 


(2.3)” pipit ee 1, (t—=1,---,m) 
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the m conditions (2.3), considered as equations in the variables (r*), have a 
unique solution r+ = rt (a, r), where r* are functions of class C? for (a), 
near (0) ‘and (r-) near (=). That such a solution exists follows from the 


fact that g satisfies the primary incidence relations and from the fact that for — 


(a) == (0) the functional determinant of the left members of the system 
(2.3) with respect to the variables (1+) is not zero. To prove the latter fact 
we use the method of Bliss (see Bliss 2, p. 447) to show that the m- square 
functional determinant- 


2atF3+, | (a)=(0) 


(2.4) y Fa 








(i,j =1,: "m; h=1,: > sn) 


is a to + B- F,, where 


: 0 y . oa 
10 cn z" oy |+ 
Br i : f J? Fi, AM (a) = (0). 
0 Zn! . Za” 


The determinant B is not zero since the regular manifold M is not tangent to 

g2 att—c. That F, is not zero is a consequence of the positive regularity 

hypothesis (1.4). See Morse 1, p. 112. : | 
We state the following theorems. 


THXOREM 2.1. Given a point (z) on M near (a) = (0) and at this 
point a direction (r) near (Y). There is a unique extremal on the positive 


side of M which issues from the point (z) with a direction (rt) determined 


by the primary incidence relations, and along which the parameter is the arc 
length. 


THEOREM 2.2. An n-parameter family of extremals defined for t=c 
and intersecting M for t = c determines an n-parameter family of extremals 
defined for t= c which with the respective extremals of the given family 


. satisfy the primary incidence relations at t == c, and along which the parameter 


ts the arc length. 


Such a family defined for t= c is called the continuation of the given 
family defined for #<c. The two families of extremals form a family of 
extremaloids. | 

Similarly a family of extremals may be defined in terms of a family of 
extremals which is defined for ¢ on ga and intersects M for t==c and its 
continuation family for t on qu. Í i 
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8. Conjugate points. Let {° be a value of t, # S t < c, and let (y°) 
denote the direction cosines of g, at the point for which t= ¢°. Let K denote 
the unit (m — 1)-sphere with center at the origin. Let (f,- : :,8"1) be 
the parameters in a regular representation of K in the neighborhood of the 
point (7°), with (8) — (0) corresponding to (+°). The family of extremals 
issuing from ¢° with directions determined by (8) can be pre in thé 
form ; : 

(8.1) 0 . at = $e gs wt, BY) p (r, B)» ar dt 
| | oe ead: m; n=m—1); 

where 7 ig the arc length and (8) = = (0) gives gı- The functions ¢ and ¢,+ 

are of class O? for (8) near (0) and z on gı. The zeros, r + 1°, of the jacobian 


y D(#!,: . -,p") | 
0 3 = 
(3. 2) Dr, t ) D(r, B, eRe > B*) > (8) (0), | | 
of the family (8.1) are termed the conjugate points on g, of the point £ of 
g. The order of vanishing of D(r, t°) at a conjugate point r of ¿° is termed 
the order of that conjugate point. 
Since g, is not tangent to M at t == c, the equations 








$*(7, 8) — 2 (a) (f= 1s +m) 
, have a solution of the form | 
r==r(B),. . p a : 
ea ak == ax (B), ; (h=1, + -,n—m—1) © 


where r(B) and «(B) are functions of class C? for (8) near (0), and where 
7(0) =c, &(0) = 0. Geometrically this means that the extremals of the 
family (3.1) intersect Af for (8) near (0) in the space (x). | 

In order to define the conjugate points on gz of the point t° of gs it is 
convenient to represent the family (3.1) im the form 





(3. 4) at = $#(, 8) =Y (t, P) Gt ym) 
where : 5 

y tee i, 
and w 


y(t B) = y(t), 
-yc B) = si[a(8)]. 
Such a die of parameter ia admissible, and we note that. (8) = : (0) gives 
gx for the family (3. 4) and. that along 1 ‘the parameter t is the a arc ee 
Moreover the jacobian 
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FERN Diphyes ye Z 

(3.5) D(t, t ) ~~ D(t, Bt: Š -, 8") > (8) (0), 
of the family (8.4) vaniśhes if and only if the jacobian (3.2) vanishes, and 
it vanishes to the same order. Thus the conjugate points on g, of the point 
t° of g, are defined by the zeros, t£ #°, of D(t, t°), and their orders, by the 
order of vanishing of D,(t, t°) at the respective points. 

From Theorem 2. 2 it follows that there exists a family of extremals on 
the positive side of M which issue from the points (z) on M near (a) == (0), 
with directions r+ determined in (2.3) by the directions rt- of the family 
(3.4), and along which the parameter is the arc length. These extremals 
represent the unique continuations of the respective extremals of the family 
(8.4), and will be represented in the form š 


(3.6) tt yt (t, B), (¢ on gz) 
where ¢ is the arc length, where (8) = (0) gives gz, and where 
y*(c, 8) =2*[a(B)] (i—1,: Á m). 


The functions y* and yst are of class C? for ¢ on g: and (8) near (0). The 
zeros, tc, of the jacobian 


Dy + we) 
3.7 DAG = A, = (0), 
( ) 2( ) D(t, B, 5 +, B") (8) ( ) 
of the family (3.6) are termed the conjugate points on g, of the point t° of 
g The order of vanishing of D.(t, t°) at a conjugate point ¢ of t° is the 
order of that conjugate point. 
We shall find it convenient to refer to the family of extremaloids 


(3. 8) a! = yi (t, B) | (i on g) 

which is defined by (3.4) and (3.6) for ¢ on gı and gz respectively. 
Conjugate point determinant. We set 

D,(t, t°) — D, (t, i°), (? Sisco) 

D, (t,t?) = Dit), (c<tSt”) 


understanding that (8) ==(0) therein, and term D,(t,t°) the conjugate 
point determinant. The zeros, t£ t°, of D(t, t°) define the conjugate points 
on g of the point t of g,. The conjugate points of {° and their orders are 
independent of admissible changes of parameter. 

For any point ?°54c¢ on gz the conjugate point determinant D,(f, t°) 
-is defined in a similar fashion, using the family of extremaloids issuing from 
the point ¢° with directions near the direction of gz at t°. In so doing it is 
understood that the corner point ¿== c is considered as a point of g, 


(3.9) 
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Finally if °=«c, the conjugate points on g of the point t==c are the 
zeros, t>4c, of the conjugate point determinant D,(t,c) defined in terms 
of the family of extremaloids with a corner at the point t= c on g, with 
directions + determined as in (3.1) by the parameters (8) in a regular 
representation of K in the neighborhood of the point (y~), (8) = (0) corre- 
sponding to (ÿ-), and along which the parameter ¢ is the arc length. The 
order of vanishing of D,(t,c) at a conjugate point is the order of that 
conjugate point. . 


4. The second variation. Let 


ah == at (e), a*(0) = 0, (h == 1,: + :, n == m — 1) 
be a set of n functions of class O? for e near 0. Let 
(4.1) at = z(t, e) (i—=1,: m) 


be a 1-parameter family of admissible curves for which the functions zi(t, e) 
are of class C? for t on g, and e near 0 and for t on g: and a near 0 respec- 
tively, which contains g for e == 0, and which satisfies the identities 

(4.2) | z(c, e) ==2'[a(e)]. 


For each value of e near 0, the integral J evaluated along the admissible curve 
determined by e is a function J (e) of class C*. We obtain a formula for the 
second variation in which we set 
204 (9, 7) = Fehi H Ea + Fetan, 
(x= 1,23t,f—1,---+,m) 

where the arguments of the partial derivatives of FF are (x,r) = (y,ÿ) On ge. 

Since g satisfies the primary incidence relations at ¿= c, the second 
variation takes the form 


$f 
J"(0) = buat + f 20%(m dt + f 20*(9, dt, 
where ; 
(4. 3) bu = [Fy 7) — Fay, y+) It (0), 
(t=1,:--,m;h,k—1,---,ny 


and where 7‘ and the n constants w are respectively the variations æt(4, 0) 
and a#(0) and satisfy the secondary end conditions 


(4.4) a(t) =0, h(t”) —0, (i= 1,- + +, m) 
and the secondary corner conditions 

(4. 5a) nt = %'(0) 0%, l (h==1,+-+,n) 
(4. 5b) nt = ani (0) ah, | 
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5. Solutions of the Jacobi equations. The Jacobi equations for £ on 
gi and g, are 
(5.1) Loos, Meter) 
‘ where k=— 1 and x=? respectively. Throughout this section we shall assume 
that « is fixed ; that is, x is either 1 or 2, not both.” 

It is well-known that the Jacobi equations for t on s are satisfied 
ideñtically by tangential solutions of the form 


p(t)¥#(E) “Geax a) 


where p is an arbitrary function of ¢ of class O° for t on gx. If, for t = t*, 
a solution of the Jacobi equations 7*(t) satisfies the relation 
u(t) — p(t) y(t) — 0, Ge: 5m): 
we shall say that nt(f) vanishes modulo a tangential solution at t == {*, If a 
solution of the Jacobi equations for ¢ on gx is determined except for the 
` possible addition of a tangential solution, we shall say it is determined modulo 
a tangential solution, or more briefly, mod T. We seek conditions by which 
solutions of the Jacobi equations for ¢ on ge are ee mod T. 
- Since the determinant of the coefficients of }* in (5.1) is zero, in order : 
to obtain’ solutions of the Jacobi equations for ton gr we consider the auxiliary 
differeritial equations 


d dias 

- (6.2) > à ge o o mO, | (s,j—1,: ` ' m) 
P. | 
À (i'n!) = 0, 


'. for ¢ On ge. Ct. Bliss 3, p. 199 and Graves 1, p. 17. To solve the m + L 
equations (5.2) we introduca the system ` 


R S 
a Ont + Ay 0, 


(Pn) =0, 


_where À is an unknown function of t of class O? for ¢ on gx. Using the method 
of Morse 1, p. 124, it is easy to prove that A==0 in solutions of (5.3). Thus 
(5.3) may be'regarded as identical with (5.2). Hence the 7 in (5.2) can 
be expressed as linear homogeneous functions of the variables (7,%) with 
coefficients which are of class C* in ¢ for t on ge. - 

The most general tangential solution of the waa differential panoni 
($ 2) is of the form 


(5.3) 


\ 
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© + bt) y*(t), | 1, NT m) 


where a and b are constants. 
‘We shall prove the Are lemma. : 


Lemma 5.1. Any solution of the Jacobi equations for t on ge may be 
written as a solution of the auxiliary differential equations for t on ge plus 
a tangential solution of the Jacobi equations for t on ge. 


Let (7) be any solution of the Jacobi equations for ¢ on gx. Consider 
the difference ` 


y(t) =r (i) — pt)" (t) | (d= 1,- ++, m) 
‘where p(t) is a function of ¢ of class C? for ¢ on gx. The difference y*(t): 
is a solution of the Jacobi equations for ¢ on gx. If we choose p(t) so that 


T= (j= 1sm) 


then we have 
(5. 4) | | p= is OP) 


and y*(t) is a solution of (5.2), as was to be proved. 
We shall prove the following lemma. 


Lemma 5.2. A solution of the Jacobi equations for t on gx which’ 
vanishes with its derivative at a point, is identically equal to a tangential 
solution of the Jacobi, equations for t on gr. 


Let (7) be any solution of the Jacobi sation for ¢ on ge which 
vanishes with its derivative at a point t= t*. Consider the difference ‘ 


(5.5) wi) = (t) =el), 

where p(t) is a function of class C? for ¢ on gx which. satisfies (5.4) and 
where p(t*) = p(t*) = 0. Then wt(t) is a solution of (5.2) which vanishes 
with its derivative at #—#*. Hence wt(t)==0. It follows that y*(t) is 
identically equal to a tangential solution of the Jacobi equations for ¢ on gr. 


6. The secondary incidence ‘relations. The secondary problem is non- 
parametric in the space of the a (7,5: + +, 9%, t). The n-plane 


N of mm za (0) 0%, t=? (i=1,: n m; h=: ‘s n= m— 1) 


is the analogue of the deflecting manifold M di will be Aa to as the 
deflecting, plane N. The deflecting plane N is regular by virtue of the fact 
that the rank of the matrix | za*(0) |] is m. The only tangential solutions of 
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the Jacobi equations or of the auxiliary differential equations for which : 
t = c gives a point on N, are those tangential solutions which vanish at t == c. 

In the space (7, ¢), a solution of the Jacobi equations which is of class C° 
for ¢ on g, and ge respectively, and which for t==c has a corner on N is the 
analogue of a broken extremal in the space (x) with fixed end points and a 
single corner on the deflecting manifold M. ` 

We shall next define the secondary incidence relations. To that end let 
(u) be an arbitrary set of n == m — 1 constants and e a parameter neighbor- 
ing e==0. Consider the 1-parameter family of extremaloids 


(6.1) gt == vi (t, e) (ii, :,m) 
determined by setting p* == eu* in the family of extremaloids (3.8). The 
family (6.1) satisfies the following identities in e: 

(6.2) z'(c,e) —=32t{a(eu)]. 


The variations 2,'(t,0) of the family (6.1) will be denoted by y(t). 
Differentiating the identities (6.2) with respect to e and setting e == 0 yields 


A 
(6.3). atm (0) D (h, k=l, -,n) 
where 
dar at 
de Tap 


For the family (6.1) the primary incidence relations (2. 3)’ reduce to 
a set of n identities in e. Upon differentiating these identities with respect 
_to e and setting e = 0, we obtain 


: 
(6. 4) bm + 216 (0) 64]; = 0, Gi, -,mihk=1,..,n) 


where bx is given by (4.3) and where 

(6. 5) Er = Dpt (7; T) br = 07 (7°, 7) : 

Setting da*/de = w*, (6.3) becomes 

(6. 6) nf = art (0), 

and (6.4) becomes 

(6.7) baxo + za (0) i]; == 0. 

We term (6.7) subject to (6.6) and (6.5), the secondary incidence relations. 
It is understood that the independent variables in (6.7) are (w), (7) and (7*). 


If „f(t) is a solution of the Jacobi equations with a corner on N for 
t= c, and if its slopes 4#* and #** with the set (w) determined by (6.6) 
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satisfy the secondary incidence relations (6.7), then 7 (2) will be said to 
satisfy the secondary incidence relations. 

In order to solve the secondary incidence relations for the variables (9°) 
in terms of the remaining variables, we adjoin the condition 


vn, —0 | Gi: -p m) 

to the relations (6.7%). The m conditions 
baot + at(O)GT = 0, (jm. +n) 

yeh, —0, . Ss 
are called the restricted secondary incidence relations. Since the problem is 
positive regular along gs, and since the regular manifold M is not tangent to 
ga at t= c, the restricted secondary incidence relations (6.8), considered as, 
equations in the variables (%'), have a unique solution (7) de aa in 


terms of the remaining variables. 
We have the following lemma. 


(6.8). 


Lemata 6.1. Corresponding to any solution of the auctliary differential’ 
equations for t on gı which for t == intersects the deflecting plane N at the 
point (w), there exists a unique solution of the auriliary differential equations 
for t on gz which for t = c gives the point (w) onthe deflecting plane N and 
at (w) has a slope uniquely determined by the restricted nou incidence 
relations. 


Such a solution is called the continuation for t on ga of the gren solution 
for ¢ on 91. 

Returning to the problem of expressing the saabe (a ) of the sec- ` 
ondary incidence relations (6.7) in terms of the remaining variables, we state 
the following lemma. 


Lemara 6.2. Corresponding to sets (4-) and (o), there ts a set (7) 
determined except for the possible addition of a set of the form (ky*) where 
k is a constant, by the secondary incidence relations (6.7). 


The proof of the lemma is based on the following statements. 

(a). If the variables (7) with sets (7) and (w) satisfy the aay 
incidence relations (6.7), then the set (ÿ* + kyt), where k is a constant, 
with the same sets (7°) and (w) satisfies the secondary tneidenge relations 
(6.7). 

(8). Any two sets (7*) and (4°) which with sets (y) and (w) satisfy 
the secondary incidence relations (6.7) differ by a set of the form (ky*), 
where k is a constant. 
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Statements («) and (£) may be readily verified by direct substitution. 

The following theorem is a consequence of Lemma 6.2 and the fact that 
thé only tangential solutions of the Jacobi equations which for tc define 
a point on the deflecting plane N are those which vanish at t = c. 


Turorem 6.1. Corresponding to any solution (7) of the Jacobi equa- 
` tions for t on g, which for t = c defines a point (w) on the deflecting plane N, 
there exists a solution of the Jacobi equations for t on ĝa which for t =c 
gives the point (w) on the deflecting plane N and which at t == c has a slope 
determined except for a constant multiple of ¥* by the secondary incidence 
relations (6.7). This solution is unique modulo a tangential solution of the 
Jacobi equations for t on g} which vanishes at t == c. 


‘Such a solution of the Jacobi equations is termed a continuation for t on 
gz of the given solution for ¢ on g:. i l 


Corozrany 6.1. -A tangential solution of the Jacobi equations for t on 
gı has continuations for t on ga, if and only if it vanishes at t =c. Moreover 
its continuations for t on gs are tint tangential solutions which vanish at 
eTA k 

- We state the following theorem. 


THEOREM 6.2. -The continuations for t on gz, if they exist, of any two- 
mutually conjugate solutions of the Jacobi equations for t on g, are mutually 
conjugate in the sense of von Escherich.: ” 


Let nt and 7* be any two solutions of the Jacobi equations for ¢ on ga 
which are mutually conjugate in the sense of von Escherich; that i is, suppose 
the identity 


(6.9) po — Fit = 0 aes 


holds for ¢ on gi. (See Bolza 1, p. 626). We assume that the continuations 
of (7) and (7) do exist. Making use of the fact that the given solutions (7) 
and (5) and their respective continuations satisfy the secondary incidence 
relations, it is easy to prove that the continuations of (7) and (7) are mutually 
conjugate in the sense of von Escherich. (Cf. Morse 1, p. 52.) 


7. The determinant A(t, t°). In this section we shall assume, unless 
otherwise specified, that a point 4° on g, means a point t such that 
St <c. Recall the conjugate point determinant D,(t,i°) defined in 
(3.9). Let the (p + 1)-st column of D,(t, t°) be represented in the form 
C à w(t), O (Eele mipæt,t:.,1) 
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_ and let the first column be multiplied by (es £) (t— c) so that it will be 
a tangential solution of the Jacobi equations which vanishes at t == {° and is 
continuable at t == c. The determinant A(t, t°) is defined as follows: 


A(t, t) —|(t—#)(t—c) H(t) m)l. 


For tt and tsc, the determinant A(t, 4°) vanishes if and only if the 
conjugate point determinant D,(t, 1°) vanishes, and to the same order. Hence 
the zeros, #34 ¢° and tc, of A(t, t°) define the conjugate points on g of - 
the point t° of g,, and the order of vanishing of A(t, £) at a conjugate point 
defines the order of that conjugate point. For t= c, A(t, t°) always vanishes. 
The point c of g will be conjugate to the point t° of g, if and only if the 
order of vanishing of A(c, £) is greater than 1, and the order of c as a con- 
jugate point of ¢° will be 1 less than the order of vanishing of A(c, t°). 

The variation mt(t) representing the (p + 1)-st column of A(t, t°) is a 
solution of the Jacobi equations determined by g. The variation yp‘(t) is, 
moreover, precisely the variation z.*(t,0) of the family (6.1) when u’ == 1 
and the other n — 1 ws are null. Since the secondary incidence relations are 
linear in all the variables, we have the following theorem. 


THEOREM 7.1. The combination w,t(t) of the last n columns of 
A(t, t°) ts a solution of the Jacobi equations which satisfies the secondary 
incidence relations (6.7), provided (w) therein is taken as 


ot = ap (k, p=1, n). 


We shall prove the following theorem. 


THEOREM 7.2. The m columns of the determinant A(t, t°) represent m 
linearly independent solutions of the Jacobi equations for t on g, and the last 
n columns represent solutions which are linearly independent of tangential 
solutions. 


That the columns of A(t, ¢°) are linearly independent for ¢ on g, follows 
from the fact that A(t, t°) does not vanish identically for t sufficiently near t°. 
Suppose first then that the last n columns are linearly dependent upon a tan- 
gential solution for ¢ on gı; that is, suppose m constants (c), not all (0), 
exist so that 


(7. 2) | Comp (t) = p(t) y (t) (=1,:::,n) 


where p is a function of t of class C? for t on gi. The function p cannot be 
identically zero for ¢ on g:, for that would imply the linear dependence of 
_ the last n columns of A(t, t°) for ton gı. But since (c) s£ (0) and ps0, 
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the identity (7.2) implies that A(t, t°) vanishes identically for t on gı. From 
this contradiction we infer that the last n columns of A(t, t°) are linearly 
independent of tangential solutions for ¢ on g1. 

It remains to prove that the theorem is true for ¢ on i ga Next suppose - 
that the m solutions of the Jacobi equations represented by the columns of 
. A(t, t°) are linearly dependent for ¢ on gz; that is, suppose that constants 

(— d, Gi," “`, Ca), not all zero, exist so that the identity ` : 

cmp (t) =d (i — t) (t— eyt) (dam + ;mpæl oan) 
holds for ¢ on g2- The constants (c) cannot all be null, for c, == 0 for each 
p would imply that d—0. By Theorem 7.1 the solution cymt(#) for t on gs 
is a continuation of Cp #(t) for ¢ on gı. On the other hand since cympt(t) 
for t on. gz is identically equal to a tangential solution which vanishes at t= c, 
it must be a continuation of a tangential solution which vanishes at t= c. 
Hence for ¢ on g, we have cmp (t) + p(t)yt (t) =0, where p(t) is a function 
of class C? for t on g, and where p(c) —0. Now since (c) = (0), this 
implies the linear dependence of the last n columns of A(t, ¢°) on a tangential 
solution for ¢on gi. From this contradiction we infer the linear independence 
of the m columns of A(t, t°) for ¢ on go. . 

The proof that the last n columns of A(t, t°) are linearly’ independent 
of tangential solutions is similar and will be omitted. 
| That the following theorem is true for t on g, may be verified by substi- 

tution in (6.9). That-it is true for ¢ on gs follows from Theorem 6. 2. 


THEOREM 7.3. The columns of A(t, 1°) represent mutually conjugate 
solutions of the Jacobi equations. 


We shall prove the following theorem. - 


THEOREM 7.4. -A necessary and sufficient condition that a point t =t" 
on c < t*& t be conjugate to a point t= t° on YS t < c is that there 
exist a solution of the Jacobi equations which vanishes at t == {° and į = t*, 
which satisfies the secondary incidence relations at t — c, and which is not 
gwen by a tangential solution of the Jacobi equations. | 


If the point == {* is conjugate to the point ¢ == °, then A(t, t°) P 
for. ¿= t*. There exists then a proper linear combinations (w) of the 
columns of A(t*, t°) which vanishes. Let (d,c,:-:,c,) denote the con- 
stants in this linear combination. First, I say that cp is not zero for each p, 

‘for (c) — (0) would imply d == 0 which is impossible. The solution (w) 
defines a solution of the Jacobi equations which vanishes at t == ¢ and t= t*, 
which satisfies the secondary incidence relations at t= c, and which is not 


k given by a tangential VUE 
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Conversely let 7*(#) be a solution of the Jacobi equations which vanishes ` 
at f= {° and ¢<=7%*, which satisfies the secondary incidence relations at 
t==c, and which is not given by a tangential solution. Consider the difference 


(7.3) TO — AE) EN — com), 


where mt(#) is given by (7.1) and (d, ©,“ °,c,) are constants. The dif- 
ference (7.3) is a solution of the Jacobi equations which vanishes at t= ?°. 
Moreover, by virtue of the fact that the determinant 


FC) itte) 


is not zero, the constants in (7.3) may be chosen. s0 that the derivative of ` 
(7.3) also vanishes at ¢== ¢°, Consequently for ¢ on gı, we have 


(7. 4) MCE) — cmt (t) = 0, (c) (0), 


modulo a tangential solution which vanishes at t = £ and t =c. The solu- 
tion represented by the left and right members of (7.4) must have the same 
continuations for é on gz. A continuation of the left member of (7.4) is 
T (t) — cympt(t), and all the continuations of the right member are tangen- 
tial. For ¢ on gs, then, we have 4*(t) me =0, modulo a tangential 
solution which vanishes at t =. 

Since 7#(t*) 0, it follows that for some constant k, cynpt(t*) 
— ky (t*) — 0. But (c) ~ (0), so that Ati) must vanish at ¢==¢*, 
and the point {== ¢® is then conjugate to the point t == 4°. The proof of 
Theorem 7. 4 is complete. | | 

À necessary and, sufficient condition that a point £—1* on << c 
be conjugate to a point t== t° on t£ t° < c is that there exist a solution of 
the Jacobi equations which vanishes at ¿= {° and t=: t*, which satisfies the 
secondary incidence relations at t= c, and which is not given by a tangential 
solution of the Jacobi equations. 

Consider the m-square determinant 


a(t) =|) EA ml 
(§$—=1,-+-,m;p=1,---,m) 


in which the last n columns are solutions of the Jacobi equations which vanish 
at the point ¢ == 2° on gi, which satisfy the secondary incidence relations at 
t= c, and which are linearly independent of tangential solutions. 


LEMAA 7.1. The order of vanishing of Hey at any point = bong 
te equal to the nullity v of 8(b). 


The proof of this lemma follows the method of Morse in (2), but slight . 
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modifications are necessary since we are using solutions of the Jacobi duos | 
in place of solutions of his “ restricted Jacobi equations. ‘a 
Let b be any value of ¢ on g except t and c. Let r be the rank of oy. 
Then r > 0, and y =m — r. We suppose that the last n columns of 0(b) 
` have been reordered so that the rank of the first r columns is r. We also 
suppose that the rank of the last v columns is zero; for if it were not zero, 
it could be made zero by adding suitably chosen linear combinations of the 


first r columns to the remaining columns. Understanding that this has been 
done, let | 


uat(t), vat (t) (=1,:::,m; h =1,: - -,r; k=; + +,v) 
represent the first r and last y columns respectively of @(t). 

Applying the integral form of the law of the mean to the elements in | 
- the last v columns of 0(t) yields 
(7. 5) 6(t) = (t—bYB(t) 
where the function B(t) is continuous in ¢ for t on g, and for t on gs 
respectively, and where . : 
| B(b) — | wt (b). int(b)|. 

The lemma will follow from (7.5) if B(b) 40. 

Suppose that B(b) —0. There will exist then a proper linear combina- 
tion (w) of the columns of B(b) with coefficients (¢,,--+,¢r, — d;,°::,— dy) 
such that (w) = (0). Moreover the constants de cannot all be zero. For 
dx = 0 for each & would imply that cauat(b) — 0 for each t, and the rank of 
8(b) would be less than r. We set ; ; 

| ` u(t) — cuit, vi(t) — devit (t) 
(i=1,: m; h =1,: --;r;k=1,:::,y). 
Hence | D 
(7.6) ut(b) = vt (b), vt(b) = 0. 
We note that ut(b) cannot be zero for each i. For that would imply 
vt(b) = vt (b) = 0 for each i, and hence that (v) is a tangential solution 
which vanishes with its derivative at tb. That this is impossible follows 
from the hypothesis that the last n columns of 8(t) are linearly independent 
of tangential solutions. 
Making use of (7.6) and of the fact that u‘(t) and v*(¢) are mutually 
conjugate for ¢ on g, we obtain 
Fre [y (b), 7(b)]0* (E) (b) — 0, 

- where x is 1 or 2 according as t = b lies on gı or gz. It follows from (1. 4) that 


(7.7) t(D) = kyt (b) | (k 7 0). 
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But (7.6) and (7.7) imply that (v) is identically equal to a tangential 
solution which vanishes att}. This is impossible since the last n columns 
of O(t) are linearly independent of tangential solutions. We conclude that 
B(b) is not zero, and the lemma follows from (7.5) when 6 =4?° and bse. 

The lemma is true when b= {° and b =c, but the proof is slightly 
different. 

Since the determinant A(t,{°) satisfies the conditions imposed on 6(¢) 
in Lemma 7.1, we have the following theorem. 


THEOREM 7.5. The conjugate points on g of a point t° of g, are isolated 
and possess orders equal to the nullity v of A(t,t°) at the respective zeros 
Et? and t>4c of A(t, t°). The order of t =c as a conjugate point of 
t= t ig y— 1. : 


Since any solution of the Jacobi equations which vanishes at t ==? and 
satisfies the secondary incidence relations at t — c can be written as a linear 
combination of the last n columns of A(t, t°) and a tangential solution which 
vanishes at t — £ and ¢= c, we have the following theorem. 


THEOREM 7.6. If a point t=t* on c < t* SU” ts conjugate to a point 
t= ont St <c, the maximum number of solutions of the Jacobi equa- 
tions which vanish at t = t° and t = t*, which satisfy the secondary incidence 
relations at t = c, and which are linearly independent of tangential solutions, 
equals the order of the conjugate point. | 


If a point {—4{* on St Sc is conjugate to a point t—2° on 
YSE <c, the maximum number of solutions of the Jacobi equations which 
vanish at t= ¿° and ¿= t*, which satisfy the secondary incidence relations 
at t= c, and which are linearly independent of tangential solutions, equals 
the order of the conjugate point. 

Throughout this section we have assumed that t° £c is a point of gı. 
Corresponding theorems hold if ¢° is a point on gz for which e < P S t”. 

If t =c, we define A(t, c) as the determinant obtained by multiplying 
the first column of the conjugate determinant Dg(t,c) by (t—c). For ic, 
A(t,c) vanishes if and only if D,(t,c) vanishes and to the same order. The 
m columns of A({,c) represent m linearly independent, mutually conjugate 
solutions of the Jacobi equations which vanish at t= c and satisfy the sec- 

-ondary incidence relations with (w) = (0) therein. Moreover the last n 
columns represent solutions which are linearly independent of tangential 
solutions of the Jacobi equations. A necessary and sufficient condition that 
a point t = ¢* on g be conjugate to the point t = c is that there exist a solu- 
tion of the Jacobi equations which vanishes at t = c and t= ¢*, which satisfies 


3 
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the secondary incidence relations at £ == c, and which is not given by a tan- 
gential solution of the Jacobi equations. Furthermore the conjugate points 
on g of the point t == are isolated and possess orders equal to the nullity of 
A(é,c) at the respective zeros {4c of A(t,c). The order of vanishing of 
A(c,c) is equal to the nullity m of A(c,c). Finally, if a point é === {* of g 
is conjugate to the point ¢==c, the maximum number of solutions of the 
Jacobi equations which vanish at t==c and t= 1¢*, which satisfy the sec- 
ondary incidence relations at t= c, and which are linearly independent of 
tangential solutions, equals the order of the conjugate point. 


8. The index theorem. Let tae, ome 1," > ‘,g—1l,p<+1,:::,X 
be a set of values of ¢ such that 


(8. 1) Wohi La LE San LT LEA On, 

(do = 0’, Orr = t”) 
and such that no one of the À + 1 segments into which g is divided by the 
points of (8.1) contains a conjugate point of its initial end point. 

_ Let Mo be a regular (m — 1)-manifold of class C? which intersects g at 
the point t == ae, but which is not tangent to g at that point. We suppose 
that Mo is regularly represented neighboring the point t de in the form 


gt = ZW (Bat, , Bot) (11, ,min—m—1; o not summed) 


and that (Bo) = (0) determines the point t == ao on g. Set ap == c, and let 
M, be an alternative notation for the deflecting manifold M. The manifolds 
My, qg=—1,:::,X, are termed a set of intermediate manifolds. 

Let the points t == a, and t == a4), on g be denoted by A and B respec- 
tively. Let points P, on the respective intermediate manifolds A, be chosen 
so near to g that the successive points 


(8. 2) A, P,, +, PaB 


can be joined by extremal segments. 
Let (v) be a set of An variables, the g-th n of which are the parameters 
of the point Pg on Ma; that is, 


Co j +0) un (Br B Bar: "> Ba" a, ::,a, Ban; | eure 


For (v) sufficiently near (0), the points (8.2) are completely determined by 
the set (v) and the points A and B. The broken extremal Æ (v) joining the 
points (8.2) will be expressed in the form 


(8. 3) at X4(t, v); (t= 1,: i s m) 


where X* and X;* are functions of class C? in their arguments for (v) near 
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(0) and # on each of the À + 1 components of H(v), and where (v) = (0) 
gives g. We assume that the. parameter ¢ has been chosen so that t=? 
and t= t” give the points A and B respectively, so that the values t — ay 
(g=1,---,A) give the respective points Py on Mg, and so. that between 
each two successive vertices (8.2) the rate of change of ¢ with respect to the 
' arc length is constant. ae | 

The integral J considered along E(v) is a function of class C? for (v) 
‘sufficiently near (0), and will be denoted by J(v). 


Tumors 8.1. The function J(v) has a critical point for (v) = (0). 


To prove this theorem we consider the first partial derivatives of J(v) 
© with respect to the variables v”, r == 1,- - -,An. Integrating by parts, setting 
_(v) = (0), and making use of the fact that g satisfies the primary incidence 
relations at ¢==c, we find that the first partial derivatives of J(v) with 
respect to the variables v” all vanish for (v) = (0). | 
We set B en À 
Q(v) = Jour 0, > (r,8—1,:::,an) 


where the superscript 0 indicates evaluation for (v) — (0), and term Q(v) 
the index form associated with the extremaloid g. -If F is the rank of Q(v), 
the number An—# is termed the nullity of Q(v), and will be denoted by 
N(Q). The tndex of Q(v) is the number of negative characteristic roots 
belonging to | J°s"s* |, and will be denoted by I(Q). 

In order to obtain a representation of the index form Q(v) in terms of 
the second variation we set up the family of broken extremals Æ determined 
by the points A and B and the set (ev',- - -, ev"), where (v) is held fast 
and e is a variable near 0. The family of broken extremals Æ will be 
represented in the form : 

(8. 4) gt = ri (t, e) {i=1,"::,m) 


where the functions z*(t,¢) are defined by referring to (8.8) and setting 
X*(t, ev) == at(t,e). The family (8.4) has the property that for e near 0, 


gt (c, 6) == zt (eat, - -, ea”), (i= 1,: : -m;n =m — 1) 
Tt (ao, 6) = Z" (Bo, Š aBa”) . , 
(e = 1,: -,u—1,u+1,::-,A; onot summed). 


Before proceeding with the problem of the second variation for the family 
(8. 4) it will be convenient to present several definitions and prove a lemma. 
Let nt(t) be a broken solution of the Jacobi equations which is defined 
and continuous for ¢ on g, and which has corners only at the points t == c 
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and t= as (o—1,---,n—1,n+1,---,A). The broken solution (t) 
will be. termed admissible if it ‘satisfies the end conditions | 


(E) =0, f(t”) —0, (1—1, cm) 
and if with a set (v) it satisfies the conditions | 
(8. 5)’ nt (c) = znt(0) a4, (h—1,---,n—=m—1) 
(8.5)” (do) = Zi (0) Bo, ' (ø not summed) 


at the corners t = c and t == ao respectively. If the broken solution y‘(t) is 
tangential, a necessary and sufficient condition that it be admissible is that it 
vanish at the points at which ¢ takes on the values 


(8.6) Qo, Ary* ` ', Apr, Gps Aer © > Orr Peu A (ay = 0). 


Any two admissible broken solutions of the Jacobi equations will be said 
to be equal mod T°, if their difference is an admissible broken tangential ` 
solution. Understanding that a solution determined mod T° means a solution 
which is unique except for the possible addition of an admissible broken 
tangential solution, Lemma 8. 1 is as follows. 


Lemma 8.1. An admissible broken solution of the Jacobi equations | 
determines a unique set (v), and is determined mod T° by a set (v). | 


The first part of the lemma follows from the fact that each of the inter- 
mediate manifolds M, is regular. To prove the second part we note that the 
variation f(t) = 2.'(t,0) of the family (8.4) is an admissible broken 
solution corresponding to a given set (v). Let y(t) be any other admissible 
broken solution corresponding to the same set (v). Then the difference 
w(t) = mt(t) —7*(t) is a broken solution of the Jacobi equations which 
vanishes at the points at which ¢ takes on the values (8.6). Since no one of 
the A + 1 segments 
(8.7) ` a StS an (j—=0,1,: ::,à) 


of g contains a conjugate point of its initial end point, wt({) is an admissible 
broken tangential solution. Hence y*(t) and y(t) are equal mod T°, and 
the proof of the lemma is complete. 

Returning to the function J (ev), differentiating twice with respect to e, 
integrating by parts, and setting 6 = 0, we find that 


(8. 8) J'y TU == Deel, *| Fe + > Ca A DA + TtoeF?;t] 
; e 


a +4” 
+ fi amat S 20%(n 9) at 
v ° 
(i=1,-",m; gee dos pT p +1, A) 
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where 7*(t) is the variation z,*(t,0) of the family (8.4) and where « is I 
for ao on g, and 2 for a on gs. The terms corresponding to the limits # 
and ¢” vanish in (8.8) as do the terms corresponding to the points as. Hence 
we have the following theorem, readily proved with the aid of Lemma 8.1. 


Tuéonex 8.2. The index form Q(v) admits the representation 


Q(v) = mada + f 20"(n,4)dt+ f 20% (9,9) at $ 
(h, k=], -< n= m— 1) 


where n'(t) is any admissible broken solution of. the Jacobi equations deter- 
mined mod T° by (v), and where 


bi = [P(n 5) — FY (7%, ÿ*) ]24m(0) (i=1,:::,m). 


An admissible broken solution of the Jacobi, equations 7*(¢) will be 
termed a apd solution if’at t = c it satisfies the secondary incidence rela- 
tions with w* == a therein and if corresponding to each corner = as there 
is a constant M such that 


Ait = Jaa = byt (ae) (1, ":,m). 
We shall prove the following lemma. 


Lemma 8.2. . À necessary and sufficient condition that a set (v) £ (0) 
be a critical point of Q(v) is that (v) determine mod T° a special solution of 
the Jacobt equations. 

A necessary and sufficient condition that (v) = (0) be a critical point 


of Q(v) is that 
(8.9) Qr = 0 © (r=, sy An). 


We shall prove first that the conditions (8.9) imply that any admissible 
broken solution (7) determined mod 7° by (v) is a special solution. For o 
fixed, the n members of (8.9) representing the partial derivatives of Q(v) 
with respect to (Bot, *' > , Bo"). may be written in the form . 


. (8.10) AZZ, = 0, (i= 1, + -,m;k—1,: + +,n) 
where 
Abi = OF), 
with «= 1 or 2, according as ae is a point on g, or gz respectively. Moreover 


(8.10)” Ati (ae) = 0. 
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The m equations (8.10) have a unique solution .Af; —0. But from the ` 
definition of Af;, we see that Ag; == 0 implies that 


(8. 11) Ag! = by (ae) ` 
` for o fixed. 
The n members of (8.9) representing the partial derivatives of Q(v) 
with respect to (at,- --,@") may be written in the form ` 


(8. 12) baxa + zet (0)é]; == 0 (==, -,m;h, k =1, cn). 


Thus (7) satisfies the secondary incidence relations at t == c with w* = a* 
therein, and the condition of the lemma is necessary for a critical point. 
Conversely if (y) is a special solution determined mod T° by (v), (8.12) 

holds for = c, and (8.11) holds for each 6. It follows that (8.10) holds 
for each oc. Hence (8.9) holds, and the condition of the lemma is sufficient 
for a critical point. 

` Understanding that solutions which are linearly independent mod T° are 
solutions which are linearly independent of admissible broken tangential 
solutions, the following lemma is readily proved. 


Lemma 8.3. A set of admissible broken solutions of the Jacobi equa- 
tions are linearly independent mod T° or not, according as the sets (v) are 
linearly independent or not, and conversely. 


From Lemmas 8.2 and 8.3 we infer that the nullity of the index form 
is equal to the number of, special solutions which are linearly independent 
mod T”. 

A solution of the Jacobi equations for ¢ on g which is of class C* for ¢ on 
gı and gz respectively and of class ©? on each interval (8.7), which vanishes 
at t= Ë and t= ?’, and which satisfies the secondary incidence relations at 
i = c will be termed a reflected solution. A reflected solution is admissible 
if it satisfies conditions of the form (8.5)” at each point t == ag. An admissi- 
ble reflected solution is a special solution which has no corners at the points 
t = Go. We shall prove the following lemma. 


Lemma 8.4. Any special solution of the Jacobi équations is, identically 
equal mod T° to an admissible reflected solution. 


Let 4*(t) be any special solution. Let p(t) be a continuous function for 
l on g with the following properties. On each interval (8.7), p(t) is of class 
C? with p(a;) = pau) = 0, and with p(a;) and p(aj4:) so chosen that for 
t on g, the solution 


a(t) =n (t) — p(t)7 (t) G=1,:::,m) 
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has no corners at the points t==do (o==1,---;p—i,p~+l,---,A). 
Then 7t({) is an admissible reflected solution, and the proof of the lemma is 
complete. 
With (7) and (%) defined as in the proof of Lemma 8.4, we see that 
(3) is identically equal to an admissible tangential reflected solution if and 
- only if (7) ==(0),mod T°. Moreover, if a finite set 9 of special solutions 
be replaced by a set A of admissible reflected solutions, equal mod T° respec- 
tively to the special solutions of 9, then the members of A are linearly 
independent of admissible tangential reflected solutions if and only if the 
members of S are linearly independent mod T°. 
We shall prove the following theorem. 


THEOREM 8.3. The nullity of the index form Q(v) equals the order of 
t as a conjugate point of v. 


If the nullity of Q(v) is v, there are v special solutions which are linearly 
independent mod T°, and therefore y admissible reflected solutions which are 
linearly independent of admissible tangential reflected solutions. It follows 
from Theorem 7. 6 that ¢” is a conjugate point of # of order v. | 

On the other hand, if #” is a conjugate point of # of order v, there are v 
reflected solutions 

me (t) (t=1,-",;m;kml,"-:,y) 


which are linearly independent of tangential reflected solutions. The solutions 
net(t) are not necessarily admissible in that they may not satisfy conditions of 
the form (8.5)” at the points t == 40 (e = 1,: --,p—l p41, 7A). 

But any reflected solution can be made admissible by adding a suitably 
chosen tangential reflected solution. Moreover, if a finite set R of reflected 
solutions be replaced by a set A of admissible reflected solutions which are 
equal, modulo a tangential reflected solution, respectively to the members of 
R, then the members of A are linearly independent of admissible tangential 
reflected solutions if and only if the members of R are linearly independent 
` of tangential reflected solutions. 


We assume then, that the v reflected solutions y,'(¢) which are linearly 
independent of tangential reflected solutions have been replaced by v admissi- 
ble reflected solutions t({) which are equal, modulo a tangential reflected. 
solution, respectively to the solutions m*(t). It follows that the solutions. 
q'(t) are linearly independent of admissible tangential reflected solutions. 
But the y admissible reflected solutions 7,‘(¢) are special solutions which are 
of class C* for £ on gı and gz respectively. Hence the nullity of Q(v) is v. 

We continue with the following theorem. 
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`. THEOREM 8.4. The index of Q(v) ‘equals the sum of the orders of the 
conjugate points of Y on Y <t < t. 


To prove Theorem 8. 4 we use the method of Morse (2, Th. 4.2). That 
is; we replace the extremaloid g by the subarc gs on which ¢’ Sib where 
U<bst. If F<b<c, then ge is an extremal whereas if c <b St, 
ge is an extremaloid. On ge we introduce A intermediate manifolds as pre- 
viously with 
(8.13) lo L n LL ™ b, 


and with the points t = ag (q = 1,: > :,À) so distributed on gə» that no one 
of the À + 1 segments into which g» is thereby divided contains a conjugate 
point of its initial end point. For b > c, the point t = c must be taken as 
one of the admissible points t — ag, and the deflecting manifold used as the’ 
corresponding intermediate manifold. 

The family of broken extremals which hereby replaces the family X*+ (t, v) 
is denoted by X*(t,v,6). The functions J’ (v) and Q?(v) are defined for go 
` as were J (v) and Q(v) for g. ` | 
The proof of Theorem 8. 4 will be based on the following statement: 


(a). For any point b the index of Q'(v) is equal to the sum of the 
orders of the conjugate points of Y on  <t<b.. 


Morse proved that statement (a) is true for b on gı; that is, for 
YU <bSc. It remains to prove that (a) is true for c < b S +”. 

As b increases, the index of Q°(v) [written Z(Q?)] will change at most 
when b passes through a conjugate point a of {’ and will then increase by at 
most the order y of a as a conjugate point of #. Hence for each value of 
b >c, we have 
(8. 14) I(Q?) S Sv (U<a<b). 


We shall prove the following lemma. 


Lemma 8.5. For b > c and nearer to c than any conjugate point of Ÿ, 
“excepting possibly c itself, | 
(8. 15) (Q) 2 Zv (Y <a<b). 


With any admissible set (8.13) for which b = a),, satisfies the hypothesis 
of the lemma and a, = c, we proceed as follows: We denote a1 by Am2 or b, 
and a by ax or c, and insert a new-point ax between ar- and c. We intro- 
duce a new intermediate manifold M, cutting ge at ax but not tangent to gb 
at ax. For this construction we replace the set of parameters (v), by a set of 
(A + 1)n parameters (é), the first (A—1)n and the last n of which form 
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the set (v). The broken extremal €(£) determined by. (é) and the end points 
of gy coincides with the broken extremal ANG v,b) for StS ai = 
eStss. 

Understanding ‘that J.t(£) and Qe) denote the Tadon replacing 
J*(v) and Q?(v), we shall first prove that 


(8.16) > 1(Q*) = 1(Q+). 
| To that end let e be a parameter near 0 and set 
ple) = Jd (e£) — J? (ev). 


When the first (à — 1)n and the last n components of (€) are given by (v), 
the inequality (e) Z = Q holds for e sufficiently near 0, and ¢(6) has å mini- 
mum for e= 0. Hence ¢”(0) 20, and (8.16) follows with the aid of 
Lemma 3. la of Morse 2. = 

Next we set the last n variables of (£) equal to zero in Q»? (£), and obtain 
thereby a quadratic form Qo°. Applying Lemma 3. 2b of Morse 2 we see that 


(8.17) 1(Qe) = 1(Qe*) +N (Q), 


where 7(Q.°) and N(Q,°) denote the index and nullity respectively of Qo”. 
But since the nullity of Q.” is equal to the order of c as a conjugate point of 
. © and since the index of Qo° is equal to the sum. of the orders of the conjugate 
points of t on t’ < t< c, the inequality (8.15) follows from (8.16) and 
(8.17), and the proof of the lemma is complete. f 

Upon comparing the inequalities (8.14) and (8.15), we see that for 
b >c, and sufficiently near c, statement (a) is true. 

That statement (a) holds for any point b >c on gz can be proved by 
taking an arbitrary point t°, where c < {© t”, and showing (1) that if (a) 
is true for b < t°, then it is true for b == t and (2) that if (a) is true for 
62°, then it is true for b > #. The method of proof is similar, and the 

details will be omitted. See Morse 2, Lémmas C and D. 

The index and nullity of Q (v) are independent of the number, position 

and representation of the intermediate manifolds, provided they are admissi- ~ 
” bly distributed and represented, and one intermediate manifold coincides with 
the deflecting manifold. The index and nullity of Q (4#) depend only on ane 
conjugate points of t on g, and their orders. ` 

Recall that the nullity of a critical point of a function of a finite num- 

‘ber of variables is defined as the nullity of the Hessian of the function 
at the critical point, and that the index of a critical point is defined as the 
number of negative characteristic roots of the Hessian of the function at the 
critical point. Understanding that each conjugate point is to be counted a 
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number of times equal to its order, we summarize our results in the Index 
Theorem. 


Inpex Tuxorem. The point (v) — (0) is a critical point of J (v) with 
an index equal to the number of conjugate points of t=¥ on g preceding 
t = t”, and a nullity equal to the order of t =t” as a conjugate point of t == t.. 


Let t == a and t=} be any two points on g. We shall prove the following. 
- theorem. 


THEOREM 8.5. The numbers of zeros of the two conjugate point deter- 
minants D,(t,a) and D,(t,b) on any finite open interval of g difer by at > 
most n, where n= m — 1. i 

Let p < t < q be any finite open interval of g. Let r be a point follow- 
ing ponp<t<q. Suppose r is not a or b orc and that there is no con- 
jugate point of a or b on p<tÆr. Similarly let s be a point preceding 
qonr<t<q. Suppose s is not a or b or c and that there is no conjugate 
point of a or b on s&t <q. Understanding that Q (rs) denotes the index — 
form corresponding to the finite open interval r < t< s of g, and that I (rs) 
denotes the index of Q(rs), we shall first establish the following statement. 


(a). The number of zeros of Dit, a) onr <t < 8 is equal to 
(8.18) ; | I(rs) +k (OSkSn) 
where k is an integer or zero. 
There are three cases to be considered: 
Case i. a<r<s, 
Case 2. r<s<a, 
Case 8 r<a<s. 
To prove Case 1 we first set up index forms Q(ar) and Q(rs). We then 
set up Q(as) taking one intermediate manifold, M,, at the point r and the 
‘same intermediate manifolds preceding and following M, as are used to define 
Q(ar) and Q(rs) respectively. When the variables of Q (as) belonging to the: 
intermediate manifolds preceding and following M, are the same as the 
` variables of Q(ar) and Q(rs) respectively, the quadratic form obtained by 


setting the n variables belonging to M, in Q(as) equal to zero is equal to 
Q(ar) + Q(rs). Hence 


Ilas) —nS I (ar) + I (rs) S I (as). 
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See Morse 1, p. 62, Lemma 7.2. It follows that | 
(8.19) (as) —T(ar) = I(rs) +k, = (0SkSn)' 


where & is an integer or zero. But, since ose points are counted accord- 

ing to their orders, the left member of (8.19) represents the number of con- 

jugate points of a on r<t<s. Moreover, since the order of a conjugate 

point of a is equal to the order of vanishing of D,(t, a) at that point, the 

number of zeros of D,{t,a) on r<t<s is given by (8.18) whena<r<s. 
For Case 2 we interchange the réles of r and s in Case 1 and obtain 


I(ar) —I(as) = I (sr) + k, (0SkSn) 


where k is an integer or zero. But J(sr) == I (rs), and statement (a) follows 
as in Case 1. 

For Case 3 we first set up-index forms Q(ra) and Q(as), and then Q(rs), 
taking one intermediate manifold, Ma, at a to define Q(rs), and taking the 
same intermediate manifolds preceding and following M, as are used for Q(ra) 
and Q(as) respectively. If in particular a == c, then Ma must be taken as 
the deflecting manifold M.. As in Case 1 we have 


I(rs) —n SI (ra) + I(as) SI (rs). 
Since I(ar) == I (ra), it follows that 
(8. 20) I(ar) + I(as) = I (rs) —n + k, (0SkSn) 


where k is an integer or zero. The number of conjugate points of a on 
r<t<s is given by the left member of (8.20). But since the conjugate 
point determinant D,(t,a) vanishes to the n-th order at a, and since 
r<a<s, the number of zeros of D,(t,a) on r <t<s is given by the 
tight member of (8.20) increased X n. This completes the proof of 
statement (a). 

Returning to the pos we see that the numbers of zeros of D,(t, a) 
and D(t, b) on r < t< s differ by at most n. From our choice of r and s, 
it follows that the numbers of zeros of D,(t,a) and D,(t,b) on p<t<q 
differ by at most n. | 

The following i is an easy corollary. 


COROLLARY 8.5. The numbers of zeros of iwo conjugate point deter- 
minants D,(t,a) and D,(t,b) on any finite interval (open or closed) of g 
differ by at most n, where n == m — 1. 


Conclusion. The index theory can be directly: extended to the case that 
g is a broken extremal with any finite number of corners, at each of which g 
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is cut across by a regular (m—1)-manifold of class C?, not tangent to either 
are of g at the corner, and at each of which g satisfies a corresponding set of 
primary incidence relations. Moreover, if the initial end point ?” of a broken 
extremal g lies on a regular (m—1)-manifold 9n of class C°, which cuts g 
transversally at ¢’, but is not tangent to g at i’, the index theory has corre- 
sponding theorems, stated in terms of “focal” points of Mm and their orders. 


Sweet BRIAR COLLEGE, 
Swer BRIAR, VIRGINIA. 
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ON 0-REGULAR TRANSFORMATIONS.* 
By A. D. WALLAOE. 


1. Introduction. In this paper we consider a particular type of interior 
transformation which we call a 0-regular transformation. A mapping of this 
type may be roughly described by saying that the inverse sets (of points) are 
uniformly locally connected and in addition form a continuous collection. 
More accurately we require of the continuous transformation T'(4) == B that 
for any sequence b,—>b in B we have (i) T1(b,) — T(b) and (ii) this 
convergence be regular relative to O-cycles in the sense of Whyburn. The 

-condition (i) is a characterization of interior transformations due to Hilen- 
berg. There are many obvious generalizations of this notion with which we 
shall not be concerned. 

We show that any 0-regular transformation may be factored into two 
0-regular transformations of which the first is monotone and the second of 
constant multiplicity. This result is important in studying the effect of the 
transformation. It is also shown that 0-regular convergence is preserved under 
the inverse of a O-regular transformation. We prove that (the mapped space 
being a locally connected continuum) cut-points, end-points and A-sets map 
respectively into cut-points, end-points and A-sets. In particular a 0-regular 
transformation is: topological on a dendrite, and is monotone if the image 
space is a dendrite. 


2. General theorems. We suppose throughout that T(4) =B is a 
continuous transformation defined on the metric space A, and that B contains 
more than one point. The following definition is due to G. T. Whyburn [1]: 
If the sequence of closed sets {Mn} converges to M, then M,— M 0-regularly 
provided that for each e > 0 there are positive numbers 6 and N such that for 
n > N any two points z and y in M, with p(z,y) < 6, lie in an e-continuum * 
in M,. It is readily seen that the following result holds: 


(2.1) If Mn M in a compact metric space, then in order that the con- 
vergence be O-regular it is necessary and sufficient that for each positive e 
there exist positive numbers 8 and N such that for pe M and n> N, the set 
Valp): M, ts contained in a connected subset of Velp): Mn? 


* Received July 31, 1939. 
1 An e-set is a set of diameter less than e. 
*The symbol F, (p) denotes the set of points not farther from p than e. 
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The following example is of interest: Let {f,(x)} be a sequence of real- 
valued continuous functions defined on the unit interval and converging to 
the function f(x). Let M, be the graph of the function y—f;(x). In order 
that f(x) — f(z) uniformly it is necessary and sufficient that M, — M, 
0-regularly. 

I shall say that the transformation T (A) = B is 0-regular provided that 
if yn — y in B, the sets T(y,) > T> (y) O-regularly. It follows immediately 
from a theorem due to Bilenberg [2] that if T is 0-regular it is interior; that 
is, open sets map into open sets [3]. The proof of the following result pre- 
sents no difficulty: | 


(2.2) In order that the interior transformation T(A) = B be 0-regular, 
where A ts compact, it ts necessary and sufficient that for each «> 0 there 
exist a 8 > 0, such that if x and y are in A with p(x, y) < Sand T(x) == T (y), 
then x and y lie in an e-continuum in TT (x) == TT (y). 


(2.3) If T(A) = B ts interior, where À is compact, and if the sequence of 
closed sets Yn — Y in B, then we have T1(Y,) — T1(Y), and if this con- 
vergence is O-regular so is the convergence Yn — Y. 


Proof. For the proof that T-1(Y,) — T1(Y) see [4]. Assume that 
the convergence is 0-regular. Let e be a positive number and e a positive 
number such that if ae A and b = T (a) then T(Ve(a)) C Ve(b) ; take d >0 
and N > 0 for this e as in (2.1). Pick 8 > 0 so that if be B and ae T-*(b) 
we have V3(6) C T(Va(a)). This latter is possible by a theorem due to 
G. T. Whyburn [5]. Let p be any point of Y and let x and y be points 
of Va(p) : Yn, n > N. If ge T(p) C TA(Y), then Va(p) © Yu 
C T(Va(g) : T>(Fn)) and we can find points + and y in Ve(q) :-T7 (Fn) 
mapping into æ and y respectively. Since n > N we know that + and y lie 
in a connected subset H of Vo(q)-T"*(Yx) in virtue of the 0-regular con- 
vergence T1(Y,) — T°(Y). Hence T(H) C T(Ve(q) © T7(¥a)) - 
C V.(p):¥n. Thus + and y lie in a connected subset of V.(p) : Yn., This 
completes the proof in virtue of (2.1) 


(2.81) If T(A) =B is 0-regular and A is compact, and T is factored, 
T = TT, s0 that T,(A) == A’ is interior, then T,(A’) = B is 0-regular. 


Proof. Suppose that y, —> y in B. Then T1(y,) > T-1(y) 0-regularly 
in A. Hence TT (yn) — TT (y) O-regularly by (2.3); but for any be B 
we have T,7°*(b) == T,1(b), so that T” (yn) — T:> (y) 0-regularly. 

The following example shows that this theorem is false if T, is not 
interior: Let A be the circle | z | = 1 and B the circle | w| — 1 and T the 
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transformation w==2". Let T, be the transformation mapping A into a 
lemniscate by identifying the points 1 and — 1. Then T: is not 0-regular. 
i Before proceeding to the proof of a factor theorem we need the following 


Lemma. Let {M,} be a sequence of disjoint locally connected closed sets 
converging 0-regularly to the locally connected set M in a compact metric 
space. Suppose that'M-Mn—=0. If M==X1-----4X* is a decom- 
position into components then there. is an integer N such that if n> WN, 
My Xat +- -4 Xat is a decomposition into components and for each 
i= 1,2,- --,k, we have Xnt—> X’ O-regularly and if i34}, Xt-lim £,” 
ts. vacuous. 


Proof. Let X be a component of M and {X,} a sequence of components 
of the sets {M,} so chosen that X and lim inf X, have a point in common. 
Let {X,,} be a convergent subsequence, say X»; —> X’. From the local con- 
nectivity of M, it follows that M,—X, is closed. Let {Mn,,—Xn,,} be a 
convergent sequence chosen from the sequence {Mx,—-Xn,} and converging 
to a set Y. It follows that M == Y + X’ and since (Ms, —Xn,,) Zn, = 9, 
we have Y - X’ == 0 as a consequence of the 0-regular convergence [1]. Since 
X: X' 20 and X’ is a continuum it follows that X = X’. Thus every con- 
vergent subsequence of {X;} converges to X, so that X — lim Xy. 

- It follows immediately from the definition of eon convergence that 
ZX lim sup (Mn—X,) is empty so that lim sup (A, —X,) C M— X ; from 
the fact that X, — X, a component of M, we deduce that M — X 
C lim inf (M,—X,). Hence M,—X,2>M—X. Since (M,—Xx) Xn 
0 and (M—X):X—0 we conclude that the former sets converge 
0-regularly to the latter and thus [1] that ¥,— X and M, — Xn > M— X 
Q-regularly. The proof of the lemma may be carried through by induction. 

‘The transformation T(4) = B is locally topological or a local homeo- 
morphism, provided that it is interior and that each point in A admits a 
neighborhood on T is topological [6]; T is said to be mangeans provided 
that for each b e B, the set T-1(b) is connected [7]. 


(2.4) If T(A) —=B ts O-regular and A is a continuum, then T can , be 
factored T. = TaT, so that i 


(i) T,(A) = A’ is monotone and 0-regular 
(ii) T:(4) =B is of constant multiplicity and locally topological. 


Proof. As a consequence of a theorem due to Whyburn [8] we know that 
T can be factored so that T, is monotone and T is interior. Also for each 
beB it follows that 7-*(6) is locally connected. If y, y in A’ we must 
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show that T,*(y,) — T;1(y) O-regularly. But from the proof of the factor 
theorem just cited it follows that each set T1 (T), ce A’, is a component of 
T“T,(x) and since T is 0-regular T-*T2(yn) >T“Ta(y). But T, is con- _ 
tinuous so that 7,7 (y) lim sup T,"(y) 340, so that by the lemma Ti (Yn) 
— T;*(y) O-regulerly. This completes the proof of (i). 

By (2.81) T: is 0-regular since we have shown that T, is 0-regular and 
hence interior. Let p(b) be the number of points in 7,7(6), that is, the 
number of components in T-*(b). By the lemma p(b) is a continuous 
function and since B is connected it follows that p(b) is constant. It readily 
follows from this and the O-regularity of T, that this transformation is 
locally topological. 

As a matter of convenience we state explicitly the following: 


(2. 41) If T(A) = B is 0-regular on the continuum A and X ts a component 

of T-*(y) then there exists a sequence of points y, —> y in B and a sequence 

of components {Xn} of {T-1(y,)} converging 0-regularly to X. The sequence 
` {Xn} is essentially unique in the sense that if {Zn} is any sequence of com- 
ponents of {T-*(yn)} converging to X, then the sequences {Xn} and (Za) 

differ only in a finite number of terms. 


(2.5) In order that the transformation T (A) == B be 0-regular on the: 
compact space A, it is necessary and sufficient that for any sequence of closed 
sets Yn —> Y 0-regularly we have T1(Y,) >T7(Y) 0-regularly. 


`. Proof. The sufficiency of the condition is obvious. Assume now that 
T is O-regular and let Y,— Y 0-regularly. Since T is interior it follows © 
that T*(Y,) = T> (y) [4]. Let e > 0 and select u > 0 from (2.2) so.that 
if z and w are any two points of A such that T(z) == T(w) then z and w lie 
in an e/8 continuum in TT(2). Let t be a positive number less than the 
smaller of u/3 and e/3. Let e > 0 be so chosen that if be B and a eT- (b), 
then Ve(b) C T(V:(a)) [8]. Since Fa — F 0-regularly there are positive 
numbers d and N such that if n>N and ge Y, then any two points of 
Va(q) : Yn lie in a connected. subset of Ve(g) : Fa, by (2.1). Since T is 
continuous and À is compact there is a positive number § such that if ae À 
and b= T (a) then T(Vs(a)) C Va(b). If n > N and peT+(Y) I have 
to show that any two points z and y in Va(p) : T4(Y,) lie in a continuum 
in Ve(p) -T(Y:). To this end it is shown that z and y can be FAR 
for all small positive r in the set Voes(p) ` T1(Y,). 
Let r be positive and less than 4/3 and pick s > 0 so that if be B and 
aeT*(b) then V,(b) CT(V;(a)). Since z and y are in Va(p) -T(Yn) 
then + and y’ (the images of z and y respectively) are in Va(p’), p =T (p). 
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It follows that z’ -+y Ca connected subset of Ve(p’)- Yn. Hence we can 
find a chain ` 


T = bo bs > by, Bye Velp) Yn, p(b4, bin) <8. 
Now Ve(p’)-¥nCT(Vi(p)-T(¥n)) so that we can find points 


T= lo, l, © yakm y,  GjeVi(p) TP), - T (aj) = b; 
Since 
bja e Va (bj): Yn CT(Vr(as) : TA (Ln) ) 


there is a point cz in V-(ay) : T1(Y,) mapping onto by. For each j we 
thus havè p(a@j, Cj) < r. Also 


© Plan Gin) S p (Gy, Cir) + p(as, p) + pas, p) C+ at <u. 


Hence aja and cjn lie in a continuum K;,, of diameter less than ¢/3 in 
TT (aja) CT(yn). Since p(aju, p) < e/8, no point of Kja is farther 
from p than 2/3. We can now chain aj, to Cy by an r-chain in 
Fs (p): TAF»). Thus x can be r-chained to y for all small r in 
Voe/s(p)  TA(Yx) and it follows that v and y lie in a connected subset 
of Ve(p) + T7(¥n)2 . 

It is clear that (2.5) implies the following: 


(2.51) If Y ts a locally connected closed set in B then T>(F) is locally 
connected, 


From (2.5) we also get the following product theorem: 


(2.6) If T,(A) =A’ and T;,(A’) =B are 0-regular where À ts compact, 
then so also is T = T,T.. 


Proof. For if y, — y in B, then T.7'(yn) > T” (y) O-regularly. Hence 
TT (yn) — TiTa (y) O-regularly. But for any be B we have 
T>(b) = TaT (b). 


3. 0-Regular transformations on Continua. In this section we shall 
suppose that T(4) = B is 0-regular and that A is a continuum. 


(3.1) If H is any subset of B then T is O-regular on T(H). If H is a 
connected subset of B then T is O-regular on each of the finite number of 
components of T-1(H) and each such component maps onto all of H under T. 


? This proof is similar to proofs given by G. T. Whyburn [6] and W. T. Puckett 
[11] for somewhat different results. The result (2.51) has also been proved in the 
cited paper of Puckett. : 
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| Proof. The proof of the first statement is immediate. Assume that H 

is a connected subset of B and T*(H) — P + where PQ + PQ —0. It 
is readily seen that if P40 we have T(P) — H and since for each b eB 
the set T-1(b) has only a finite number of components, there are at most a 
finite number of components in T--(H). Since any one of these can be 
taken as P in the above decomposition it only remains to show that T is 
0-regular on each component. If K is a component of .T-(H) then K is 
open in 7-*(H) and hence T is interior on K. ‘Thus if y, — y in H we have 
K-T (y) >K-T+(y). But if b«H-then any component of T-1(b) which 
intersects K certainly lies in K. Hence each component of K-T*(b) is 
also a component of T-1(b). By (2.41) the result follows. 


(3.2) If the continuum X separates A and lies in the inverse of a point 
ze B, then X is a component of T1(x). 


Proof. We may write 
AmiHH-+K, H:-KmX, 


where H and K are continua. If ye B— rz, then clearly any component of 
T(y) is either in H— X or K— X. Let X’ be the component of T+(x) 
containing X. In H we can select a sequence of points {za} not in X and 
which converge to a point se X. If X, is the component of T-T (z„) which 
contains z, then X, is in H— X and by (2.41) Xn >X’. Hence X’C H. 
Similarly X’C K, so that X = Y’. nn 


(3.8) If the closed set X separates A irreducibly between two points and 
ts contained in a component of the inverse of a point, then X ts this component. 


(3.4) If the point x of A ts an end-point, a regular point in the sense of 
Menger, or a cut-point of A, then x is a component of TIT (s), and T is 
locally topological in a neighborhood of x. | 


Proof. The first two cases follow because z cannot lie on a continuum 
of convergence., The third follows from (3.2). 


(3.5) If the continuum A does not contain uncountably many non- 
generate mutually exclusive continua, then each O-regular transformation on 
A is locally topological. 


Proof. If we assume that the result is false we can find a non-degenerate ` 
component X of the inverse of a point ce B, and a neighborhood U of X. 
such that for each ye B, any component of T(y) which intersects U is 
non-degenerate. But T(U) is a neighborhood of x and since B is a continuum 
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T(U)~ contains uncountably many points. Hence A contains uncountably 
many non-degenerate mutually exclusive continua. : | | 


4. Results for locally connected continua. We now suppose that A 
is a locally connected continuum and T is 0-regular unless the contrary is 
explicitly stated. We also assume a knowledge of the cyclic element theory 
of such spaces, Kuratowski and Whyburn [9]. 


(4.1) If T is a local homeomorphism and J is a simple closed curve in B, 
then each component of T(J) ts a simple closed curve mapping onto all of J 
under T. If D is a dendrite in B then there k components in T1(D) (k being 
the multiplicity of T) each of which is a dendrite mapping topologically onto 
D under T. | 


Proof. If Z is a locally connected continuum and Z, is a component of 
T(Z) then T(Z;) = Z is locally topological and hence Z, is a locally con- 
nected continuum. The proof of the first statement is immediate. Let D, be 
a component of J-+(D). Since T(D,) = D is interior there exists a dendrite 
D’ C D, such that 7(D’) = D is topological by a theorem due to Whyburn 
[10]. But since T is locally topological it is clear that D’ == D,. 


(4.11) If A is a dendrite then T is topological, if B is a dendrite then T 
ts monotone. | 


Proof. “Using the notation of (2.4), if A is a dendrite then A’ is a 
dendrite since T, is monotone. As in (4.1) it follows that T, is topological. : 
Hence T is monotone. But by (3.4) each point eA is a component of 
T“T (a). Similar reasoning applies to the second statement. ` 


(4.2) If x ts a cut-point of A then y — T(x) is a cut-point of B. 


Proof. By (3.4) and with the notation of (2.4) we see that æ ia a. 
component of T;1T,(x), ie, cm T 17,(x) since T, is monotone. If 
yı — T, (z) did not cut A’ then A’ —y, would be connected and hence so 
_ would Ty3(A’—y,)=A—z. Hence y, cuts A’. Thus we may write 
A’o= M +N, MN —y,, where M and N are non-degenerate continua. Now 
T2(41) — ¥2 is clearly not an end-point and if it is not a cut-point it lies in 
a true cyclic element Æ of B. We can find arcs ay, and by, in M and N 
respectively such that T, is topological on the arc ay, + yıb. Let Ta(a) = 0, 
T,(b) =d. If cy — y: were in B— E it would lie in some component R 
of this set. But then clearly we would have F(R) = È — R = y, and hence 
Y2 is a cut-point, contrary to our assumption. There are thus some subarcs 
of cy, and y.d in E and we may assume that cy: -+ y2d is a subset of E. Now 
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c and d lie on a simple closed curve J’ in E, J’ — cpd + eqd. Let cpd be 
the arc of J’ not containing Ya. We may assume that c is the first point on 
y2c in cpd and that d is the first point on y.d in cpd. Finally let J be the 
simple closed curve cyd + cpd, and let J, and J» be the components of 
T(J) containing ay, and y,b respectively. Now J, and Jz are simple 
closed curves by (4.1) and since no simple closed curve can have points 
other than y, in both M and N it follows that J, and J, are different. Hence 
two different components of T(J) have the point y, in common. This is 
a contradiction. 


(4.3) If H is an A-set in A then T(H) is an A-set in B. 


Proof. Let 1,8, be an are in A’ with 7, and s, in H, = 7,(H). 
Since Ti (ris) is a locally connected continuum we can find an are rs in 
this set with r and s in H. Since H is an A-set rs lies in H and hence 
T(rs) = 18, lies in Hı. Let H, = T,(H;) and assume that À, is not an 
A-set. We can then find an arc wav, in B with UaV: Ia == Us + ve. There 
is an arc WoYovo in He. Let J == usted, + Uoove and let yı be a point of 
Hı Tit(ys). We can find a neighborhood U of y, on which T2 is topological 
and V == 7,(U) will be a neighborhood of ya Now ys is in V and hence 
a subarc tz of us¥av2 containing yz lies in V. We can thus find an arc t, of 
H, which maps topologically onto tə» Let Jı be the component of T(J) 
which contains t,. Since H, is an A-set it is clear that J, is in H, and hence 
we have J in H. Thus we: is in H. 


THE UNIVERSITY OF VIRGINIA. 





BIBLIOGRAPHY 





GQ. T. Whyburn, Fundamenta Mathematicae, vol. 35 (1935), p. 408. 
5. Ellenberg, Fundumenta Mathematioue, vol. 22 (1934), p. 292. 
Stoilow, Principes topologiques de la théorie des fonctions, Paris, 1938. 
À. D. Wallace, American Journal of Mathematics, vol. 61 (1939), p. 757. 
G. T. Whyburn, Duke Mathematical Journal, vol. 4 (1938), p. 1. 
8. Eilenberg, Fundamenta Mathematicae, vol. 24 (1935), p. 160. 
G. T. Whyburn, American Journal of Mathematios, vol. 66 (1934), p. 370. 
G. T. Whyburn, Duke Mathematical Journal, vol. 3 (1937), p. 370. 
Kuratowski and Whyburn, Fundamenta Mathematicae, vol. 16 (1930), p. 306. 
10. G. T. Whyburn, Bulletin of the American Mathematical Society, vol. 44 
(1938), p. 414. 
11. W. T. Puckett, American Journal of Mathematics, vol. 61 (1939), p. 750. 


OoN gH 


TWISTED CUBICS ASSOCIATED WITH A SPACE CURVE* + 


By Lovis GREEN. 
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1. Introduction. Various methods have been employed in investigating 
the projective differential properties of a curve immersed in ordinary space, 
each method having certain advantages. The procedure used here is to start 
with a pair of dual differential equations, to introduce certain transforma- 
tions of codrdinates in order to obtain canonical power-series expansions for 
the curve considered, and to base the remainder of the paper on these expan- 
sions. The objectives of the paper are to characterize certain configurations 
associated with a curve, particularly the five-point twisted cubics, and to begin 
the problem of interpreting a duality formula in geometrical language. 


2. Analytic basis. The differential equations of a twisted curve T, 
not belonging to a linear complex, may be written in the form 
(1.1) ow + ax" + (W — 0) + ct = 0, 
(1.2) E 4 al” + (a + BE + of —9, 
é represents the osculating plane of T at the point æ; differentiation is taken 
with respect to a properly chosen parameter «u; and a, c are scalar functions 
of u. The value of 6 can be chosen arbitrarily (540); if 9 ==— 1, these 
equations are the ones derived by Fubini and Cech;1 if 0 == — 4, then (1.1) 
is the canonical form of Halphen. 

When « is fixed at a suitable value wo, a point O (=) on T is Siac, 
and a local tetrahedron of reference D,{z, 2’, g”, €} is formed, with a unit 
point chosen so that any point whose codrdinates in the original system are 


Oye + aya’ + tat” + ra” 


will have local coérdinates proportional to 2,,-- <, €a It follows readily that 
the local coôrdinates +, of a point P on T “ sufficiently near” O are 


(8 = const. + 0). 


rı = AuAu/n! ARE 
where E 
Axo ae 1, Az = Aso = Ayo = 0, 
Aint = A’, — Chan, 
Aona = Ain + Aan + (8— U Aan (n = 0) 
Azn == Aon + A’sn == GAsn, 
Aana Fr Asn + A’ gn. 


* Received February 13, 1939. + Presented to the Society, September 6, 1938. 
1 Introduction à la Géométrie Projective Différentielle des Surfaces, 1931, p. 26. 
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Halphen’s local tetrahedron H, which will be used throughout this paper 
can be- obtained directly from D,. If local codrdinates referred to H, are 
denoted by #:, then the following relations hold: 


Ya/t = T1 F aise + Asta F Quads, 

Yo/t = ` Qata F AzsTs + Aree, 

Ys/T = < GssTe F Gass, 

Ya/T — f | Neale, 
Gin = /200, ais = (6? — 18006") /6008?, 
Gig == (D? — 1260407 — 10800a’6* + 144006*) /860008, 
Ging = Y, Gag = GY /150, az, = (6? — 42006) wy /60082, 
Agg mms QU, Agg =m hy7/100, Gas = Gy, 
p = 100c — 9a? — 30a”, y= (68/6095) #, ge5#0. 


_ (2. 1) (r arbitrary) 


If non-homogeneous coördinates x, y, z are defined in the customary way, | 


D Y2/th, Y= Y/Y 2 Y/Y 
the equations of T, relative to O as origin, are found to be 


(3. 1) y= +È pr, l z= È ga". 


The coefficients pr, qn (n = 7) are expressible in terms of g, and the coeffi- 
cients in the differential equation (1.1), and are understood, of course, to be 
, evaluated at u == u, The value of qe, as is the case with 8, can be chosen 
arbitrarily (5£ 0), but since a numerical choice for qa would prevent us from 
displaying the weights? of the coefficients, we merely specify that gs be 
independent of the parameter u. The values of Gr; qs, Pr are found from 
_ (2.1) to be 

| qr = $/420044, 

(4.1) qe — ($2 — 45/0 — 36046") 750400046, 
pr (— p? — 6048 — 360a6*) /1512000y56. 

Let m (== €) be the plane osculating T at 0. Then a local plane tetra- 
hedron of reference. D,{é, €, €’, €”} with local codrdinates proportional to 
&,°°°,&, if plane coordinates in the original system are 
| EE + EE + és + EE”, 


is formed from the differential equation (1.2). We replace D: by a new local. 
plane tetrahedron H+, dual to H,. If local plane coürdinates referred to H, 
are denoted by 1, then the relations between & and # are the same as those 
between s; and y, in (2.1) except for the change in sign of 6. | 


* The weights of p, aud q, are n— 2 and n— 3 respectively. 
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Setting l | 
f/m, n= m/n E= m/m, 
we obtain the equations of T in system He: | | 
œ : oO 
(3.2) n—e+Ène, tott Èo 
where we choose 


Kg == Qe- 


The values of xr, Ks, m7 are obtained from gr, ge, Pr as given in equations (4.1) 
by changing the sign of 6. It then follows that 


(4:2) A 
xs = (79698 — 18qoPr — 7417) /da, 71 = (24qegs — 63ga Pr — 281") /9Qe. 


Returning to tetrahedron H, and equations (3.1) we denote homogeneous 
plane coördinates by & and non-homogeneous coördinates by é, 7, &, where 


EEE, Fsbo THE 
Then the plane equations of T in system H, are. 


F= 8/3 + 2qok /81 — Yqé/729 + (8qs — 21p,)É7/2189 + - - -, 
T— 8/27 + 5ge8*/729 — 2q,8"/729 + (gs — 18p1)E/6661 + > -. 


The transformation 


(5) 


& a 27 darn; 

a 81q6°”s — 189ge qra 

é; macs 81q6°m2 — 378ga qs + 441qel N 

Es = Yge’ — 189qe"qrns + 441qoqr°ms + (54qet — 348qr° )m 

carries (5) into (3.2) and hence is the transformation from H, to H+. The 


coördinates of the vertices of tetrahedron Hy referred to system H,' are thus 
found to be 


(7) (1, 0, 0, 0), (en Ye, 0, 0), 5 
(49¢,7, 1494697, 3°; 0), (84398 —<— 54qe*, gi 63ga? rs 2 Te). 


(6) 


The u-derivatives of the local point coérdinates zı of system D, are 
obtained in the following way. In the original codrdinate system any point z 
in space has coordinates 


z = at + mt + aga” + +, ise 
Hence, from r 1), 


= (z — cts) + (Te + a+ re ea 
LA + Ta — as)" + (ri + ts), 
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and placing z; == 0 (i—1,:::,4), we get 


di = CX, v's = Tı + (a me 6) ta, 


(8) Ta — Ta + ae, Lai — Tg. 


The formulas for the derivatives of local codrdinates yı of system H, are 
found by differentiating equations (2.1), in which we choose 


T = exp( fasedu), 


and by replacing the 2’; and the a, by their values in terms of y, as obtained. 
from (8) and the inverse of (2.1). The resulting equations are 


oy = — 68p 742 — 3696742, . 
oy’: ES 8Qoÿ1 -+ TrY2 aaa ER Prya = 18¢e744, | i 
: == 3 
(9) oY a = — 6Qoÿe + 14grYs — 21 prys, (o qe/4) 
oÿ'a = — gays + 21qrys 


Now g == yAu-+--: +, so that at u = uo, 
y'i = dy:/du = ydy:/dz. 


Hence the z-derivatives of the local coürdinates y, are gotten immediately. 
These have also been obtained by Miss Newton.’ 


3, Duality. The differential equations of T show that relative to T at 
0 the dual of a point z, = f; (a, c, 6) in system D, is the plane & = fi (a, c, — 9) - 
in system Də. From the similarity of the canonical expansions (3.1) and 
(3.2) it follows that the dual of a point y; = fi (pn, qm) referred to H, is the 
plane y; = fi (mn, km) referred to He. The codrdinates & of this plane referred 
to H, are then obtainable from transformation (6). The problem of finding 
the dual, relative to T at 0, of a given point is therefore solved. 

But there appears to be a good deal more to the problem than this. For, 
the concept of duality considered here is quite different from the duality 
theory of projective geometry. Two coincident points, for example, may have 
distinct dual planes. Thus, the points 


P(Iqoqs + mgepr + ngr, 7091, 0, 0), 
l E LA f 
P’(U gage + m'qepr + ngi, gogr, 0, 0), Gene en 
in system H, both lie on the z-axis and for properly chosen values of V, m’, n’ 


coincide. Yet their dual planes, which can be found by the method described 
above, are distinct. Furthermore, in order to obtain complete generality for 


3 Consecutive covariant configurations at a point of a space-curve,” Transactions 
of the American Mathematical Soctety, vol. 36 (1934), p. 61. 
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both the curve T and the position of the point 0 on T, we do not wish to specify 
the relations existing, at u = us, among the coefficients in the equations (3.1). 
We therefore regard P as one of a three-parameter family of points generating 
the z-axis as 1, m, n vary independently over all real numbers; i.e. we fail 
to consider the coincidence of P and P’ unless l, m, n = l’, m,n respectively. 
Similarly, the point 
(kg qa; 0, 0) 


referred to. tetrahedron H, lies on the z-axis and coincides with P for proper 

. choice of k. Yet their geometrical characterizations and their dual planes are 
completely unrelated, and the curves they generate as u varies (k,l,m,n 
remaining independent of u), are entirely different. 

The problem of characterizing geometrically the dual of a given point 
relative to T at 0 is completely untouched by the formal, analytic solution 
above. A special case of this problem, for example, is to determine how the 
point P, given above, is related geometrically to its dual plane under the 
assumption that 1, m, n are arbitrary numerical quantities. The simplest 
special case of the general problem is that of characterizing geometrically the 
dual of a point whose codrdinates are expressible in terms of gg and g; alone. 
This problem is readily solved. For, we may write the codrdinates of such a 
point as 

Yi = fi (qs, q) =% . 
when referred to tetrahedron H,; hence the dual plane has coôrdinates 

q = fi (xe, kr) = % 
when referred to Hz. By means of transformation (6) we find that this plane 
has the equation 
272491 + (8190025 — 18990124) Y2 + (8190°22 — 378q0 res + 441969 1°24) Ys 

-+ [2% gota, — 189q0?q 121 + 441qogr°2s + (54ge* — 343qr") zs] ys == 0 

when referred to H,. We have therefore proved the following result. 

THeoREx 1. The dual, relative to T at 0, of a point whose codrdinates 


are expressible in terms of qe and qr ts the polar of the point with respect to a 
quadric Q having the equation 


(10)  54qoyiga + 162q6°ysy¥s — 378q67qrt/s” — 378qe7qry2ys + BBR GOT YY 
+ (54qe* = 343q,°) 94? = 0. 


Sannia has considered * a self-dual tetrahedron S whose vertices, referred 
to H,, are 


4“ Nuova trattazione della geometria proiettivo-differenziale delle curve sghembe,” 
Annali di Matematioa, IV, vol. 1 (1924), pp. 1-18; vol. 3 (1926), pp. 1-25. 
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(11) 0, T (Yqr, 2de; 0, 0), N (4997, 28qoqr, 1296", 0), 
B(343qr — 2166, RIAGar, 252¢6"Gr, 216q,°) < 
The quadric Q has three-point and three-plane contact with T at 0, and has 
among its rulings the edges OT, ON, BT, BN of Sannia’s tetrahedron. It is 
contained in a one-parameter family of quadrics having this property. 
| 4. Fundamental tetrahedra. In system H, the osculating conic K, 
of T at 0 is given by ` | 


(12. 1) 49:42 — Byk = 0 = 4, 
„and its dual, the osculating quadric cone K’s, has the equation 
(12.2) Ys" — ya — 0. | 


| ‘Any tetrahedron {OP,P.P3} with vertices defined in the following Way will 


-. be called a fundamental tetrahedron of F at 0. One vertex is at 0, a second 


at an arbitrary point P,(3,4,0,0), ¢540, on the z-axis; a third vertex 
P2(8, 2t, t°, 0) is the contact point of a tangent from P, to the conic Ks, and 
the fourth vertex P,(1— atè, t, t?, t°), for an arbitrary value of a, lies on 
the contact line of the cone K’, with its tangent plane which passes through P.. 
. The tetrahedra Hı, Hz, O are fundamental tetrahedra determined by the 


- following values of t,a: 00,0; 3¢0/7Qr,2qe; 6G6/7Q7, qa; moreover, the dual 


of any fundamental tetrahedron is another fundamental tetrahedron. 
Associated with the fundamental tetrahedra is a family of twisted cubics, 

Ta, having five-point contact with T at 0, and expressed parametrically by 

the equations 

(18) | yı — 1 — at, Ya= 1, =t, pt. 


All of these cubics belong to the same null system, lie on the cone K’s, and 
have K, as osculating conic at 0.5 

Each choice of the point P,, or of the plane OP,P;, determines a subset 
- of œt fundamental tetrahedra; these can be placed in a one-to-one corre- 
spondence with the cubics T by choosing the vertex P, as the intersection, 
besides O, of Ta and the plane OP,P;. When this is done, the one 
relations exist: 

The polars of the vertices of one of these tetrahedra with respect to the 
common null system of the cubics are the faces of the tetrahedra, the tangent 
to Ta at P, is the edge PPs, and the osculating plane to Ta at Ps is the face 
PiP2P;.° As P, traces the c-axis, a remaining fixed, the edge P,P, generates 
a cubic surface with the equation ` 


5 Lane, Projective Differential Geometry of Curves and Surfaces, 1932, p. 29. 
° Su, “Note on the projective differential geometry of space curves,” Journal of 
the Ohinese Mathematical Soctety, vol. 2 (1937), pp. 88-137. 7 3 
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(14) 9:90 — Byoyaye + Lys + ay. — 0. 
The dual of an arbitrary point on the cubic Tae can he shown to be the 
plane which osculates the cubic Ty ` 


hello, Y T, Y m T’, ye 
for which 
Of == 29g — 4, 
at the point determined, by 


TE 8qet/(Tqr — 8ge). 


THEOREM 2. The dual of the five-point cubic Ta ts another five-point 
cubic Tw for which a + a’ = 2q6. The self-dual cubic ts Tg, and the only 
points of this cubic which lie in their dual planes are the points O and B of 
Sannia’s tetrahedron. 


The self-dual cubic Ty harmonically separates, with 0, the points of any 
two dual five-point cubics. It is called the harmonic cubic by Fubini and 
Čech,” and the coincidence cubic by -Kanitani® Theorem 2 shows that all 
five-point cubics are also five-plane cubics, and since Te is the six-point cubic, 
then the six-plane cubic is Ta(a = 2q¢). 


5. The principal plane of a curve. Halphen’s theorem ° on the princi- 
pal plane at a point of a curve has been extended by Bompiani 1° and dualized. 
by Sannia. We shall carry their results still RE basing all our cal- 
culations on tetrahedron Ay. 

Let the tangent developables of T and of a five-point cubic Fa be cut by 
an arbitrary plane OP;P, (x) passing through the x-axis, and let the plane 
curves of section be denoted by I” and T'a, the latter being a cusped cubic 
Then the following conclusions hold: 


THEOREM 3. 1. If a5£4qe, the curves IY and T'a have exactly six-point 
contact at O for all planes OP, Pa. If a — 4qe, these curves always have just 
seven-point contact, with the single exception that the plane given by ` 


(15.1) Bgoya — 5qrys = 0 


produces curves having eight-point contact. 


* Geometria Proisttiva Differengiale, vol. 1 (1928), p. 42. 

#“Sur les repères mobiles attachés a une courbe gauche,” Memoirs of the Ryojun. 
College of Engineering, vol. 6 (1933), p. 106. 

o“ Sur les invariants différentiels des courbes gauches,” Journal de l’École Poly- 
technique, vol. 28 (1880), p. 26. 

. ©“ Sul contatto di due curve sghembe,” Afemorie’ della Reale Accademia delle 

Scienze dell’ Istituto di Bologna, ser. 8, vol. 3 (1926), pp. 35-38. 

14 Loo. ott. , 
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If a and the plane of section OP,P, are both arbitrary, the cusp of T'a 
lies on the twisted cubic Ta and is the vertex P, of the fundamental tetra- ` 
hedron determined by Ta and OP,P;. The cusp-tangent is the edge PiP; - 
of this tetrahedron. 


Bompiani’s osculants 2 for the curve I’, which has an inflexion at 0 for . 
arbitrary plane OP,P;, are obtained immediately. His fourth-order neighbor- 
hood of T” at 0 is the vertex P, of the fundamental tetrahedra determined 
by the plane OP,Ps, while his neighborhood of the fifth order is the edge OPs. _ 
His neighborhood of the sixth order is the point Pa which lies on the five- 
point twisted cubic Ta(a == 4¢,).¥ 

We shall prove only the first part of our theorem. The tangent de- 
velopable of T has the parametric equations 


: c=u+y, | 
(16) yeu? + putt HE v(2u+ Tp + >), 


z mm ub + gett? + qu + qeu® +: 
+ v (3u? + 6qou5 + Tqru° + 8qsu" pi rs}, 
It meets the plane OP,P;, whose equation is 
(17) - ty—z = 0, 
in a curve I”, represented by (17) and ; 
(18) y= — 409/t — 24r /t? — 1565/1 — (1072 + 128qet*) 2°/t* 
— (7668 -+ 1728qet? + 320g7t*)27/t5 + + ~- 
If the non-homogeneous equations of the cubic Ta are written in series 
tonn, its tangent developable i is seen to have the equations 
Baru v, 
(19) y = — aud + Bus +: Eu (2u— Daut + dau + >), 
z = ub — Rau? + Tau? +: + vu (But — leaw? + 63a%uF +--+). 
Its intersection with the plane OPPs is a curve T whose pai are 
(17) and 
(20) yx — 42° /t — 24r ft? — 156625/1° — (1072 + B2at*) a/t 
— (7668 + 480at?) 27 /t5 + 
The. desired results then follow from (18) and (20). 


12 Per lo studio proiettivo- -differenziale delle singolarità, ” Bollettino della Unione: 
Matematica Italiana, vol. 6 (1926), p. 118. 

18 Su, loc. cit. See also his paper, “On certain twisted cubies projectively con- 
nected with a space curve,” Journal of the Chinese Mathematical Society, vol. 2 
(1937), p. 59. 
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Dually we choose a point Pı (5€ 0) on the z-axis as the vertex of cones 
containing the curves T and Ta. 


THEOREM 3.2. If a 2 — 2qo, these cones have exactly six-plane contact 
along m for all points P,. If «== — 2qe, the cones always have just seven- 
plane contact, with the single exception that the point with codrdinates 


(15. 2) (24:5 qe, 0, 0) 
produces cones having eight-plane contact. 


The next step would be to consider planes through 0 which do not contain 
the z-axis. Let II be such a plane, cutting the tangent developables of r and 
Ta in cusped curves I” and Ta”. 


THEOREM 4.1. The order of contact of T” and Ty” at 0 ts greater for 
a= 5/2 than for any other value of a, regardless of the position of the 
plane Il, When a == 5qo/8, the order of contact ts increased still further if 
II contains the line whose equations are 


(21. 1) Je¥2 — 4qr¥s == 0 = Y4, 


and is greatest for a uniquely determined position of I, namely 


(22. 1) 3q0°Y2— 12qeqr¥a + (18goPr — Ÿqogs + 2097") ys = 0. 

We prefer to prove the dual theoram, and shall state here without proof 
several additional results of Theorem 4.1. The osculating cusped cubics at 0 
of T” and Ta” coincide if, and only if, « == 5q,/2, independently of the plane 
I. For fixed but arbitrary « let Ta” be the osculating cusped cubic of Ta”, 
and let II vary in the bundle of planes through 0. Then the inflexion point 
of Ta” generates the ruled surface (14) while the inflexion tangent of a” 
forms a congruence with the following properties. The focal sheets comprise 
an algebraic surface S of the sixth order whose asymptotic curves are twisted 
cubics; on each line of the congruence the harmonic conjugate of the inflexion 
point with respect to the two focal points lies in the plane a (y, == 0); the 
developables of the congruence meet S in twisted cubics and meet m in a 
family of conics whose envelope is the conic K.. When, in particular, the 
inflexion tangent of 7,” passes through a point P, on Ka, then the plane I 
is the face OPP, of the fundamental tetrahedron determined by Tae and Pe, 
while the two focal points on this inflexion tangent coincide at the point Ps 
of this tetrahedron. 

To obtain the dual theorem, an arbitrary point P in m but not on the 
z-axis is chosen as the vertex of cones containing the curves T and Ta. For 
simplicity we cut these cones by the plane ya = 0, obtaining curves T and Ža. 
Then, | 
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THEOREM 4,2. T and T, have exactly six-point contact at 0 for all 
centers of projection P unless a == — qa/2. If a= — qs/2, they have just 
seven-point contact unless P lies on the line whose equations are 


(21. 2) i 3Get/2 = 20743 = 0 = y. 


In this latter event the curves T and T, have precisely eight-point contact 
at 0 except when P has coérdinates 


(22. 2) (34098 — 497°, — 2qu0n — 390°, 0), 
when they have nine-point contact. 
For, let T and Ta be projected upon the plane y;— 0 from the point P 
(1, m, n, 0) 7 (n 0). 


It can be verified without difficulty that in the plane y, = 0 the projections 
of T and of T have the respective equations 


z= gt 3mat/n + (9m? — 2n)2°/n? + (28m° — 13mn 7 gen®)2°/n° 
À (90mt— 64m?n + ön? + 6gemn® + grn*)a™/n* 
+ (297m° — 285m'n + 5lmn? + 27gem°nt 
— gant + Yarma + gan?)a°/n? Lo, 
as T 4- Bmat/n + (9m? — 2n)2°/n? + (28m? — 18mn — ea 
+ (90m4 — 64m?n + 5n? — 1bamn°)z/n* 
+ (297m> — 285m°n + 51mn? — Blam n” + IRant)r$/n°$ +, 


The results follow immediately from these equations. 
The Halphen-Bompiani theorem referred to above states: 


THEOREM 5.1. The locus of points projecting T and the six-point cubic 
T, into cones having at least seven-plane contact is the principal plane of 
T at 0: . 
(23. 1) . ÿs—0. 


If the center of projection lies on the line with equations 


(24. 1) RQeYa + Prya = 0 = Ys, 


these cones have at least eight-plane contact along the principal plane, while 
for a unique point W on this line nine-plane contact is obtained. 


Theorem 8.2 shows that all points on the z-axis, except 0 possibly, must — 
be excluded from the locus. Further examination indicates that the cones 
projecting T and T, from 0 have but five-plane contact along. T, so that 0 
must also be excluded. 

The dual of this theorem is 
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THEOREM 5.2. AU planes passing through the principal point of 
-T'at 0: | 
(23, 2) (Tgr, des 0, 0), 


intersect the tangent developables of T and of the six-plane cubic Tala — 2qe) 
in curves having at least seven-point contact. No other planes possess this 
property. If the plane of section contains the line whose equations are 


(24. 2) 8e Yi > 14¢eG74/s + (2196P7 — 8q69e + 42937) ys = 0 es Ya 
the curves have at least eight-point contact at the principal point, while for 
a unique plane Q through this line nine-point contact ts obtained. 


The tetrahedra H, and H, are characterized geometrically by the fol- 
lowing dual theorems. . 


THEOREM 6.1. The Halphen point H of T at 0 is the point of inter- 
section, besides 0, of the six-point cubic To and the principal plane of T. 
Its coérdinates are , À 
(25. 1) mo | (0, 0,0, 1); > 


the equation of the osculating plane @ to T, at this point is 
(26. 1) l y0. 


THEOREM 6.2. The Halphen plane © of T at 0 is that osculating plane, 
besides x, of the six-plane cubic Ta (a == gs) which contains the principal 
point (23.2) of T. Its equation is 


(25.2) 2% qa°y, — 189q° gra + 441097 Ys + (54get — 3439,°) ya = 0; 
the coërdinates of the contact point H’ of this cubic and this plane are 
(26.2) (343¢,° — B4qe*, 14% qeqr*, 63qe7qr, ZYE). 

Furthermore, the lines HH’ and @@’ are coplanar with the edge ON of 
Sannia’s tetrahedron,® and together with the self-dual cubic Te, serve 1 
characterize this tetrahedron. 

The principal plane of T and of a five-point cubic Ta (a540) is their 
common osculating plane r, while dually the principal point of T and of a 
five-plane cubic Ta (a 5 2qs) is their common point O. Bompiani’s extension 
of Halphen’s theorem was obtained only when the principal plane of the two 
curves is distinct from the common osculating plane, while Theorems 8.2 and 
4.2 cover the case where the two planes coincide. The complete results can `, 
thus be summarized as follows: i 


14 With the exception of the planes through the w-axis which must be excluded. 
15 See footnote 7. 
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The twisted cubic T, delermines an axis of Bompiani (24.1) and a 
point of Bompiani W in the principal plane (23.1) of T; the cubic 
Ta (%—=— qo/2) determines an axis of Bompiani (21.2) and a point of 
Bompiani (22.2) in the osculatihg plane x, and the cubic Ta (a — — ?qs) 
determines a point of Bompiani (15.2) on the tangent to T at 0. Dually, 
the cubic Ta (a==2q5) determines a ray of Bompiani (24.2) and a plane 
of Bompiani Q through the principal point (23.2) of T; the cubic for which 
a == 5qe/2 determines a ray of Bompiani (21.1) and a plane of Bompiani 
(22.1) through the point 0, and the cubic with a = 4qs determines a plane 
of Bompiani (15.1) through the tangent to T at 0. 


When a five-point cubic Ta is projected upon the plane y, == 0 from a 
point P in the plane x, the projected curve is a cusped or a nodal cubic 
according as P is or is not on the conic Ke. In either case an inflexion point 
is obtained at 0. Now, it follows from Bompiani’s work * that a plane curve 
with an inflexion point sustains at this point a seven-point cusped cubic and 
œt eight-point cubics (two of which are nodal), but that ordinarily it possesses 
no eight-point cusped cubic and no nine-point osculating cubic. If the curve 
does happen to have a nine-point cubic, then every eight-point cubic is a 
nine-point cubic and there exists a unique ten-point cubic. 

Theorem 4.2 therefore states that the point where the line (21.2) meets 
the conic Ka, namely 
(27) (91°, 24097, 396”, 0), 


projects T into a curve in the plane ys==0 which sustains an eight-point 
cusped cubic, and that the point (22.2) projects T into a curve sustaining 
a ten-point cubic. The points (22.2) and (27) are not the only points in 
the plane x, however, with these properties. It can be shown, for example, 
that the locus of all points in + projecting T into a curve in the plane ys = 0 
which sustains a ten-point cubic is the straight line joining the points (15. 2) 
and (22.2). 

Theorems 3.1, 4.1, and 5.2 are concerned with plane sections of the 
tangent developables of T and of Ta, and show that for properly chosen cubics 
Ta there are certain planes which yield curves of section having contact of 
higher orders than are obtained ordinarily. It is natural, then, to inquire 
into the nature of the curve of intersection of the tangent developables of T 
and of Ta. The results are contained in the following theorem: 


| 18 We prefer this terminology to “ principal point” since we wish to use the latter 
for the dual of the principal plane. 
1" See footnote 12. 
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THEOREM 7.1. The tangent developables of T and of Ta intersect in 
the x-axis and in a residual curve C. If a= qo, then C consists of a single 
branch having four-point contact with T at 0. If aqe, then C has four 
branches three of which are linear, each of the three passing through 0 and 
having four-point contact with T. If a == 2q., the fourth branch of C does 
not pass through 0 but through the principal point (23.2). This branch is 
linear and has for tangent and for osculating plane at the principal point the 
ray of Bompiani (24.2) and the plane of Bompiani Q. If a == 5qo/2, the 
fourth branch of C has a cusp at 0 with the ray of Bompiani (21.1) as 
cusp-tangent and the plane of Bompiant (22.1) as osculating plane. If 
a = 4gs, the fourth branch of C has an infletion point at O with the z-aris 
as tangent and the plane of Bompiani (15.1) as osculating plane. For all 
other values of a the fourth branch of C is linear, passes through 0, has two- 
point contact with T, and has w as osculating plane. 


To prove this theorem we equate the space codrdinates (16) of a point 
on the tangent developable of T to the space codrdinates (19) of a point on the 
tangent developable of Ta, after first replacing the parameters u, v in the 
latter set of equations by p, v. Elimination of v and v from the three equations 
obtained yields the equation 


Le 
= tu + 2 MxUx, 
where, in particular, 
Mo[ me — 2 (a — qe) | = 0. 
Three cases thus arise: 


(a) ms = 2(a— qa) 0; 
(b) M: = 0, æ = qs; 
(c) Ma = 0, a4 qe. 


In case (b), 


Ma = — 396"/2q7, 
. while in (c), 


Ms—0, m, = a(2a + go)/2(a— qe), Ms = a(4a— Qo) q1/2 (8 — Ge)”, 
Me = [a (4a — ge) qx? + (a — qe) { (q6? — 2age — 8a?) pr 
` + (5a? — 2age) ga} ]/2 (a — g)". 


We can now express v in terms of u. 


(a) V = met? /2 ++ °° 5 
(b) v = — grt? /3Go +; 
(e) three subeases must be considered: 
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(ci) a Æ 0, a Æge : v = (a —o)u/3(2g6 — à) 

a + (qo — 4a) qru?/9 (2qs — a)? + ou? +: °°, 
where . , 
o = [8 (a — 295) (6a — 2qe) qs — 9 (a — 2ga) (4a — qe) pr. 

+ (4a — qo) g1]/27 (2gs — a)” ; 
(co) a=0 : v=—u/6 +; ' 
(cs) a= 2gs : V= ga/YVqr + (R1goPr — 8qegs — Yq )U/499? +: >: 
Substituting into (16) yields the equations a 


T =u + MmU, 
(a) l y = ma +, 
z = u? + 38mout/2+°° +5 
T = U — qru? /3qa + 7, 
(b) y=u—?qiu/8q6 +", 
z =w —qu/qe +"; | 
2 — (5qe — 2a) u/3 (qs —a) + (Ja— 4a) qr?/9 (gi — a)? + ow +: ` >, 
(ci) y = (4ge—a)u?/3 (298 —a) + 2 (qe— 4a) gru®/9 (Rge—a)? + Rout +, 
2 = qet? / (298 — a) + (qa — 4a) qrut/8(2qa — a)? + Bou 4-- = ; 


z =— 5u/6 + qru?/36q8 +", 
(cs) y = 2u8/3 + qru*/18qe + "7, 
z= U/2 + g/g +; 
T= go/1qr + (21 gop: — 8409s + 42g) U/499 +, 
(cs) y Rqeu/Tqr + (42qop2 — 16gogs + 35qr°)u°/49qr +, 
z = Bqeu?/%qr + (63qep1 — 24qoqs + 28g) W /499° +: 7. 
The theorem follows readily from these equations. 
The dual theorem is 


THEOREM 7.2. The planes containing both a tangent to T and a tangent 
to Ta form an axial pencil through the z-axis and a residual family C’ of 
planes, whose edge of regression ts a curve O”. If a— qey then O” consists ` 
of a single branch having four-plane contact with T at 0. If a qe, then 
C” has four branches three of which are linear, each of the three passing 
through 0 and having four-plane contact with T. If a— 0, the fourth branch 
of O” passes through the point of Bompiani W of Theorem 5.1, and has for 
tangent and for osculating plane at W the avis of Bompiani (24.1) and the 
principal plane (28.1). If a—=—4qo/2, the fourth branch of O” passes 
through the point of Bompiani (22.2) where it has the asis of Bompiani 
(21. 2) as tangent and the plane x as singular osculating plane. If a = — 2Qo, 
the fourth branch of O” passes through the point of Bompiani (15.2) where 
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tt has the z-axis as tangent and the plane x as osculating plane. For all other 


values of a the fourth branch of C” ts linear, passes through O and has two- 
plane contact with T. 


There are certain interesting relations between these two dual theorems, 
but we shall not consider them here. — 


6. Osculating quadrics at a point of a curve. The self-dual quadric 
(10) was found to have both three-point and three-plane contact with T at 0. 
Although there are oo® quadrics with this property, there is no non-singular 
quadric having both four-point and four-plane contact with T at 0. 

The general six-point quadric of T at 0 has the equation 


(28) mi (yiys — ys) + Mo(yiÿe — Yas) + ms (Y? — Yaya) + mays? = 0, 


where the m; are arbitrary, If this quadric contains the five-point cubic Ta, 
then 


“am, = 0 = am, — Ma, ` 
while if the quadric is to have seven-point contact with r at 0, then 
| maqo +m, = 0. 
Hence we have the following theorem characterizing the cubic T4. 


Tarorem 8.1. There exists a one-parameter family of quadrics having 
at least six-point contact with T at 0 and containing the five-point cubic 
Ta (a 40). Neglecting the quadric cone K',, seven-point contact is obtained 
if, and only if, a = — ge. 


The dual theorem, ‘chick will be omitted, characterizes the -cubic 
Ta (a =—.3qa). Another characterization of T4, is due to Sannia? The œ? 
quadrics having seven-point contact with T at O have in common just two. 
points — 0 and the residual intersection 
(qe 0, 0, 1) 


of the line OH (25.1) with the five-point cubic Taye Dually, the co? quadries 
having seven-plane contact with T at 0 having in common just two tangent 
planes — x and the plane 


B7qo°¥1 — 189qe*Gry2 + 441qegr Ya + (81qe* — 343q,°) ya — 0, 


which is coaxial with m and @ (25.2) and osculates the five-plane cubic 
Ta (a = 3ga). i š 


18 Loc. cit. 
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The fundamental tetrahedra are related to the seven-point quadrics 
according to the following theorem. 


THEOREM 9.1. Hach point Pa (£0) on the osculating conic K, de- 
termines a unique seven-point quadric whose intersection with w is a conic 
tangent to K. at 0 and at Pa. The tangent plane to the quadric at Pe 
osculates the six-point cubic To, the tangent plane to the quadric at 0 is the 
face OP P; common to all fundamental tetrahedra which have P, for a vertes, 
and the polar with respect to the quadric of the vertex P, of these tetrahedra 
is the face OPP}. 


If, in particular, the point P, is chosen at (0,0,1,0), then the quadric 
reduces to the cone 
(29) y= 
with vertex at the Halphen point. 

The dual theorem states: 


THEOREM 9.2. Hach plane OPP; (7) tangent to the osculating 
quadric cone K', determines a unique seven-plane quadric whose cone of 
tangents through O touches K’, along x and along OPPs. The contact point 
of the quadric with the plane OP,P, lies on the six-plane cubic Ta (a = 2q6), 
the contact point of the quadric with x is the vertex P, common to all funda- 
mental tetrahedra determined by the plane OP,P;, and the pole with respect 
to the quadric of the face OPP, of these tetrahedra is the vertex Pa ` 


7. Consecutive configurations. A manifold M geometrically defined 
for each value of the parameter u of the curve I’ generates or envelopes 
another manifold M’ as u varies. Several five-point twisted cubics can be 
characterized in this way. 

As u varies, the five-point cubic Ta (a = const.) generates a surface Sa 
and the osculating planes of the dual cubic Ta (a + a’ == 2gs) envelope the 
dual surface Sq. The tangent planes to Sa along Ta form a developable Da, 
while dually the osculating planes of Ta are tangent to Sa along a curve Dw. 
Then, 


THEOREM 10.1. D, is the tangent developable of a twisted cubic except 
in the following four cases. If a == — 44, Da is a cubic cone with verter at 
the point whose codrdinates are 


(30. 1) (149, 5qe, 0,0). 
If a = — 296/3, Da is a cubic cone with vertex at a point, | 


(31.1) (49qr, T0Goÿr, 15Qe?, 0), 
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lying on the osculating conic Kz, If a—0, Da is the quadric cone (29).19 
If a= 64s, Da is the osculating quadric cone K'a That is, the characteristic 
curve at O determined by the set of all osculating cones K’, is the cubic 
Ta (% == 695) (and the x-axis) ; stated differently, the generators of K's form 
a congruence as the parameter u varies, and the surface Sq (a = 6ge) is one ` 
focal surface The cubics Ta on Sq have an envelope other than T if, and 
only if, a = 6qo, the contact point on the envelope associated with O being at 


(343q = 750ge*, 245909", 175g," Vi 125qst). 


The proof runs as follows. A point P on I near 0 ee a canonical 
tetrahedron H,(P). If lôcal homogeneous codrdinates referred to this tetra- 
hedron are denoted by Y4, then the equations of the cubic Ta associated with 
P are 
(32) Foi Yr Vs, Yom, 


Now let P have non-homogeneous coordinates (h, k,l) referred to tetrahedron 
H, at 0. From (9) and the equation 


Y; = y; + (dyi/dz)h +--+, 

we have 

pEi = qoyi — (2lprgs + Lego%ys)h + ` 

p2 = qY: — - (YoY: — YqrY:/3 + TOR + 6qe? ce +: 

Ps = goya — (goys — 1441/3 + Tprys)h +` 

pls = Toys — (3Goÿs — Vqrys)h + + ete 
Solving for y, and using (32) we obtain 
Ags = go(1— art) + (21 pit + 12qes*)h +: +, 
AY: = Got + (Ga — Vq1r/8 + Lpr- qèr — ager*)Jh+---, 
AYs NS: Ger” + (2gor— 14q777/3 + Tp? yh + er “4 
AYs = gor? + (Bgo? — Tir )h +, 
as parametric equations of the surface Sa. When h = 0, t= t, the tangent 
plane to Sa has the form : 


(33) 


ga (6gs — a) ty, — (12g? + Baqe — Vagrt) tys + (695? + 9aqe — 14aq,t) ty 
— (Sage — Vaqrt — ee + aget?) yy = 0. 


19 Newton, loc. cit. Also Tsuboko, “On the locus of the space cubics osculating a 
space curve,” Alemoira of the Ryojun College of Engineering, vol. 10 (1937), pp. 63-74. 

* Wilczynski, “ General projective theory of space curves,” Transactions of the 
American Mathematical Soctety, vol. 6 (1905), p. 109. This result has also been 
obtained independently by Kanitani and Newton, loc. cit. This cubic has been called 
the torsal cubic of T at 0 by Wilczynski. : 
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As ¢ varies, this plane envelopes the developable surface D, whose su of 
regression has the parametric equations : 


Yı — 45age® (2qe + Ba) (4q6 + a) + 315a°qe (2q + &) grt — 147005 qeq t 

| ++ [686a5%,* + Dagu” (2ga + 3a) (4go + a) (6gs — a) lé, 
. Ya = — bag (Zgo + 3a) (Eqa — a)t + 210a°qs" (6gs — a) grt? 

| — 98a*qe (6gs — a) qr't, 

. Ys = — 46ag (490 + %) (Ege — a) t? + Zaga? (4ga + à) (69s — a) qrt", 

ya — 9go” (2G + 3a) (496 + a) (6gs — a) t°. | 


These equations yield all but the last statement of the theorem. To complete 
the proof we set á 
y, = 1 — at, y =t, P=., yet. 
in (33) and find 


t= r + (1— Ygrr/3ge —Vprr*/qe)h +: 7, 

t= r+ (1—Ygrr/3qe — Yprr?/a — Eqo" + art)h +--+, 

l m r + (1 — qrr/3qe — Tprr”/qe — qer? — Zar? + Yagır*/3qe. 
— 14apz7°/qa — bager? + arf) h/ (1 4 ar?) piee 

` The desired result then follows immediately. 

The dual theorem states: 


THEOREM 10.2. The curve Da is a twisted cubic except in the following 
four cases. If œ == qe, Da: is a curve of class three lying in the plane dual to 
the point (30.1). If a = 8q./3, Da ts a curve of class three lying in the 
plane dual to the point (31.1). If a == 2gs, Da: is a conic lying in the 
Halphen plane (25.2). If a ==— 4qe, Da is the osculating conic K3; that 
is, the surface Sq: ts thé locus of all osculating conics K,;7' stated differently, 
the tangents to K form a congruence us the parameter u varies, and the 
surface Sq (a’ ==— 498) is one focal surface. 


8. Projections of a space curve. When the space curve T and one of 
its five-point twisted cubics Ta are projected from an arbitrary point P (h, k,l), 
140, upon the osculating plane m at 0, the projected curves I”, 7’, possess 
certain interesting relations. 

The equations of IY are readily found to be 


(34) y = ot — kr/l + 2het/l— (3hk + 1)2°/P 
i + (7h? + 2k — qekl) tt /F +: +, g=0, 
while those of T”, are l 


à: Tsuboko, “On the locus of the conics osculating a space curve,” Memoirs of the 
Ryojun College of Engineering, vol. 10 (1937), pp. 11-17. 
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(35) y = 2? — kat /1 + Qhat/1— (3hk +144 al?) 25/2 
+ (Mh 4 2k p Dakl}at/E +--+, 20. 


These curves always have at least five-point contact, and hence have at 0 a 
common osculating conic K. Higher-order contact is obtained only when 
a == 0, as seen from (34) and (35) or from the theorem that the principal 
plane of T and Ta (a 340) is the osculating plane v. 

| The equations of K are 


(36) ya — key/l + (2h — k?) 92/12, z= 0. 
This conic has six-point contact with I” if, and only if, 
(37) m= l2 + 2k? — Bhkl — 0, 


and has six-point contact with T'a if, and only if, 
(38) R == p, + al? == 0, 


i.e. if, and only if, the center of projection P lies on the cubic surface (14). 
If P traces a line L through 0, the conic K remains unchanged. More- 
over, we can readily prove 


THEOREM 11. The conics K and K: T double contact if, and only +f, 
the center of projection P lies on the quadric cone K'a. If this is the case, 
the line OP and the chord of double contact of the conics are edges of a 
common fundamental tetrahedron. 


We shall be concerned in this section with the projective normal and the 
flex-ray of T” at 0,° and in order that these be well-defined we must have a 
non-composite osculating nodal cubic of I” at 0, which means that we must 
` assume that #0. This assumption furthermore prevents the center of 
projection from lying on the six-point cubic To. 

The osculating nodal cubic of I” at 0 can be shown to have the equations 


Pusey + Puy? re yet + P (km — v1) ay + LCR pa — hlp, + kri) xy? 
+ (m? + kv — do LE —0—2, 
where w, is given by (37) and : 


v, = Bhkl — hits — QE — 5k* — qokl®. 
Hence the projective normal of T” at 0 is expressed by 
(39) Var + vıy = 0 == 2, 
while the flex-ray is seen to be 


23 The flex-ray of I” at 0 is defined as the line of inflexions of the osculating nodal 
cubic of I” at 0. 
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(40) Pa? + lya (271 + km) + (km? — hlm? + ni?) y = 0 2. 
In the same way the projective normal at 0 of T”, is of the form 
(41) luot + voy = 0 = z, 
where y: is given by (38), and 
va = Bhk?l — hP — 2k — 5k — akl, 
while the flex-ray of T'a at 0 has the equations 
(42)  Ppè + lp (ku + 22) + (k?p? — hlp? + vo?) y = 0 = 2. 


THEOREM 12. As the center:of projection P traces a line L through 0, 
the projective normal of I’ varies in the pencil at 0 unless L is a generator 
of the quartic cone 
(43) (y? — az)? + qoy2? = 0. 


Neglecting the x-axis which must be excluded, this cone meets the quadric 
cone It’, in the z-axis. As P traces the z-avis, the projective normal of I” 
at 0 remains coincident with the y-axis. 

As P traces a line L through 0, the projective normal of T’, varies unless 
L is a generator of K’,, As P traces a generator OP, of K's, the projective 
normal of T'a at 0 coincides with lhe edge OP; of the fundamental tetrahedra 
determined by OP. 


The proof offers no difficulties. In (39) and in (41) we replace (A, k,l) 
by homogeneous codrdinates (k:,:::,h4) and demand that the resulting 
equation be independent of hı. In both cases we obtain 


haz — Rhsy = 0 =z, 
so that 
Rlhapu + hav ras 0 ” (i = 1, 2) . 


Replacing u; and v; by their values we have the results immediately. 
The locus of all centers of projection. P determining curves I” which 
have at 0 a fixed projective normal, say 


(44) T+ my = 0 =z, 
is a fourth-order surface S*, determined by the condition 
(45) ` lum — n = 0. 


The only five-point twisted cubic Ta which lies on this surface is the one for 
which a = q/2, this sttuation occurring only when the gwen projectwe 
normal of I’ at 0 is the y-aris. 
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If the projective normal of T'a is chosen as the same line (44), then P 
generates another fourth-order surface 8*, determined by the condition 


(46) : lugm — va = 0. 


The surfaces S*p and S*, coincide if, and only if, both a = qẹ/2 and (44) 
is the y-axis. If a==q/2 but (44) is not the y-axis, these surfaces meet 
only in the plane + so that there is no center of projection yielding coincident 
projective normals at 0 for I” and T'a. If a4 qe/2 and (44) is the y-axis, 
the surfaces meet in the z-axis. In all other cases the surfaces intersect in a 
non-composite conic. 

From (45) and (46) we ae. 


(47) l == (gs — 2a)k/am, 
and upon substituting into (45) we find as the equations of the conic 
| (qe — 2a)y — amz — 0, 
qe (qe — 2a) mz + (gs - — 2a) 4x? — am? (qe — 2a)? (3qe + 2a) az 
, + am[aèm? (2gs + a) + qo (qo — 2a)*]2? = 0: 
As m varies, « remaining fixed, these conics generate’ a surface whose equa- 
tion is 


(25) qay + aa? — (3q + 2a)ayte + (2go + a)y" (a70) 
+ agoy2° = 0 : 
This surface is composite if a == — 2g, and EER when «= 0 into the 


plane y == 0, as is evident from (47). 

- The flex-ray of T” at 0 is tangent to the osculating conic K of T at the 
point of intersection of flex-ray and projective normal. A similar statement 
holds for T'as Hence, as P -traces a line L through 0 the envelope of the 
flex-rays at 0 of the curves I” is the conic K unless L lies on the cone (43). 
The following theorem can be readily proved. : 


THEOREM 13. Let the center of: projection P trace a five-point cubic 
Ta (40). If a = 294/8, the flex-rays at O of the curves IY form a pencil 
through the point (0,1,0,0). If a = 6/3, or tf a = qe, the flec-rays all pass 
through the point (0,0,1,0). In all other cases the flex-rays at 0 of Y 
envelope a non-degenerate conic. This conic has three-point contact with Ka . 
at 0 tf, and only tf, à =m 4/3. . 


If P traces a five-point cubic Ta, the flex-ray at 0 of Tu (a 3&8) f 
envelopes the conic K; for all values of «, 8. 
Let the flex-ray at 0 of I” be a given line, say 


(49) a: re + sy +1=0=2. 
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. Then from (40) we find that the center of projection P lies on a space curve 
Oy which is the intersection of a quadric cone . 
(50) 8hl — 5k? + 2rkl — (r° — 48)? = 0 


and a quadric surface 
| w + 50gekl = 0, 
where 


o m Yök + ör] + 42h? + 181rhk + (r? —4e)hd + (12r? + 1608)? + 
— (2675 — 104rs) kl. 


Hence Cp has a node at 0 with the z-axis as one tangent. If the residual 
tangent at 0 lies on the cone K’, then the flex-ray (49) is the edge PiP: 
of the fundamental tetrahedra determined by the residual tangent, and con- 
versely. Ordinarily Cp is a quartic curve, but under the condition 


(ri a 3s)? — 12ger = 0 


it consists of a generator of the cone (43) and a twisted cubic. 

If the flex-ray at 0 of 7’, is chosen as the same line (49), then the 
center of projection P lies on a curve Ca which is the intersection of the cone 
(60) and a quadric surface 


w -+ V5akl + 25arl? == 0, 


The curves Cp and Ca coincide if, and only if, both a == '2qo/3 and the fiez- 
ray (49) passes through the point (0,1, 0,0). 


If a — 2g6/3 but the flex-ray does not pass through (0,1,0,0), there 
are no centers of projection P yielding coincident flex-rays at 0 for IY and T'a 
If «+ 2q./3 and the flex-ray is the line yı = y4 == 0, then the locus of P is 
the z-axis. If «54 2q4/8 and the flex-ray passes through (0,1,0,0) but not 
` through (0,0,1,0), then there are no points P. In all other cases there is a 
unique center of projection P which determines coincident flex-rays at 0 for 
T” and T”,, the locus of these points P being the surface (48). 

Results of interest can also be obtained by studying other elements asso- 
ciated with I” and Ta, such as the focal point on the projective normal * 
or on the. flex-ray, the Halphen point, the condition for a coincidence point 
at 0, ete. 
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THE UNLOADING PROBLEM FOR PLANE. CURVES.* 


By Parsiox Du Vat. | 


This paper relates to an earlier one of mine in this Journal, and uses 
the same notation. In particular, Clarendon type indicates matrices, capitals 
being used for rows or columns of geometrical entities and small letters for 
numerical matrices. (By an oversight for which I apologise, E was used 
' instead of e in the former paper for the unit or identical matrix.) The 
transpose of a matrix is indicated by ~; any inequality between matrices is 
understood to apply to corresponding elements, i. e., h = k means hag = kag 
for all a, 8, and in particular h = 0 means hag = 0 for all a, £. 

It is familiar that if O = (0,---+ 0) is a set of distinct points in a plane, 
and h = (hi: - <ha) an arbitrary row of non-negative numbers, then curves . 
of sufficiently high order exist having multiplicity he in Oa (@2—1,---,9), 
and no other multiple point, and not all passing through any other one point; 
but that this is no longer true if some of the points © are in the neighbour- 
hoods of others, unless certain inequalities, which we call the consistency 
conditions, are satisfied by the assigned multiplicities h. In fact, since the 
multiplicity of a curve in any point is the sum of its multiplicities in points 
proximate to that, 2 the consistency conditions are 


(1) l mh = 0, 
where m is the matrix defined in the paper referred to, in which 


Mag == 1 if a== $, 
>- ——1if Og is proximate to Oa, 
==0 otherwise; _ 


we shall call it the proximity matrix of the points O. 

It is also tolerably familiar that the conditions of having multiplicity 
ha in Oa and: multiplicities kg: - : he in Og- -Oe (proximate to Oa) are 
formally satisfied by curves whose actual multiplicities in these points are 
ha + 1,hg—1---he—1 respectively; since ‘these conditions reduce essen- 


* Received August 15, 1939. 
?P. Du Val, American Journal of Mathematics, vol. 18 (1936), p. 285. 
? F, Enriques and O. Chisini, Teoria geometrioa delle equaeioni, vol. 2, pp. 425- 438. 
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tially to having at least ha coincident intersections with every simple branch 
through Oa, at least ha + hg with every simple branch touching OO, etc. 
This is the unloading (scaricamento) principle of Enriques è in its simplest 
form. The alteration in the multiplicities consists of adding to the row h the 
a-th row of the matrix m; so that generalising the process we clearly have: 


(2) The conditions of having multiplicities h in points O are formally 
satisfied by curves whose actual multiplicities there are 


h + xm, «x= 0. 


This we may regard as the general statement of the unloading principle. 
The question arises, if we attempt to impose on a curve multiplicities 
not satisfying the conditions (1), what multiplicities will it in fact have? 
Enriques* asserts that this question can always be answered by the applica- 
tion of the unloading principle, and of another which he calls that of 
smoothing or evening (scorrimento). The latter is in fact the solution of the 
problem, as far as it concerns a set of points consecutive on a simple branch, 
and with only one of the inequalities (1) unsatisfied, namely that which 
relates to the last point but one of the sequence. The attack on the problem 
in general is not given explicitly, but is illustrated by æ comparatively simple 
example, though it seems to have been generally regarded as clear that a solu- 
tion can always be arrived at by a finite number of unloadings and smoothings. 
What I shall now shew is that, given perfectly arbitrary proximity relations 
between the points, and perfectly arbitrary assigned multiplicities, h, we can 
always find a set of numbers k such that: 


(a) k are consistent actual multiplicities; i.e., satisfy (1). 


(b) Curves with multiplicities k formally satisfy the conditions for having 
the assigned multiplicities; i. e., k == h + am, x= 0. 


(c) All curves satisfying (a), (b), are formally contained in the system that 
will be found. 


Eliminating k between the conditions (a), (b), it is clear that the 
inequalities (1) reduce to 
m(mx +h) = 0, 
which we rewrite in the form | 
(i) az +cz0; 


2 F, Enriques and O. Chisini, Ibid., vol. 2, pp. 426-438. 
* F. Enriques and O. Chisini, Ibid., vol. 2, pp. 425-438. 
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where ë = mh, and a= mm = — n is the negative of hte intersection matrix ` 
of the diminished neighbourhoods L of the points O, as explained in my 
former paper. What remains of the Condition (b) is of course just the 
inequalities 

Gi) x= 0; 
e 


while the Condition (c) clearly means that the solution x of the inequalities 
(i), (ii) which we seek is such that if x’ is any other solution, 
xx. 

The matrix a is symmetrical and positive definite, and being the negative 
of an intersection matrix of distinct irreducible curves, has no positive element 
off the diagonal. It is of unit determinant, and thus a™ also consists of 
integers; we prove first of all that a?-= 0. An algebraic proof of this was 
first given by Coxeter,® some years ago. I subsequently noticed that the result 
is equivalent (interpreting the matrix as the scalar product matrix of a set of 
vectors in Euclidean space) to the theorem that a spherical simplex which has 
no obtuse dihedral angle has no obtuse edges either; and of this it is not hard 
to construct an elementary trigonometrical proof. The simpler argument 
which I give here was suggested to me by Mahler.’ 

To say that at = 0 is the same as to say that az = 0 implies z = 0. 
Suppose if possible that some of the z’s are negative, whereas az = 0; and let 
z’ be the row of just those of the z's that are < 0, the rest being omitted, and 
a’ the diagonal minor of a obtained by omitting the rows and columns corre- 
sponding to the columns omitted in #. Then a fortiori a's’ =0, since the 
terms omitted are all of the form @agzg, where za < 0, zg = 0, so that a =£ 8 
and dag = 0. Consequently va'z <0, whichis impossible, since a’, being a 
diagonal minor of the positive definite matrix a, is itself positive definite. 

We conclude that 


(3) If y ts chosen to satisfy the inequalities 
y= 0, yte=0, 
then x gwen by s 
x=—a ly 
is a solution of, the inequalities (i), (ii). 


5H. S. M. Coxeter, Annals of Mathematics, vol. 35 (1934), p. 601. 
8 In conversation. 
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(Clearly the second of these conditions is identical with (i), and the 
first implies (ii) ). 

We next observe that if s*a is the least value of za in all solutions x of 
(i), (ii), then the row of numbers x* is itself a solution. (The proof of this 
was suggested to me by Rado.) For (ii) is clearly satisfied; and as regards 
the a-th of the inequalities (i), if 2’ is a solution in which +, = g*a, we have ` 


Baage" p = Zaat" p, 


since the a-th term is the same on both sides, and in every other term 
dap = 0, 2*g S 2g; and hence of course 


2 apita + Ca = D dap’ + Ca f 
| = 0. 
In other words, 


(4) There emists a solution x* of the inequalities (i), (ii), such that tf x is 
any other solution of them 
x > x*, 


Putting x* for x in (b), we obtain the value of k satisfying (c). 

An explicit formula for x* in terms of a, ¢ is not easy to find. I am able 
only to give a method of finding it which involves a finite process of trial. 
For this it is convenient to drop the restriction on x to be a row of integers, 
and consider instead a row x of real numbers. The foregoing argument is 
practically unaltered, and we conclude that: there is a minimum solution 2*- 
of the corresponding inequalities j os 

(7); GY) a4+e=0, 220; 
now Erdôs ® has remarked that for every q, either a = 0 or app = 0; 
for if z is a solution in which z > 0, > aagzg > 0, then % can be diminished 
B 
` without destroying either of these inequalities, the rest of (ii’) will be 
unaffected, and all the rest of (i’) will be strengthened, since in each of them 
the coefficient of za is <0. Thus we see that 


(5) If x is the row obtained from 2* by omitting all elements that vanish, 
and a,c are obtained from a,c by omitting the rows and columns corre- 
sponding to these, then 

ax + = 


T In conversation. 
® In conversation. 
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Another form of this result is the following: 
(6) 2% == — be, 


where b is a matrix in which certain rows and the corresponding columns 
consist entirely of zeros, and the diagonal minor obtained by omitting these 
is the inverse of the corresponding minor of a. 


I have not been able to find any direct criterion to determine which are 
the vanishing z*s. It is easily seen that if za == 0 in a solution of (ï), (i), 
then Ca = 0, ha = 0, and ga = 0 (where g = m h = a*t) ; these conditions 
however are only necessary and not sufficient for the vanishing of 24. In an 
actual case we should first try putting bag <= bga = 0 for all values of « satis- 
fying these conditions, then for every combination: of all but one of these 
values, and so on, constructing each time the matrix b and the row 2* in 
accordance with (6); the rows we obtain will all satisfy (ii”), and the first 
one to arise satisfying (i’) also is in fact #*. From this of course we obtain 
x* (which is what we really want) by the obvious relation 


(7) g*a tg the least integer which is = 744. 


THE UNIVERSITY, 
MANOHESTER. 


A COMPLETENESS THEOREM.* 
By R. P. Boas, Jr. 


1. Introduction. This note and the following one developed out of 
the problem of proving that the set of functions 


(1.1). ema, gema (n=0,=1, +2, -) 


is complete in L?(— ~,r). This problem is equivalent to the problem of 
showing that an entire function F(z) of the form 


(12) + Paya f fadh, f(t) er), 


is identically zero if F(2n) = F (2n) = 0, n == 0, +1, +2,---; this 
theorem is easily proved.? If the original problem is generalized by replacing 
the multiplier v in (1.1) by a more general function G(s), it is more satis- 
- factory to attack the problem directly; some uniqueness theorems for entire 
functions can be obtained as corollaries. In the following note, on the other 
hand, it is the uniqueness theorem which is generalized; the two kinds of 
generalization lead in different directions, and are studied by different methods. 

In this note, I shall establish the following completeness theorem, which 
is quite easily proved once the correct formulation has been found. 


 Taxorem 1. Let G(z)eL*(—x,x). The set of functions 
(1. 3) gime, G(x) ent) ta, . (n=O, +1, = oa 
* ts complete n L?(—7x,7) if and only if 


(1. 4) G(a +r) + a(z) 50, —r <r <0, 


except perhaps on a set of measure zero. 


* Received November 10,. 1939. 

1 Most of the results of this note were obtained while the author was a National 
Research Fellow. | 

* It is contained in Theorem 1 of the following note (“Some uniqueness theorems 
for entire functions,” American Journal of Mathematics, vol, 62 (1940), pp. 319-324). 

5 Since completeness and closure in L* are equivalent properties, any element of L? 
can be approximated, in the metric of L*, by a sequence of linear combinations of the 
functions (1.3). If (1.4) is replaced by the stronger condition that @(w) is essen- 
tially bouhded and |@(e-+ 7) + G(æ)| >8>0 almost everywhere, the functions 
(1.3) are easily shown to have the stronger property-that any element of Z> can be 
expanded in a series of them, the series converging in the L* metric ( converging in 
the mean). ` 
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COROLLARY. The set 
gemo, G (x)gm™e (n—=0,+1,+2,---) 
is complete in L'(— 7,7) if and only if 
G(z+7)—G(z) 40, —7r<z<0, 
except perhaps on a set of measure zero. 


This follows from the theorem if G(x) is replaced by QG (x). The 
corollary, with G(x) = z, corresponds to the original problem; we formula- 
tion with the set (1.3) is more suitable for generalization. 

The general problem of determining necessary and sufficient conditions 
for the completeness of 

{aime G (z) etre}, 


where {ma} and {nx} are mutually exclusive sequences containing all the 
integers between them, appears to be difficult; but the case in which {nx} is 
an arithmetic progression is easy, and not essentially different from Theorem 1 
(see Theorem 5). Another generalization, in which the set of Fourier func- 
tions is broken into more than two sequences, is discussed in 8 4. 

By means of a theorem of R. E. A. C. Paley and N. Wiener, Theorems 1 
and 5 can be transformed into uniqueness theorems for entire functions of 
exponential type. Let Ws be the class of entire functions of exponential 
type* r, belonging to L? on the real axis. I state only the theorem equivalent 
to Theorem ‘1. 


THEOREM 2. Let g(2) e Ws; let G(t) be the Fourier transform of-g(2). 
A necessary and sufficient condition that every f(z) e Ws, satisfying 


(1. 6) fn) = f f(@)9(@n-+1—a)de— 0, 
-00 
(n=0,+1,+2,:--), 
ts identically zero, ts that 
(1.6) G(t+n) +G(t) 40, —r<i<0, 
except perhaps on a set of measure zero. 


A slightly less general theorem, with a considerable formal difference 
from Theorem 2, can be obtained by application of a theorem of S. Bochner. 


Tueorem 3. Let A be a linear® operator from L?(—«, œ) to 


‘The entire function f(s) is of exponential type c (o > 0) if |f(e)| < Ae*ll. 
5 Linear” means “additive, homogeneous, and continuous.” 
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L?(— 0, œ), permutable with differentiation A necessary and sufficient 
condition that every f(z) e Wx satisfying 


(1.7) Fn) — A(F(2n+1)}—0, . (n—0,41,42,-°°), 


ts identically zero, is that 
: » | 
(L. 8) f A} SBT À cos TE tqu 7 0, = 
-0 , u 2 


except perhaps on a set of measure zero. 


E 
2 


For comparison with Theorem 2 of the following note, I mention the 
following special case of Theorems 2 and 3. 


THEOREM 4. If f(z) «Wr, gisa Ponnu integer, and 


(1.9) f(2n) =f% (2n) —0, (n—0, +1,42), | 

then f(z) = 0. i 
Here Q(t) is equal to (it)te-t* on (—r, r), ‘and vanishes outside 

Cam A is defined on f(x) e Ws by the relation A{f(2)} — f@ (z— 1). 


2. Proof of the completeness theorem. Let G(x) ‘be a function of 
_L?(— r,r), having the Fourier series 


(2.1) KOR > ; pet, 


If B is a sequence of integers, I denote by G(s) the function whose 
Fourier series is the part of the Fourier series of G(x) with exponents in B; 
that is, 
(2. 2) G(x) ~ 2, yer”. 


I shall prove the following theorem, which includes Theorem 1 as a special 
case (when N is the set of odd integers). 


THEOREM 5. Let N be an arithmetia progression with elements a + kb, | 
b20 (k=0,41,+2,-- -); and let B be the set of all integers kb 
(k=0,+1,+2,---). Let G(r) «L*(—2a,7). Then the set of functions 


è That is, when f and g are elements of L?(— ©, ©) such that g(@) =f (@), we 
have [Af(a@)]’ = Ag(@). 

7 We can state a theorem, similar to Theorem 3, but entirely equivalent to Theorem 
2, by introducing the space L* whose elements are functions f(æ) which are Fourier 


E ~~ 
transforms ‘of elements F(t) of L(—©,©), with the norm || f || =f | F(#) | dé. 
CO 


Then Theorem 8 remains true if A is a linear operator from L?(—™,~) to L*, 
permutable with differentiation. 
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(2. 3) eimo, G(x) etme (me dN, nee N) 
ts complete in L?(—-x, r) if and only if | ti 
(2.4) | Gale) bot res 


except perhaps on a set of measuré zero. 


To establish the sufficiency of (2.4), we have to show: that if it is 
satisfied, if F(s) « I?(—-, r), and if i 


(2.5) L F(æ)eimsdz — 0, (k= 0,-+1,42,- +), 
and . . : oH 
(2. 6) E F(a) G(x) e™*dx = 0, RE os, 


then F(z) == 0 almost everywhere. © ; 
Relation (2.5) shows that F(s) has a Fourier series of the form 


(2.7 | F(a) ~ Š morte, à 


Let G(z) have the Fourier series (2. 1), and let Gsa(z) be defined by (2.2). 
Then, from (2.6), we have — 


0— f F(E) Go(ayemae + 3 > ef _P(a)otomneda, 


Since me + sé’ N if ngee N and se’ B, the series on the oe is zero, by (2. DE 
Thus we have 


J. F@)Go(a)emedn — 0, (k = 0, + 1, +R: s 
-r « 
so that F(x) @s(x) has a Fourier series of the form 
(2. 8) F(2)Gr(x) — À Bret, 
k=-co 


But the Fourier series of F (x) Gs(x) can be obtained by formal multiplication 
of the Fourier series of F(x) and Gz3(x) ; consequently, by: (2. 7), 


(2.9) P(e) @a(a) ~ È acim, 


since ns — lb e N for any integer l. 

Now (2.8) and (2.9) are in contradiction .unless F(z) Gale) = 0 
almost everywhere; since, by (2.4), Ga(x) is almost nowhere zero, F(z) is 
almost everywhere zero. This completes the proof of the sufficiency of (2. 4). 


. 816 R. P. BOAS, JR. 


To establish the necessity. of (2.4), we suppose that it is not satisfied. 
We may suppose that Q(z) is periodic with period 27, and not almost every- 
where zero, since the set (2.3) is certainly not complete if G(x) — 0 almost 
everywhere. Let E (of positive measure) be the set of zeros of Gy(z) in 
(— 7,7); let C(x) be the characteristic function of E. We have b=£0 
(since if b == 0, Gz(z) == yee! and has no zeros); then Gs(a) has period 
2r/b, and hence C(x) has period 2r/6. | 

Now let F(z) =— etC (x). Then 


f T B(x) G(a) eoade — f 7 O(a) G(w)e*ede — 0, 


- since C(x)G(z) = 0 for all æ. - 
Thus (2.6) is satisfied. Also, since any ie m'which is not in N 
bas the form m = a + kb + c, 0 < « <b, we have 


E F(T) etme de = f C(x) oto sd = 0, 


since ef? (0 < c < b) is orthogonal to every function of period 27/0. 

We have therefore constructed, if (2.4) is not satisfied, a function F(x) 
of LA, differing from zero on a set of positive measure, and satisfying (2. 5) 
and (2.6). Hence (2.4) is a necessary condition for the completeness of 
the set (2.3). | 


- 


8. Deduction of Theorems 2, 3, 4. Consider two functions 9(#)s f(z) 
of Ws. By a theorem of Paley and Wiener,® 


(3.1) f(s) = fera, Peba 

(3.2) g0) = f oradi, GU) eL(— r); 

by Plancherel’s theorem, 

(3.3) [rene f7 ett F(t) G (i) dt. 

Thus if (1.5) is satisfied, we obtain 

(3.4) fe etm P(t) dt — 0, Cn RES eee T 


(3. 5) f emt (L) G(t) dt = 0, (n= 0,+1,42,-°--). 


8R. E. A. C. Paley and N. Wiener, Fourier Transforms in the Complex Domain, 
1934, p. 13. For another proof, see M. Plancherel and G. Pélya, “ Fonctions entières et 
intégrales de Fourier multiples,” Commentartt AHathematici Helvetiot, vol. 9 (1936- -87), 
pp. 224-248; pp. 228 ff. 
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If (1.6) is also satisfied, we obtain, w Theorem 1, PEJ- = 0 almost every- 
where, and consequently f(z) = 0. 

On the other hand, if (1.6) fails, there is a function F(t) of L?, differ- 
ing from zero on a set of positive measure, and satisfying (3.4) and (3.5). 
The functions f(z) and g(z) defined by (3.1) and (3.2) then belong to Ws 
and satisfy (1.5). This completes the proof of Theorem 2. j 

If G(t) == (it)? on (—7+,x), the function defined by (3.3) is 2af (2), 
and Theorem 4 follows. 

We now consider Theorem 3. From a Mae of the us A 
given by Bochner? it follows in particular that if f(z) is a function of Ws 
having the form (3.1), then 


(3.6). A{f(2)} = fT aO, 
where G(t) is essentially bounded ; conversely, any essentially: bounded a(t) 
defines, through (3.6), an operator A having the properties specified in 
Theorem 3. | 
We write 
g(a) f " etetG(t) dt, 

so that g(z) e Wr. From (3.6) and (3.3) we have 

1 oo g 

MF) =g S f(@)g(u—2) de. 


Theorem 3 now follows from Theorem 2 ify we shoe that (1.6) and z 8) are 
equivalent for any given G(t). But we have 


a D etat; 
u 
aji sia SAN eit G(t) dt; 
@(t) = if. a f Sam = ete dy, mr LÉi<LT; 
GE) + GE) -i7 {us etes 4 1)du 


2.. Pare 











= À cos TE réa, 


«7 -00 


where 8 = t + 47, —r<i<0. 


°S, Bochner, “Ein Satz fiber lineare Operationen,” Mathematisohe Zeitschrift, 
vol. 29 (1929), pp. 737- 743. 
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In the same way we can establish the theorem stated in footnote 7. We 
need for this the result thab any linear operator A from L? (— œ, œ) to the 
space L* (defined in footnote 7), permutable with differentiation, has the form 


AC) = fT ROG), (ele, w), 
where : 


f(z) = [© siF(t)dt, F(t) eL2(— 0, 0). 


This theorem can be established by an appropriate modification of Bochner’s 
proof of his theorem cited in footnote 9. 


4 A generalization. It is natural to generalize Theorem 5 by breaking 
the set of Fourier functions into three or more sequences instead of only two. 
The results which can be obtained in this way are sufficiently indicated by a 
special case. Let F(a) belong to L? and have the Fourier series 


X 
F(z) — $ ae? 
== 00 


LS > Gaves? + >; Aat n is + > llay OD) iD, 


Let the functions whose Fourier series are the three sums on ‘the right be . 
respectively Fo(z), F:(x), F2(x). Then we have the following theorem. 


THEOREM 6. If G(x) and H(x) belong to L?(—-7,7), the set of 
functions 
(4. 1) genta G(x) end to, H (x) e(@nt2) ts 
(n=—0,+1,+2,:--) 
is complete in L?(— r, r) if and only tf 


G(x) Ho (£) — Gi(r) Het) 0, —aw7SeSr 
except perhaps on a set of measure zero. 


This can be proved in the same way as Theorem 5. 
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SOME UNIQUENESS THEOREMS FOR ENTIRE FUNCTIONS.* 
By R. P. Boas, Jr 


1. Introduction. A theorem of Valiron ? states that an entire function 
of exponential type? k < x, having a zero in each interval (n,n + 1) of the 
real axis (n = 0, + 1, + 2,- - -), is identically zero. If we let the zeros run 
together in pairs, and slightly weaken the hypothesis that the function is of 
type less than x, we are led to conjecture the truth of the following theorem. 


THEOREM 1. Jf f(z) is an entire function of exponential type such that — 


(1.1) (iy) = 0 (e), |y] o, k<; 
and j 
(1.2) f(2n) =F (2n) =0,  (n=0, + 1, £2, s); 


then f(z) ==0. 


It is easy to prove Theorem 1 by considering the entire function 
f (2) csc? frz, but this attack fails for the following more general theorem. 


THEOREM 2. If f(z) is an entire function of exponential type* satis- 
fying (1.1) and 


(1.3) f(z) = O(ell), |], 
with : 
T T 1 
(1.4) 1<Fett(1—-z+4), 
(q a positive integer), then f(z) =0 if 
(1. 5) f(2n) = fC (2n) = 0, (n= 0, +1, +2,6). 


This theorem resembles Theorem 4 of the preceding note; the function 
f(z) is now more general, but the condition (1.5) is more restrictive than 
the corresponding condition (1.9) of that note. We cannot use derivatives 
of even order in (1.6) as long as / > 0 in (1.3) ; this follows from Theorem 3, 
or more directly from the examples f(z) == [sin(r2/r) |", r—2,3,- °°, 


* Received November 10, 1939. 

1 This note was begun while the author was a National Research Fellow. 

2G. Valiron, “Sur la formule d’interpolation de Lagrange,” Bulletin des Sotences 
Mathématiques (2), vol. 49 (1925), pp. 181-192, 203-224; 213. I am not quoting the 
most precise form of Valiron’s theorem. ` 

3 The entire function f(#) is of exponential type © (o > 0) if | f(z) | < Aer". 

+ It would be enough to suppose that f(e) is of order less than 2; that f(z) is of 
exponential type would then follow from (1.1) and (1.3) by a Phragmén-Lindelôf 
theorem, 

=“ A completeness theorem,” American Journal of Mathematics, vol. 62 (1940), 
pp. 312-318. 
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I shall establish a still more general theorem in which the (2q — 1)-th 
derivative in (1.5) is replaced by a linear differential operator; this theorem 
is somewhat similar to Theorem 3 of the preceding note. 

Let $(z) be regular in the rectangle | «| < L, | y| < r, so that 


(1.6) $2) = Saw, |z| <mn(Lr). 

Writing D for d/dz, we can form, for any entire function f(z) of exponential 
type c, the expression 

(1.7) #(D)F(2) =È of” (2); 

* the series is convergent (for all z) if c < min(Z,7), and (as will be shown 


below) summable by the method of Mittag-Leffler in any case. With these 
conventions concerning ¢ (z), the following theorem holds. © 


THEOREM 3. A necessary and sufficient condition that sey entire func- 
‘tion f(z), of exponential type, satisfying 


(1.8) F(iy) = O(#W), |y] œo, E<7r; 

(1.9) f(z) = O(etlel), |a|— wo, l< L; 

and 

(1.10) Fn) = o(D)f(2n) = 0, (n = 0, + 1,+2,---), 


should vanish identically, is that 
(1.11) (a+ $ir)— e(z — dir) 0, || <L, |y|< r 


Theorem 2 is the special case where ¢(z) == 2217, To see this, we observe 
that the zeros zx of (z + him)" — (2— fir)", if n is odd, are determined by 
the equation 

gr F Ye = (ze — fir) 7r t/n, (k = 0, 1,2, > *,n—1). 


Then we have 
in 1+ ethrt/n 


Z; = — — 


9 Ja etkri/n 


= $r cot(kr/n), 
so that the 23 are real and outside (— Z, L) if 


r n—l_r r 1 
Sg OG gi =z ott (1-4), 
if n= 2m + 1. 


See, e.g., P. Dienes, The Taylor series, 1931, p. 311. 
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2. Preliminary discussion. Let f(z) be of exponential type and satisfy 
(1.8) and (1.9). By a theorem of Pólya,’ we can write 


(2.1) f(z) -Í oF (w) dw, 


where C is a curve containing the “conjugate indicator-diagram ” of f(z) in 

its interior, and F (w) is regular on and outside C. By (1.8), (1.9), and 

the convexity of the indicator-diagram, we see that we may take as C any 

curve outside the rectangle |u|—1, | v | = k, where w = u + iv; a suitable 

choice is the rectangle | u | = V, |v|—#, where k< k <r, 1<1V<L. 
Let the function ¢(z) of the theorem have the power series 


(3.2) $(2) = ar. 


y=0 


If p(z) is regular in a circle containing C in its interior, we clearly have 
f oF (w)p(w)dw — Sa, f eww F (w)dw 
c y=0 c 


(2.3) co 
— Daf (2) = o(D)f(z). 

If ¢(z) is not regular in a circle containing C in its interior, but is regular 
in the rectangle |u| < L, |v| <r, the integral on the left of (2.3) still 
exists for all z. It is clear that if the series Xa,w” is uniformly summable 
on C by any linear summation method, the formal calculation in (2.3) will 
still be possible, and the result will be that the series Xa,f? (z) is summable 
by the same method, with the integral on the left of (2.3) as its sum. Now 
the power series of ¢(z) is uniformly summable by any Mittag-Leffler method 
in any closed subset of its Mittag-Leffler star, which includes at least the 
rectangle |u| < L, |v| <r. If we take, for definiteness, the summation 
method’ defined by Lindelôfs function ° 


œ a” 00 : 
(2. 4) E(a) — 2 iea yr 2" X 
we shall have | 
sie Val 
J oF (wow) dw mim 5 





2 À 
> Sn (2) Cr Qt, 
n=0 

where 


Sa (2) — Saf (2). 


1G. Pólya, “ Untersuchungen über Lücken und Singularitäten von Potenzreihen,” 
Mathematische Zeitsohrift, vol. 29 (1929), pp. 549-640; 580 f. 

8 P, Dienes, Leçons sur les singularités des fonotions analytiques, 1913, p. 113. 

° P, Dienes, op. cit., loc. oit. 
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We are therefore justified in writing 
(2.5) o(D)F (2) =  eF(w)$(u) du. 
| 3. Theorem 3: sufficiency. We first prove the sufficiency of our con- 


dition (1.11). In the first place, if (1.10) is satisfied, g(z) — f(z)cschaz 
is an entire function, obviously satisfying 


(8.1) g(iy) = O(etamlsl), |y|— œ. 
It is easy to see that | 
. (8.2) g(t) = O(eïlsl), [e| 00. 


In fact, we have, for each 8 in (0, 2r), different from 0 or r, 


f(reté) = O (er tleosél+rk|sin 4) | r— o, 
and therefore 
g (ret) = O (er Heosd|-+ Gx) [ain d| ) 
= O (ertlcosél) , 

Since g(z) is of order one, (2.7) follows by the FRA NON theorem 
for an angle,** applied to g(z)e**. 

We now define a function #(w), regular in |u| < L, |v | < $r, by the 
relation | 


(8.3) Rup (2) — p(z + fri) — p (2 — Emi). 
Let O* be the rectangle | u | =? | v | = k’ — $m; since g(z) satisfies (3.1) 
and (3.2), we have 

gle) = f, erey(w) dw, 


where y(w) is regular on and outside C*. We define A(z) by 


(3.4) h(a) —¥(D)9(2) = f 7y (w)y(w) dw. 


Evidently A(z) is an entire function of exponential type, satisfying 
h(iy) = 0 (W), [y], 
(8. 5) , 
h(x) = O (elel), [æ|— 00. 
We are going to show that 
(3.6) h(2n) = (—1)"#(D)f(2n) (n—0,+1,#2,: 50), 


19. C. Titchmarsh, The Theory of Funotions, 1932, p. 183. 
1 Or by the theorem cited in footnote 10. 
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so that h(2n)—0, n—-0,+1,+2,---. Then, by Carlson’s theorem,” 
h(2)=0. But, if y(w) 0 in |u| <L, lv|<k (which is assumed in 
(1.11)), the function w(w) = 1/4 (w) is regular in the same region, and 


o(D)h(z) -Í fy (w) dw = g (2). 
From the representation of w(D)h(z) as a summable infinite series, it is clear 
that w(D)h(z) = 0 if h(z) = 0. Thus g(z) = 0, and consequently 
f(z) = g (z) sin $7z = 0, which we were to prove. 
It remains to establish (3.6). In what follows, any infinite series, 


00 
> An, 
n=0 


is to be understood as a Mittag-Leffler sum as defined in § 2, i.e. as 


. 1 & à 
bm Fay 2 049 È 4r 


We have 


#(D)f (2n) = È af” (27); 
then, since f(z) = g (2) sin 4nz, i 
BD (DOn = Say $ (7) omg (2n), 


E= 
where 


on = (— 1)*(sin dre) Ox) |ezan 
=— p[i” — (—14)"] (87). 
Now we have, uniformly for w + 4ir in a closed subset of the star of ẹ (w), 
and in particular for |u| SV, |v | S k’ — ẹr, 


&(0 + tir) — È w(u + Hn)? 


and there is a similar expression for (w — dir). Combining these expres- 
sions, we have (referring to (3.3)) 


oO La y 
bu) = Sa EC) cvs, 
p=0  p=0 \H 
the series being uniformly summable on C*. Consequently we may substitute 


this expression for #(w) in (3.4) and integrate “termwise” along C*, 
obtaining 


12K, C. Titchmarsh, op. ctt., p. 186. 
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h(2n) — > a ov-ng® (2n) 
= (— 1)" (D)f (en) 


by (3.7). This is (3.6); the proof of the sufficiency part of Theorem 3 is 
thus complete. | 


| 4 Theorem 3: necessity. ‘Suppose that (1.11) is not satisfied. We 
have to construct an entire function of exponential type satisfying (1.8) 
and (1.9) (with some k <r and I< L), and (1.10), but not vanishing 
identically. 
Since (1.11) is not satisfied, there is at least one point wo, with | uto | < L, 
| vo | < $r, such that 


(4.1) sak ta) E E E 
Let 

a — A) 

BO) = Toa E E 


where A(w) is an entire function taking the value tr at w == w, + dm; thus 
F(w) has residues + 1 and — 1 at w = w + hr. 
We take numbers k, l, such that 
|u| <1<L, |vtr|<k< r; 


then the points w, + dim are inside the rectangle bounded by the curve C: 
|u| =1, |o | =k. If we set 


f(z) = nf. FF (w) dw, 


it is clear that f(z) satisfies (1.8) and (1.9). Moreover, we have, calculating 
residues, 


f(2n) = 5 Í. eerop (w) dw = en Coté) ptm (wrt) un, 
C 
5 (n=0,+1,+42,:::); 


and 
$D) (Rm) ms fete F(w)$(w) dw 
= en) (105 + fin) — (Ag (wy — din) 
= (1) EB + Hie) — e(o fir) 
= 0, (n=0,+1,42,-°°), 


by (4.1). 
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ERGODIC CURVES AND THE ERGODIC FUNCTION.* 
By RICHARD KERSHNER. 


1. Introduction. Let M denote a bounded subset of the Euelidean 
plane and let e > 0 be fixed. Then, following M. H. Martin + we have 


DEFINITION 1. A continuous curve 


G) = Gt sœa(t),y=y(t);. OStS1; 

will be said to be «-ergodic to M (or to have the property (e) with respect 
to M?) if, for every point of M, there is a point of O at a distance Se. 
In general, an arbitrary set C satisfying this last condition will be said to 
have the property (e) with respect to M. 


DEFINITION 2. A continuous . rectifiable curve (1) will be called an 
e-ergodic curve for M if tt is e-ergodic to M and such that its length A(e) ts 
an absolute minimum for the lengths of all continuous rectifiable curves 
_ eergodtc to M. 


. DEFINITION 8. The length A(«) of an e-ergodic curve for M, considered 
for varying e, is called the ergodic function for M. 


Martin has shown ê that, for arbitrary M and e > 0, there is at least one 
` C= O (e) satisfying Definition 2, so that the function A(e) of Definition 3 
is well defined for all e> 0. This function is clearly non-negative and non- 
increasing with e. Recently * Martin has shown it to be a continuous function 
of e He had previously pointed out® that A(e) > œ as e—>0 unless M is 
a point set lying on a continuous rectifiable curve. In the last section of the 
present paper the description of the asymptotic behavior of A(e), for small e, 
is extended by showing that, for an arbitrary set M, 


lim 2eA(e) = meas À, 
Cod - 


* Received March 16, 1939, : 

7M. H. Martin, “Ergodic curves,” American Journal of Mathematics, vol. 58 
(1986), pp. 727-734. 

* Of. A. Errera, “Un Problème de Géométrie Infinitésimale,” Académie Royale de 
Belgique Mémoires, vol. 12 (1932), p. 4. | 

* Loo. ott., 1, p. 731. 

1M. H. Martin, “Note on the continuity of the ergodic function,” Bulletin of the 
American Mathematical Society, vol. 43 (1937), pp. 541-546. i 

5 Loo. cit., 1, p. 733. 
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where Mf is the closure of M. It should be mentioned that this last section 
can be read independently of what precedes it. 

At the present stage it seems hopeless to expect the explicit determination 
of C(e), for all « > 0, for even the simplest sets M of positive plane measure. 
However it is possible to find considerable information about the nature of 
C(e) both locally and in the large. The greater part of this paper is devoted 
to investigations of this nature. The main results are that C has no double 
points, has at every point a right and left hand tangent, has a well-defined 
tangent up to a countable number of corners and has no, cusps. 


2. Preliminary lemma. This section will be devoted to a general lemma 
on the parametrization of rectifiable curves which will be very useful in the 
sequel. This lemma may also be stated in such a way as to have, apparently, 
nothing to do with parametrization ; viz., f 


Lemma 1. Any continuous rectifiable curve C may be uniformly approxi- 
mated by simple polygons, whose lengths approximate that of C. 


Proof. The proof of this Lemma 1 is routine and will simply be outlined. 
First one chooses a polygon B, which approximates C. Then, if B, has 
multiple points which are not simple isolated double points one performs a 
slight deformation so as to obtain a polygon B. approximating B, and such 
` that all multiple points are simple isolated double points (and there are only 
a finite number of these). Now let the polygon B, be traced in a definite 
manner and suppose that at a given double point p the polygon B: actually’ 
crosses itself when traced in this manner. Then if the sense of tracing is 
reversed along that portion of B» which consists of a closed curve through p, 
B, will no longer cross itself at p. Then, evidently, the double point p may 
be “ pulled apart” without introducing any new double points. In this way 
the (finite number of) double points of B, may be removed and a simple 
polygon Bs found which approximates B, and therefore C. 

A restatement of Lemma 1 which explicitly introduces the parametric 
representation (1) of C will also be convenient. Firat 


DEFINITION 4. The paramelric representation (1) of C will be satd to 
be non-crossing tf the following condition ts satisfied: Let tı < tz be the 
parameters of any double point of C and let T be any simple closed curve 
containing this double point which meets the four branches of C corresponding 
tot<h,t >t, t<te,t >t. Let pi, pe, pa, Pa be the four points of T whose 
parameters are, respectively, the greatest, least, greatest, least value of t satis- 
fying these four inequalities and giving points on T. Then pı, pa do not 
separate Ps, pa on T. 


` 
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Clearly, if C has no double points, then any parametric representation is 
non-crossing. On the other hand Lemma 1 shows that 


Lemma 1bis. Any continuous rectifiable curve C has a parametric repre- 
sentation (1) which is non-crossing. 


Proof. In fact one simply chooses a sequence of simple polygons con- 
verging to C even in length, parametrizes each polygon by its arc length, and 
lets the limit of these parametrizations define a parametrization for C. Then 
it is very easily verified that Definition 4 is satisfied by this representation. 


ASSUMPTION À. In the sequel it will always be assumed that the para- 
metric representation 


(1) 0: gai), y=y(t); OStS1; 
satisfies Definition 4. 


3. Terminology and notation. Let C be an «ergodic curve (1) for M. 
Then 


DEFINITION 5. By the point [{]° of C will be meant the point with 
coërdinates x(t), y(t). 


DEFINITION 6. By the arc (s,u), (where s < u ts always understood) 
will be meant the set of all points [t] with s<t<u. If one or both of 
these parentheses is replaced by a square bracket, it is understood that equality 
ts allowed at the corresponding end or ends of this last inequality. 


DEFINITION 7. The arc (s,u) of C will be called non-significant if the 
point set consisting of the two arcs [0,s], [u,1] of C— (s,u) has the 
property (e) with respect to M. In the contrary case (s,u) will be called 
significant. 

DEFINITION 8. A point [t] of C will be called non-significant if some 
arc (s,u), with s< t< u, ts non-significant. In the contrary case,’ that 
every such (8, u) is significant, the point [t] will be called significant. 


DEFINITION 9. A point of M will be said to be salient to (s,u) tf it is 
at a distance Se from some point of (s,u) and at a distance > from every 
point of C— (s,u). Such a point will be denoted by p(s,u), and the 
totality of all such points by P(s,u). 


$ The letters s, t, will all be used as parameter values in the sequel but the generic 
parameter value will always be referred to as t. 

* Here, and throughout the paper, it is assumed that we are not dealing with.an 
end point 0 or 1. This assumption is made simply for simplicity of statement and the 
definitions and results are readily extended to include the case of end points. 


4 + 
, 
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Dainai 10. A Point. of À will be said to be salient to [t] if i ts 
at a distance e from [t] and at a distance =e from any potnt of C. Such a 
' point will be denoted by p(t), and the totality of all such points by P(t). 


DEFINITION 11. The circle of radius e about a point p(t) will be called 
a salient circle through [t] and denoted by S(t). 


DEFINITION 12. A significant point [t] will be called simply significant 
if there is exactly one salient circle S(t) through [t]. 


DEFINITION 18. A significant point [t] will be called doubly significant 
‘if there are exactly two salient circles 8,(t) and S2(t) through [t] and if 
these are mutually tangent. 


DEFINITION 14. A significant point [t] will be called multisignificant 
if there are at least two intersecting salient circles through [t]. 


Notice that a salient circle S(t) passes through [t] but has no point of c 
in its interior. It will be shown, in the next section, that there is at least 
one S(t) through every significant point [t] so that Definitions 12, 18, 14 
provide a classification of all significant points. Notice also that, according 
to Definition 10, the set P(t) of points salient to [t] is a closed set lying on 
the boundary of the e-circle about [t]. In view of hig one can make the 
following definition : 


DEFINITION 15. Let [A] be ated and let Se be the circle of radius 
c about [t]. Let A. be a! closed arc of B. of minimum length which contains 
the set P(t). Then the angular measure 8(t) of this arc +s called the salient 
angle for [t], and its endpoints are called the terminal salient points p(t) ; 
pa(t) for [t]. 


‘Notice that . 
(2a) O(t) — 0, if [t] is simply significant; 
(2b) O(t) =x, if [t] is doubly significant; 
(2c) 0 < 6(t) S~, if [t] is multisignificant. 


The last of these relations comes from the fact that if O(t) > =, then the set. 
of circles S(t) would completely cover some neighborhood of [¢] so that [t] 
would be an isolated point of C, contradicting the continuity of C. It will be 
shown in the next section that the equality sign in (2c) cannot hold. 


' 8 This are will not be unique if [t] is doubly. significant but its length and end 
points are, of course, determined. 
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DEFINITION 16. By K(p1,p2) and p(pi, pa), where pi and p, are points, 
will be understood the linear segment joining p, and po, and tls length, 
respectively. to 

4. Fundamental relations. This section will be devoted to a study of 
some of the fundamental relations between the concepts introduced in the 
preceding section. 

Lemma 2. Suppose the point [t] of O ts significant. Then there is at 
least one salient circle S(t) through [t]. 


Proof. Let (si, us) be a sequence of arcs such that 


(3a) Se L Sin LÉ <L tin < ti; l (i—1,2,- -+ -) 
and 
(3b) lim Si = lim Uy = À. 

400 400 


Then, by Definition 8, (s1, us) is significant for all i. According to Definition 7 
this means that the set P, which is defined. as the closure of the set of points 
P(s;, ui), is non-empty. In fact Definition 6 implies the existence of points 
of M which are at a distance > e from [0,s;] + [w,1]. These points must 
be at a distance Se from (si, w:) since C has the property (e) with respect 
to M and consequently must be points of P (s: ui) by Definition 9. 

It is clear from Definition 9 that P (s1, ui) D P (Sis, Win) in view of (3a). 
Thus 

(4) P, D Pus 1,2, +; Pi not empty. 


By a well-known theorem on closed sets (4) implies that there is a closed, 
Don- enpi set I = H(¢) such that 


(5) u(t) = lim Ë; = HP 


Let po be any point of I(t). Then, by (5) and the definition of Pi, Po 
is at a distance Se from (ss, wi) for allt. Thus, by (3b), po is at a distance 
Sc from [t]. On the other hand po is at a distance = e from any point of 
C— (Si, u) for every 1, and so at a distance Ze from C. Thus, by Defini- 
tion 10, po is a salient point p(t) to [t], and the e-cirele about po is a salient 
circle through [t]. This completes the proof of Lemma 2. 

Incidentally the following fact, which will be useful in the sequel, has 
been demonstrated during the course of the last proof. 


Lemurs 3 Let s<t<u. Then 
_ P(s,u) C P(t) 


su 
where P (s,u) ts the closure of P(s,u). 
7 ; 
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Adia this was proved above only in the case that [t] was a significant 
point, but the lemma is vacuously true if [t] is non-significant. 

Lemma 3 is quite restrictive in case [t] is simply or doubly significant; 
i.e. if P(t) consists of one’ or of two points, but in case [¢] is multisignificant 
a stronger result is needed and will be giyen next. 

Lemma 4. Let [t] be a mullisignificant point of O and let p,(t), ps(t) 
be the two terminal salient points to [t]. Let s<t<u Then, tf the 
notation is chosen appropriately 


lim P(s, t) C (p(t); lim P(t, u) C (pa(t)). 
Proof. First, by Definition 9, | 
P(s,t) + P(t,u) C P(s,u). 
Now, by Lemma 3, 
| lim P(s, à) C P(t) 


au 


so that, à fortiori, 
(6). ` lim P(s, t) + lim P(t, u) C P(t). 
ý at ut 


It will now be shown that no point p(t) different from p(t), po(t) can 
be a point of the left side of (6). To this end let ps(f) be a fixed point of 
P(t) which.is not terminal. The point p(t) is then an interior point of the 
arc À. mentioned in Definition 15. Let Sy be a circle with center at p(t) 
and radius y, where 7 is so small that S, does not cross either chord K([t], 
pi(t)), K([t], pe(t)). The circle Sy is divided by an arc of A, into two | 
parts, one lying within and one outside Se. The points of Sy within or on Se 
are all at a distance Se from [t] and so, according to Definition 9, cannot 
bélohg to P(s,t) + P(t,u) for any s<t<u. On the other hand, the 
points of Iy outside Se are obviously at a distance >e from any point of 
(s,u) for sufficiently small u — s in view of the fact that (s, u) cannot enter 
either circle 8:(#), S2(t) of radius e about pi(t), pa(t), respectively. Thus 
. no point of Sy is a point of P(s,t) + P(t, u) and pa(t) cannot be a point of 


lim P(s, t) + lim P(t, u). 
: at ut 
Thus (6) may be strengthened to 
lim B(s, t) + lim P(t, u) C (p1(t)) + (pa(#)). 


The separation of this last relation into the two separate inclusions 
required by Lemma 4 is accomplished very easily by recalling the Assumption A 
stating that the two arcs (s,¢) and (t, u) do not cross so that one is “ nearer ” 
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p.(t) and the other “nearer” pe(t). The details of the separation will not 
be given as they are readily supplied. — 
The next two lemmas are immediate consequences of the definitions of 
the concepts involved and are stated simply for reference. 
LEMMA 5. The set of significant points of C(e) form a closed subset 
of C(e). 
Lemma 6. For any t, 
lim P(u) C P(t). 
ut 


It should be mentioned that the limit here, as in Lemma 3 and Lemma 4, 
is the ordinary point set limit; i.e., the set of all limit points obtained using 
any sequence of u-values and any choice of particular points of P(u). Notice 
that actually P (u) = P (u) ; (i. e., P (u) is closed) but P(u) has been written 
for the sake of the analogy with Lemmas 3 and 4. 


Lemma % For any to, 


lim sup 0 (u) S 8(t). 
“to 


Proof. This is an easy consequence of Lemma 6. In fact if limsup6@(u) —0, 
there is nothing to prove. Then suppose lim sup 8 (u) —8 > sand let {wi}, 
t==1,2,---, be a sequence of ¢ values fae Hk ui —> to and 
(7) lim 6(t) =0 > 0. 

4-00 


Now let pı (u:i), po(us) be the terminal salient points for [u;:]. Then the 
points pı(t4) are an infinite set in a bounded closed region M and have at 
least one cluster point p, in Æ. Let {un,} be a subsequence of the {us} 
such that 


(8) | ee Pi (Uni) = Pre 
Then, in view of Definition 15, the relations (7) and (8) imply that 


(9) lim p2(tn,) = ps 


exists; and, further, that p, and po are two points on the «-circle about [él 
which are separated by an angle 4 on this circle. Since (8) and (9) imply 
that pı and pz are points of P(t), in view of Lemma 6, the proof of Lemma 7 
is complete. 

In general the inequality given by Lemma 7 cannot be replaced by 
equality; but the case when this is possible, namely when lim sup O(u) =r, 


deserves special mention in view of its later usefulness. In particular, 
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Lemma 8. Let [t] be a limit point of doubly significant points. Then 
either [t] ts doubly significant or multisignificant with 6(¢) = r. D 

As mentioned before, it will later be shown that the second alternative 
here cannot actually occur so this Lemma 8 will be strengthened. 


5. Local properties. This section. will be devoted to establishing a few 
local properties of C which will be needed later. 


Lemma 9. Let [to] be a non-significant point of C. Then if u—s is 
suffictently small, s < to < u, the arc (s, u) ts linear. 


Proof. According to Definition 8, some (s, u) is non-significant. Sup- 
pose this (s, u) is not linear. Then the curve obtained from C by replacing 
the are (s, u) by its chord K([s], [u]) is a shorter curve which, according to 
Definition 7, has the property («) with respect to A7. This contradicts the 
assumption that C was an e-ergodic curve for M. 


Lemma 10. Let [to] be a simply significant point of C and let L(to) 

be the line tangent to the unique salient circle S(to) at [to]. Then if u—s 

ts sufficiently small, s < to < u, the arc (s,u) lies in that closed half plane 
determined by L(t) which does not contain p(to). 


Proof. Suppose the statement is false; i. e., that there is a sequence of 
points [t] lying i in that open half plane determined by L(to) which contains 
p(t) and such that t; > to For the sake of definiteness let it be supposed 
that t < Lo. 

Let S* be the circle of radius $e about p(t). Then, by Lemma 3, values 
So, Uo may be chosen so that 


I P (so Uo) C S*, 80 < lo < to, : 
and, à fortiori, 


(10) P (8, to) C S*. 


Now about p(t.) draw a circle S(t, p) of radius e + p, where p > 0 is 
so small that S (to, p) intersects both arcs (so, to) and (to, %o).® Let sı be the 
greatest ¢ < to and u, the least £ > to such that [s,] and [wu] are on S(t, p). 
Then by the first paragraph of this proof there are points of (so, to) lying in 
that open half plane determined by L(t) which contains p(t.). Thus some 
half line K (to), terminated by [to] and lying in this same half plane, meets 
(51, to) in a point [t*] ~ [to]. Clearly K (to) may be chosen in such a way 
that it does not meet S*. It is supposed that this has been done and also that 


° This is possible unless one of (8), ty), (ty % 0) lies exactly along St ). In this 
case the argument which follows may be modified by choosing the notation so that 


(8, to) lies along S(t) and then -choosing p=0, u, = typ 8, =t* =). 
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[¢*] is the first point distinct from [to] where K (to) meets (s, to). (That 
there is such a first point is clear from the fact that K (to), near [40] ‘lies in 
#(t) and so does not meet C.) - 
Now, by (10), 
© P(t, i) C 8%, 


This means that the only points of # ‘at a distance Se from (t*, to) which 
are not also at a distance Sc from C — (t*, to) are points of S*. But, by. 
Assumption A, the arc (¢*, to), which does not cross K (to) by the definition 
of t*, cannot cross either of the two arcs comprising {sı tu) — (t*, to). 
This (¢*,t)) is separated from S* in the convex curve S (top) by the arc 
A (81, 1) defined by 


A (81,1) = (81, t*) (ET, [to]) + (to, w). 
Thus any point of S* at a distance Se from (t*, to) is also at a distance 
SS from A(s,,u,). Then the curve 3 ù 


C— (Si, t) + A(8, t) = C — (t*, to) + K(LEX], [éo]) 
is e-ergodic to M. Since this new curve is obviously shorter than C, this 


contradicts the assumption that C was an c-ergodic curve for M and completes | 
the proof of Lemma 10. 


Lemma 11. Let [to] be a doubly significant point of C. Then if u—s 
is sufficiently small, 8 < to < u, the arc (s,u) lies between two mutually 
tangent e-circles through [to]. 

Proof. This is trivial (the circles being the two salient circles S1(to), 
S:(t) assumed in Definition 13) and has been included for reference only. 


Lemma 12. Let [to] be a multisignificant point of O and let L,(to), 
La(to) be the tangents to the two terminal salient circles S(t), S2(to), 
respectively, at [to]. Then if u — s ts sufficiently small, 3 < to < u, the arc 
(8,4) is contained in that closed angle determined by L,(to), L(to) which 
contains no interior point of Si(to) or S2(tp).2° 


Proof. The proof of Lemma 12 precisely parallels that of Lemma 10 and 
will not be given. It should be noticed’ that Lemma 4 serves here the purpose 
of establishing a relation like (10), where, of course, S* will be a 4e-circle 
about pi(to) or Pa(ta) according to which arc is to be modified. — - 


40 In case 6(¢,) == this closed angle degenerates to à half line and is not uniquely 
determined by the condition that it contain no interior point of 8,(¢,) or &,(#,). 
However, since [t] is multisignificant, there must be, in this case, a third salient circle 
8, (#5) intersecting S, (t,) and 8,(¢,) (ef, definition 14). Then the half line is deter- 
mined by the condition that it does not eut B,(t,): age | 


384 RICHARD KERSHNER. 


6. Double points. This section will be devoted to the proof of 


THesoren 1. Let M be an arbitrary plane point set. Let «> 0 be fixed, 
and let O = C (e) be an e-ergodic curve for M. Then C has no double points. 


Proof. Suppose the statement is false; i.e., that there is a t,540 and 
8 ta £ 1," such that t, < t: while [t1] = [t:]. Then there are a number of 
cases to be considered which are not mutually exclusive but which together 
exhaust all possibilities. 


Case 1. The points sı < tı < ti, S2 < tz < Us can be chosen so that the 
four arcs (si, t), tou) (S2, a), (t2, u2) coincide (as point sets) in two 
identical pairs. Suppose that these four arcs have been extended so that they 
are as long as possible satisfying the required condition of coinciding in two 
pairs and such that each coincident pair have coincident end points. Then 
there are the following possibilities (not mutually exclusive). 


Case 1.1: s,==0. Then the curve obtained from C by deleting the arc 
[s1 4) = [0, 4) is “shorter” in the parametric sense but identical in a point 
set sense with C. This contradicts the assumption that C was an e-ergodic 
curve for M. 


Case 1.2: u—1. A contradiction is reached, as in Case 1.1, by 
deleting (to, uz] kanei (ta, 1]. 


Case 1.3: 8,540; wz£1; [s1] = [uw]. In this case [sı] — [we] is a 
double point which does not come under Case 1. For if [s,] = [w] came 
- under Case 1, then the given arcs [s,, t1] = [%2, ue] could be extended to 
longer point set identical arcs with coincident end points, contradicting the 
assumption that the given arcs were the longest such. Thus to exclude Case 
1.8 it will be sufficient to show there are no double points which are not in 
Case 1. 


Case 1.4: 8,540; [s]—T[s:]. This is treated exactly as is Case 1. 8. 
Case 1.5: s10; [sı] =— [u]. This is treated exactly as is Case 1. 3. 
The above five possibilities exhaust Case 1 in view of the assumption 


that the identical arcs had identical end points. 


Case 2. The point 8 < bı <u, 82 < lz <u: can be chosen so that 
some three of the four arcs (sı, t1), (His), (S2,t2), (la, Ue) are identical (as 
point sets). This case can be treated in essentially the same way as Case 1 
and the detailed treatment will not be given. Either a contradiction is reached 


11 Cf. footnote 7. Of course t, = 0 and ¢,= 1 is not considered as a double point 
but merely means that C is closed, which is trivially seen to be possible. 
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or one is led to the existence of a double point which is not in Case 1 or Case 2. 
Thus it will be sufficient to prove the impossibility of this last. 


Case 3. There are no salient circles through [tı] —[t.]. Then, by 
Lemma 2, both [¢,] and [t,] are non-significant; i.e., some arcs (s, ta), 
(Sz, Ue), with 8, < ti < th, 82 < te < ua are non-significant. Then, by Lemma 
9, these two arcs are linear. But by Assumption A these two segments through 
[t:] = [fe] are non-crossing. This is possible only if [¢,] = [t2] is in Case 1. 


Case 4. There is exactly one salient circle S through [t] = [te]. Let 
p be the center of S and L the tangent line to 8 at [t] —[t.]. Now let 
Si, Ui, t= 1, 2, be chosen so that 


(11a) 1 Li LU <L 82 < be LU; 
(11b) * P(aw) CS, (i— 1,2); 
(11e) (Si, wu) CH, (i—1, 2); 


where H denotes that closed half plane determined by L which does not con- 
tain p. The possibility of satisfying (11b) is assured by Lemma 3 while 
(11c) may be satisfied ini view of Lemma 10 if [t] is simply significant and 
in view of Lemma 9 if [t;] is non-significant. 

Now let a circle S(p) be drawn with center at p and with radius e + p 
where p > 0 is so small that S(p) intersects all four arcs, (si, ti), (ti; wad, 
i= 1,2. That such a p > 0 exists is clear from (11c). Finally, let 3'4, wi, 
respectively, be the greatest t < t; and the least t > t, such that [s’;] and 
[uj] are on S(p), i= 1,2. : 

Consider the are A(p) of S(p) which lies in H. The four points [si], 
[w] lie on A(p) in some linear order. (It is not excluded that certain, or 
even all, of these points coincide; in which case there will be a corresponding 
ambiguity in this linear order.) It will be unimportant in which sense this 
order is established so that the twenty-four permutations on four letters reduce 
to twelve cases that will be considered distinct. Of these twelve possibilities, 
four are eliminated, in view of Definition 4, by Assumption A. Then the 
remaining eight possibilities may be reduced, by making use of the fact that 
the above notation may be changed, by reversing the direction of the para- 
metrization along C, to one of the following two types: 


Case 4.1. The linear order is [s’1], {[s’2], [w2]}, [w]. 
Case 4.2. The linear order is {[s’,], [u’s]}, {[s’2], [w2]}. 


Here the curly brackets signify that it is of no consequence which of the 
two symbols contained occurs first; i. e., {p1, po} means either pı, pz OT Po, Pie 
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In Case 4. 1 the are [s’;, W] separates, in a weak sense, the arc [ss w2] 
from S in S(p).* In particular, every point of 9 is as close to some point 
of [s’1, w] as to any point of [s’2, w2]. Thus there are no points of P (3a, wa) 
in 8. In view of (11b) this means that P (s's, w2) is empty; i. e., that (s’2, w2) 
is non-significant. Then by Lemma 9, (s's, u’,) is linear. But, by (11c) and 
the fact that [s’,, w] separates [s’2, Wa] from S, this implies that also [s:, w] 
is linear and these two arcs [s1, u’;] are both segments of L. Thus Case 4.1 
reduces to Case 1. 

Case 4.2 is somewhat more troublesome. Let it be assumed, for the sake 
of definiteness, that the order is actually [s’,], [w’:], [s’2], [w2]. It will be 
clear that this is no restriction since the proof will not refer to the nature of 
C outside S(p).% Now let L be chosen as the Y-axis of a Cartesian (X, Y) 
plane with origin at the point [t4] =-[t.]. Then either [s’,] and [w] are 
both in the closed upper half plane or [s’2], [w] are both in the closed lower 
half plane (provided the positive Y direction is chosen appropriately). It will. 
be a notational assumption that the first of these alternatives is true. It will 
also be assumed that not both [s’,] and [w] are on the X-axis.* 

Now consider the ellipse with foci at [s] and [w’1] and passing through 
[ti] — [fo]. It is easily seen that this ellipse has, at the point [¢,] = [te], 
a negative slope; i. e., that a point p* may be chosen on S, in the open second 
quadrant and in this ellipse. It is supposed that p* is chosen so that the 
principal are S(p*, [t,]) of 8, joining p* and [tı] has an angular measure 
< fr. The fact that p* is in the given ellipse means that 


PCs], p*) + p(p*, La) < eE] [4]) + elh] [w]. 
(cf. Definition 16). Now let {* be the greatest t < t, such that [¢*] is on 
the chord K (p*, [s’,]) joining p* and [s,]. Then it is immediately seen that 


PCT, p*) + o(p* LT) <e] CUT) + (fh), w]. 
Thus it is seen that the curve C* which results from C by replacing the arc 
(é*, wz) by the two chords K([i*}, p*) + K(p*, [w1]) is shorter than C. 
It will now be shown that this C* has the property (e) with respect to 
M so that C is not an «ergodic curve for M. This contradiction will complete 


1]n case [8] = [s], [w] = [w] or [s,]= [w], [9’,] = [w], this state- 
ment may be understood as a notational assumption, in view of assumption A. 

-18 Tt is easily seen, by an argument similar to that used in Case 4.1, that the arcs 
(tp w,) and (3, t) are non-significant and therefore linear, but this fact will not 
be needed. , 

1t This is clearly justified unless all four points s’,, w, coincide on the X-axis. 
If this occurs one simply chooses a smaller value of p in defining S(p). If the same 
trouble occurs for all small p > 0, then [t] = [¢,] is actually in Case 1 which has 
been treated. 
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the elimination of Case 4.2. To this end note first that the arc (¢*, wx) is 
separated from § in S(p) by the arc A(S, w2) defined by 
A(s/x, Wa) == [sn 7] + K([t*], p*) + 8(p*, [h]) + [te v2] 

(where S(p*, [4]) is the principal are of S joining p* and [t,] as mentioned 
above). This is true since (¢*,w’,) cannot cross [s’1,é*] or [ta wa] by 
Assumption A, K([t*], p*), by the definition of #*, or S(p*, [t1]) by (11c). 

Now suppose C* has not the property («) with respect to M. Then some 
point p of M at a distance Sc from € would be at a distance > e from C*. 
Since C* contains all of C save (¢*,u,), this point p, in particular, would 
have to be salient to (4*,w:). Then, by (11b), CS. Then the e-ircle 
about the point p CS has an interior or boundary point on (t*, w1) but 
does not cross C* and in particular does not cross A(s’x, We) — S(p*, [t:]). 
Thus, according to the preceding paragraph, this e-circle about p must cross 
_ S(p*, [t:]) twice. But this is impossible for p CS since S(p*, [t:]) has 
been assumed to have an angular measure < $r. According to the preceding 
paragraph the treatment of Case 4. 2 (and consequently of Case 4) is complete. 


Case 5. There are at least two intersecting salient circles through 
[hi] = [te]. In this case clearly both [t] and [t1] are multisignificant and 
6(t:) = 6(t2). Here two cases are distinguished of which one is trivial. 


Case 5.1: 6 (ty) == 0(t:) ==. This case reduces immediately to Case 1 
in view of Lemma 12 (cf. particularly footnote 9). 


Case 5.2: 6(t,) —6(t:) <m. The treatment of Case 5. 2 closely paral- 
lels that of Case 4 and will not be given. It is enough to note that Lemma 4 
serves here the purpose served by Lemma 3 in Case 4 while Lemma 12 replaces 
Lemma 10. Of course, in view of Lemma 4, the four arcs (si, ti), (ti, t4) 
must always be considered separately, but the only difficulty introduced by 
this is a notational one. 


Case 6. There are exactly two, mutually tangent, salient circles through 
[ti] [é]. Let pi, p2 be the centers of these two salient circles Sı, Sa and 
let S*,, S*2 be the circles of radius 4e about pı, po, respectively. Let si, wi, 
i— 1,2, be chosen so that 


(12a) Sı [Lti < th < Sr < te < t; 
(12b) Psy) + P (se ua) C 84 + 5%; 


as is possible by Lemma 3. 
Let Si(p:), t= 1,2, be a circle of center p; and radius e + p; where 
pi > 0 is so small that S;(p:) meets all four arcs (Sı, t1), (ti th), (82, fe), 
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(ta, U2). Let s4j, Uiz, respectively, be the greatest t < t, and the least 
{> ts, such that [sı] and [wij] are on Sj(p;); j= 1,2; += 1,2. Finally, let 
sy = MAX Sij Wi == MID wij. 
j=1,2 4=1,2 

© Now let the points p,, pz, respectively, be the points (0,e), (0, — €) of a 
Cartesian (X,Y) plane. Suppose that the point [tı] = [t2] is not in Case 1 
or Case 2. Then there are two cases to be distinguished. 


.Case 6.1. Two of the arcs (Sa, ti), (ti, ws) lie in the right half plane 
and two in the left half plane. Since [t,] = [tz] is not in Case 1, these four 
arcs do not all lie on the -axis. Choose the quadrant which contains a point 
of one of these arcs as the first quadrant by making the appropriate changes 
of notation. Let K be a half line terminated by [é,] = [t+] and lying in the 
first quadrant. Then, clearly, K can be chosen so near the positive X-axis 
that it does not meet S*, or S*, but does meet one of-the four ares (si, ti), 
(ti, uw). Let 4 be the first point ~ [tı] where K meets one of these four 
arcs. (That there is such a first point is clear from the fact that near [4], 
K lies in one of the salient circles S:, Sa and so does not meet C.) For the 
sake of definiteness let it be assumed that s^, < t* < tı and that [s’s, t’2] is 
the other arc in the right half plane. It is clear from the definition of t*, 
together with Assumption A, that [s’2, t2] is “below” [s, t1]. Assume also 
that S*, is in the upper half plane. 

Then there are no points of S*. salient to (t*,t:). In fact, the entire 
arc (82, l1) is separated by (S22, t2) from S*, in the convex curve consisting 
of that part of Ss(p2) lying below and to the right of that tangent line to S*. 
through [tı] = [t2] which has positive slope. Thus, by (12b), all points 
P (t*, to) lie in S*. 

Now it may be shown, by an argument exactly like that used in the proof 
of Lemma 10, that the curve obtained from C by replacing the are (t*, tı) 
by the chord K([t*], [{]) is a shorter curve with the property (e). This 
contradiction completes the treatment of Case 6.1. 


Case 6,2. At least three of the arcs (Sa, ti), (ti, W4) le in the same 
half plane determined by the Y-axis. In this case choose the half plane which 
contains at least three of these arcs as the right half plane. Since [t] = [t2] 
is not in Case 2, these three arcs do not lie on the X-axis. Choose the quadrant 
which contains a point of one of these arcs as the first quadrant. Then all 


15 Again this is possible except in the case that some one of these four arcs lies 
on the boundary of 8,(0) =8,. The necessary modification in this case is trivial. 
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the essential features of Case 6.1 are obtained and the treatment proceeds in 
the same way by simply neglecting the extraneous arc or arcs. 
This completes the proof of Theorem 1. 


7. Corollaries. Theorem 1 allows several previous results to be strength- 
ened. In connection with the inequality (2c) for 6(¢) we have 


LEMMA 13. 6(¢) =x if and only if [t] is doubly significant. 


Proof. If 6(¢) =~ for a multisignificant point [t], then, by Lemma 12 
(cf. footnote 9), C has a double point. But this is impossible by Theorem 1. 
Then Lemma 8 can be stated more simply as 


Lemma 8bis. The set of doubly significant points of C is closed. 


8. Local properties resumed. In this section the discussion of the 
nature of C in the neighborhood of the various types of points will be resumed 
and further results obtained which are more conveniently proved with the help 
of Theorem 1. The first of these results is complementary to Lemma 11 and ` 
supplementary to Lemmas 10 and 12. 


Leuma 14 Let [to] be not doubly significant. Then some arc (s,u) 
with s < to < u ts convex toward P (to) 


Proof. If [to] is non-significant, this is so by Lemma 9. 
Suppose first, then, that [¢)] is simply significant and let sı, th be 
chosen so that 


(13a) Sı < to < ths, 
(13b) P(si, M) C S(t). 


The possibility of satisfying (18b) follows from Lemma 6. From (18b) it 
follows, à fortiori, that 


(14) P(t ta) CR). Si Sy 


Now let S(p) denote a circle of radius e + p about p(to), where p > 0 
is so small that S(p) cuts both ares (s1, to), (to, t1) and let s, u, respectively, 
be the greatest £ < to and the least t > to such that [s] and [u] are on S(p). 
(The existence of such a p > 0 is obvious from Lemma 10.) Let A(s, u) 
be the principal are of S(p) joining [s] and [u]. 

Then the closed curve T defined by 


T= [s,u] + A(s,u) 


10 An arc is said to be convex if it can be made an arc of a convex curve. A non 
linear are is said to be convex toward a given set if the given set necessarily lies outside 
any such convex curve. A linear segment is said to be convex toward a given set if the 
set lies on one side of the linear extension of the segment. 
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is simple. In fact [s, u] cannot touch itself by Theorem 1 and cannot touch 
A(s,u) (except at [s] and [w]) by the definition of s and u. It will now 
be shown that the simple closed curve T is convex. Suppose this is not the 
case ; i. e., that some chord K([t1], [t2]) lies outside T, where s S t < t S u. 
Then the arc consisting of 


[s, u] — (t; te) + K( [tH], [t2]) 
separates (tı, t2) from S(t.) in S(p). But, by a now familiar argument, 
this contradicts (14). The fact that the convex arc (s, u) is convex toward 
p(to) is obvious from Lemma 10. 
The only case remaining to be considered is that [to] is multisignificant. 
In this case (13a) is replaced by a choice of sı, u, such that 


81 to < th, 
P (sı, to) C Si(to), P (to, t) C S2 (to) 


(where 8, (to), 82(to) are, of course, the terminal salient circles through [to]), 
by using Lemma 4. Then a double repetition of the preceding argument, 
somewhat modified of course, shows that some (s, to) is convex toward p; (to) 
and some (4,4%) is convex toward po(to). These facts, together with Lemma 
12, are easily seen to imply the convexity of (s, u) toward P (to). 

It should be remarked that this Lemma 14 is not simply a consequence 
of the results of Lemma 8 bis, Lemma 10, Theorem 1 (as might be suspected) 
in view of the possible existence of linear segments converging to [to]. In 
fact it is quite easy to construct an example of a simple curve with the local 
half plane or “local supporting line” property of Lemma 10 at every point 
but which is not locally convex at some point. However, this can only be done 
by the introduction of linear segments. The a aa Lemmas 14 and 
11 lead immediately to 


THEOREM 2. - Let M be an arbitrary bounded plane point set. Let «> 0 
be fixed and let C be an <-ergodic curve for M. Then at any point [t] of O, 
0<t <1, there is a right and left hand tangent to C. 


Proof. If [to] is not doubly significant, then, in view of Lemma 14, 
this follows from a well known theorem on convex curves. If [to] is doubly 
significant, this is trivial in view of Lemma 11. 

The difficulty (of linear segments) mentioned above in connection with 
the extension from the local supporting line to local convexity also prevents 
the extension from local convexity to convexity in the large. In the case at 
hand, however, the extension from local convexity to a kind of convexity in 
the large can be made without eliminating all linear segments—it is enough 
to eliminate those linear segments which are non-significant. 


` ‘ P ‘ Á 
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Lemma 15. Let (s,u) be an arc of O containing no non-significant or 
doubly significant points. Then (s,u) consists of.a finite number of conver 
ares. o 


` Proof. Let $(t) be the inclination which the left hand tangent to C at 
[t] has with the z-axis. In view of Lemma 14 the indetermination (+ 2kr) 
of (t) may be determined so that $(¢) is monotone (in the weak sense) in 
some neighborhood of each ¢ for which [#] is not doubly significant. At the 
same time ¢(¢) may be supposed bounded. To see this it is enough to notice 
that C cannot spirally converge to a point (remembering that C is simple and 
of finite length). But this is easily seen since if spiral convergence occurred 
the points of C near the limit point would be non-significant and hence C 
would be linear near this point which is a contradiction. It is now supposed 
that some well determined ¢(¢) is chosen which is bounded and weakly mono- 
tone in some neighborhood of every ¢ where [¢] is not doubly significant. 
Now let (s,u) contain no non-significant or doubly significant points so 
that #(¢) is weakly monotone in some neighborhood of every ¢ for 8 < t< u. 
It will be shown now that ({) is weakly monotone in the entire interval 
8<t<u, For suppose this is not so. Then there is some sub-interval 
8, Stu, of s<t<u such that the point tı where (t) attains its 
maximum * (or its minimum) over the interval s, & t S u, is an interior 
point s, < tı < uù. For the sake of definiteness consider the case of a maxi- 
mum; i. e., suppose that 4 


(15a) SLALU LU; 
(15b) h) 2o(t), aStSu. 


But (15b) contradicts the local monotony of p(t) unless the equality sign 
holds in (15b) near ¢,; at least on one side of #.. Thus there are values so, ts 
such that * i 

(16a) 81 L 82 S h EU < th, Sa Ua; 


(16b) pt) —=p(t), ss << Ua 


Without loss of generality, let it be assumed that the segment (Ss, uz) 
— K([s:], [w]) lies along the z-axis and that $(t,) —0. By the local 
monotony of p(t) at s:. and us, in connection with (16b) and footnote 18, 
values 83,3 may be chosen so that Ss < $3, Uz < us and: 


17 The fact that (t) actually attains its maximum (minimum) for any guch closed 
interval is a trivial consequence of the local monotony of (t). 

18 It is supposed 8,, t, are chosen so that u,-s, is as large as possible. The equation 
of (16b) may or may P hot hold with ta, or t ih, 
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(17a) > p(t) <0, ` 88 LÉ sz; 

(17) HD <O m<i<u 

Où the other hand, by Lemma 14, these values Ss; us may also be chosen so that 
(18a) (ss, 82) is convex toward P (82) ; 

(18b) (te, Us) is convex toward P(uz). 


Now, by assumption, all points [t] for s < t< us are either simply 
significant or multisignificant. But this last alternative is seen to be impossible 
if Lemma 12 is compared with (16b). ‘Thus 


P(t)—=(p(t)), a< t< ih. 


The set of all points {p(t)}, for ss < t< ts, is, in view of Lemma 6 and 
Definitions 10 and 12, a linear segment parallel to (sz, uz) and lying e units 
above or € units below (s2,us). By comparing (17a) and (18a), it is seen, 
in view of Lemma 6, that this {p(t)} lies below (s2, uz). However (17b) 
and (18b) show, in the same way, that {p(t)} lies above (s2, us). This con- 
tradiction ‘completes the proof of the fact that ¢(¢) is weakly monotone in 
the entire interval s < t < u. 

To complete.the proof of Lemma 15, it is enough to divide (s, u) into 
sub-ares such that the variation of (t) over any sub-arc is <~. By the 
monotony and boundedness of ¢(¢) the number of such sub-arcs is finite. 
That each such sub-arc is convex is then trivial. 

As usual a point where the right and left hand tangents differ will be 
called a corner. In particular, every multisignificant point is a corner. On 
the other hand a non-significant point or a doubly significant point cannot be 
a corner. À simply significant point may or may not be a corner as trivial 
examples show. With regard to corners, in addition to these remarks, there 
will now be shown: 


. THEOREM 3. With the notations of Theorem 2, there are only a countable 
number of corners on C. 


Proof. Let Qn, Qa denote, respectively, the set of non-significant, doubly 
significant points of C. The set ©, is, by Lemma 5, an open subset of C 80 
that ©, is of the form 


oo 
Qn= F (84, i). 
41 


But, by Lemma 9, no point of Q» is a corner. Thus the set 


ERGODIC OURVES AND THE ERGODIO FÜNOTION. 343 


ë r [se] A 
i 7 On == 3 [84, w] 7s 
41 . 4 
contains at most a countable number of corners. 
Now the set Qa, which is closed by Lemma 8 bis, can contain no corners 
in view of Lemma 11. Thus to prove this Theorem 3, it is enough to show 
that there are at most a countable number of corners among the points of 


Q=C—Qa— Qn ere 
But this Q is an open subset of C;i.e., 


g= S (via. 
en 
Also (ss, u’;) consists of a finite number of convex arcs, by Lemma 15. Thus 
2 . 
Q = 3 (s, w”). 
41 


where (s;”, u”) is convex. But it is a standard theorem that a convex arc 
` contains only a countable number of corners. This completes the proof of. 
Theorem 3. 
As usual, by a cusp will be meant a point where there is a well defined , 
tangent line but where the curve does not- “Cross the formal, With regard to 
these it is easy to prove 


THEOREM 4. With the notations of Theorem 2, the curve O has no cusps. 


Proof. In view of Lemma 14, a point [t] which is not doubly significant 
cannot be a cusp of C, since a convex curve can have no cusps. Thus it is 
sufficient ‘to show that a doubly significant point cannot be a cusp. This can 
easily be done by an argument exactly like that which showed that 6(¢) == 
was impossible for a multisignificant point. 


9. The ergodic function. In this section, contrasting with the pre- 
ceding, C will be used to denote any continuous rectifiable plane Jordan curve 
of length L while an e-ergodic curve for M will be denoted by C(e) and its 
length by A(e). 

Let D(C;e) denote the set of all points in the plane at a dites: Se 
from some point of C. Thus D(C;e) is the domain swept out by a circle of 
radius e whose center traverses O. Then D(C;e) is measurable (in fact 
closed) and 

meas D(C; e) = RL + ré. 


This Sénat is given by Errera ® for simple Jordan arcs, but ile t 


1° Loc. oit., 2. 
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his proof, nd use is made of the fact that the arc is simple. In fact the 
inequality is strengthened if C is not simple. 
The fact that C has the property (e) with respect to H is clearly 


expressed by 

(19) D(C;e) DH. 

But since D(C;e) is closed, (19) may be strengthened to 
(20) D(0;) DM 


where M is the closure of M. Then by (19) and (20) and the fact that 
L=A(e) for C = C (e) 
(21) meas M S 2A (e) + re. 
Now, immediately, we have 

Lemma 16. For an arbitrary di set M 


i lim: inf 2eA (€) Z meas Ñ. 


Next it is to be shown that lim sup oe (e) S meas #. In this direction. 
we prove first 


Lemma 17. Let R be a rectangle of perimeter P. Then 
ReA(e) = meas R + 2Pe + 166. 


Proof. Let the rectangle be placed on a Cartesian codrdinate system with 
its, vertices at (0,0), (a, 0), (0,5), (a,b). Then let a curve CC, be 
traced in the following way: First trace the horizontal segment (0,0) to 
(a, 0), then the semicircle of radius e which has the segment (a, 0) to (a, 2e). 
as diameter and which lies outside R, then the segment (a, 2e) to (0, 2e), 
then the semicircle of radius e which has the segment (0, 2«) to (0, 4e) as 
diameter and which lies outside K, then the segment (0, 4e) to (a, 4e), ete. 
Continue in this manner until a horizontal segment has been drawn which lies 
above À. It may be easily verified that for the curve Ce 80 constructed, the 
inequality (21) becomes an equality; i. e., we have | 


. (22) ! meas D (Ce; e) = 2eL(e) + rê 
where L(e) is the length of Ce. It is equally obvious that 
(23) RC D(C.3«) C R* 


where R* is the rectangle with vertices (— 2e, — 2e), (a + 2e, — 2e), 
(— 2e, b + 2e), (a+ 2e, b + 2e). The first inclusion (23) shows that Ce 
is «ergodic to D so that : a 

(24) | AG SL) | 


and the second inclusion (23) gives 
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(25) meas D (Ce;e) < meas À + 2Pe + 16e. 


Combination of (22), (24), and (25) gives the inequality of Lemma 17. 
We are now ready to demonstrate 


LEMMA 18. For an arbttrary bounded set M 
lim 2eA(e) S meas À. 
e0 


Proof. Lety > 0 be chosen arbitrarily. Then, since Æ is closed, there 
exists a finite set of rectangles, Ri, R2,- © <, Ra, such that — 


(26) 3k, D MDM 
421 

and 

(27) meas X R; = meas H + y. 
4=1 


In each of the rectangles R, consider an e-ergodic curve Cj == C; (e) of length 
Au(e). Let Cy be oriented so that it is possible to speak of the beginning 
point of C4 and the end point of C;. Finally consider the curve C = Oe 
consisting of the n curves C; and the n — 1 linear segments joining the end 
point of C4 to the beginning point of Ci,.(¢—=1,2,---,n—1). Then C 
is clearly e-ergodic to $ Ri and, by (26), à fortiori, to M. Thus 

4=1 
(28) A(e) S Lle) 
where A(e) is the ergodic function for M and L(e) is the length of Ce. On 
the other hand 


(29) L(e) SZA) + (n—1)D 


n 

where D is the maximum distance between two points of 3 R;. Applying 
4-1 

Lemma 17 to each A; we have from (29) 


(30) 2eL(e) S mens $ Ri + 2e $ Py + 16n€ + 2(n— 1) eD 
4=1 421 


where P; is the perimeter of Ry. Combining (27), (28), and (30) we have 
at once 
(31) lim sup 2eA(e) = meas # + 7. 

sh 7 


Since (31) holds for an arbitrary 7 > 0, the proof of Lemma 18 is complete. 
Lemma 16 and Lemma 18 together imply 
THEOREM 5. For an arbitrary bounded set M 
lim 2eA(e) == meas Ñ. 
€ 
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REMARKS ON A SPECIAL CLASS OF ALGEBRAS.* 
By 0. F. G. SCHILLING. 


It was shown by Hasse and Witt that the structure of normal simple 
algebras over algebraic numberfields and certain fields of algebraic functions 
can be described in terms of the arithmetic of the underlying groundfield.* 
In this note we discuss algebras over function fields of one variable whose 
coefficient fields are fields which have only cyclic extensions. It turns out 
that quite a few of the results of the afore-mentioned theories are still true 
under our assumptions, e.g. the theorem concerning the sum of the local 
invariants of an algebra. However, the step from the theory of algebras to 
class field theory can no more be made. Our results throw some light on the 
axiomatic treatment of the class field theory in the large. They clearly 
indicate that the validity of the norm theorem does not imply the law of 
reciprocity. The reason for this deviation from the classical theory can be 
found in the fact that the Takagi group of a cyclic extension is in general 
a proper subgroup of .a suitably defined Artin group. 


1. Structure of the groundfield. Let T be a field which has only 
cyclic extensions. We shall suppose that for every integer n there exists at 
least one cyclic extension T» of degree n over T. The Galois theory then 
immediately implies that the extensions T, are unique, i.e. for every integer 
n there exists exactly one field Ta. We now want to investigate the structure 
of the field T. 


Lemma 1. The field T ts either an absolutely algebraic field of char- 
acteristic x 54 © whose Steinttz number has no infinite component or tt is 


relatively complete with respect to a non-trivial valuation V. 
Proof. ‘We distinguish two cases 
i) T admits no valuation but the trivial one, 
ii) T has non-trivial valuations V. 


* Received August 21, 1939. 

1H. Hasse, “Theorie der relativ-zyklischen algebraischen Funktionenkbrper, ins- 
besondere bei endlichem Konstantenkérper,” Journ. f. d. r. u. a. Math., vol. 172 (1935), 
pp. 37-64; E. Witt, “ Riemann-Rochscher Satz und Z-Funktion im Hyperkomplexen,” 
Mathematische Annalen, vol. 110 (1936), pp. 12-28. These papers will be referred to 
as H and W, respectively. | 
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In the first case 7’ necessarily is a field of characteristic x +4 oF Moreover, 
T must be absolutely algebraic over its prime field Ty for otherwise we could. 
construct non-trivial valuations by means of a transcendence basis? Let 
Tx < TM C++ THM <-+ ++ Tome ST be an approximating tower of 
T over Tx. Then the formal least common multiple of the degrees [T(# : Tx] 
is called the Steinitz number of T. The assumption that T have for every 
integer n exactly one (cyclic) extension 7, implies then that the Steinitz 
number has no infinite component,’ 

In the second case the field 7 admits at least one non-trivial valuation V. 
If the value group of V is well-ordered in a suitable fashion then it can be : 
shown that V is the composite of a rank 1 valuation V and a valuation V’ 
of the residue class field of V.‘ Let 7’ be the complete closure of the field T 
with respect to the valuation P. In order to prove that the field T is relatively 
complete with respect to the valuation V it suffices to show that. T contains 
no other elements algebraic over 7 but the elements of T.S In other words, 
we must prove that T is the universal decomposition field (with respect to V) 
of its algebraic closure. Let f,(z)=2"+azriL...+a,=0 be an 
irreducible equation of degree n with coefficients in Ÿ. We associate with f, (x) 
a polynomial gn(z) = z" + bai. -+ ba with coefficients in T such 
that V(a,—6;) > M > 0, where M is sufficiently large. It can be shown 
that gn(z) == 0 is irreducible in T and that its roots generate the same field 
over T as the roots of fn(2) —0.° ‘Since gs (x) = 0 is also irreducible in T 
its roots generate the cyclic extension Ta of degree n over T. Hence the field 
generated by fn(x) 0 is given as TT x, i.e. it is cyclic and has relative : 
degree n. Thus T is relatively complete with respect to the valuation Ÿ. 
If the valuation V is discrete (i.e., if its value group is isomorphic with the 
additive group of all integers), then T —T by a theorem of F. K. Schmidt.’ 

In general, we can prove that T is relatively complete with respect to 
exactly one rank 1 valuation. Namely SHE One that T is ose complete 


4 A. Ostrowski, “ Untersuchungen zur arithmetischen Theorie der Körper,” Mathe- 
matische Zeitsohrift, vol. 39 (1934), pp. 269-404. 

*M. Moriya and O. F. G. Schilling, “ Zur Klassenkérpertheorie über unendlichen 
perfekten Körpern,” Journal of the Fao. of Science Hokkaido Imperial University, 
Ser. I, vol. 5 (1937), pp. 189-205. 

+W. Krull, “ Allgemeine Bewertungstheorie,” Journ. f. d. r. u. a. Math., vol. 167 
(1931), pp. 160-196. 

5 A. Ostrowski, loo. oit. . 

#0. F. G. Schilling, “A generalization of local class field theory,” American 
Journal of Mathematics, vol. 60 (1938), pp. 667-704. 

TF, K. Schmidt, “ Mehrfach perfekte Körper,” Mathematische Amitis Kel: 108 
(1933), pp. 1-25. 
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with respect to another rank 1 valuation V;. Then we can construct irre- 
ducible equations h(t) —0 with coefficients in T which have prescribed 
characters of decomposition with respect to V and V;.* Repeating the 
preceding argument we immediately see that the existence of a valuation V, 
with the asserted properties leads to a contradiction to the assumptions on 
the field T. 


Remark. For the actual construction of fields T see a paper of the 
author on formal power series of several variables.” 


DEFINITION. A field T is said to be quast-algebraically closed if it is 
never the center of proper division algebras of finite rank.1° 


THEOREM 1. A field T which has only cyclic extensions ts quasi- 
algebraically closed. l 


Proof. First, our hypothesis implies that the field T is algebraically 
perfect. Namely T is supposed to possess only cyclic extensions. Now it 
follows immediately, by a theorem of Albert, that T never is the center of a 
proper division algebra of degree x", xÆ œ. Thus it remains to discuss 


algebras A whose degrees n = IT pat are relatively prime to the characteristic. 
ti 


The structure theory of algebras yields that 4 — D, X - - + X D, where the 
algebras D, are normal division algebras of degrees p;™ over T, respectively; 
0<b;a,. We want to show Di—T. Let p==p; Since T has, by 
hypothesis, only cyclic extensions, it follows that D, is split by a field TA of 
degree pf, j Sbi Let T < TU) <o TUD LTO C++ < TY be the 
chain of cyclic subfields of T/T such that [7:74] =— p. We shall 
prove by induction that D; X TU) ~ TU) implies D; ~T. Suppose that we 
already proved Di X TO TO, Then D, X TED ~ (TO /TAN arm), 
ar 0 in TU), If arı is a p-th power nothing has to be proved. So let 
a7. 5£C?;,. Suppose that TU contains the p-th roots of unity. Then 
PO = TU) (51/7). Consequently, 


(LO /TID, a) ~ (LEY (atle) /TOD, bra), 


B., L. van der Waerden, Moderne Algebra, vol. I (Berlin, 1937), 2nd edition, 
pp. 201-202. | 

0. F. G. Schilling, “Arithmetic in fields of formal power series in several 
variables,” Annals of Mathematics, vol. 38 (1937), pp. 551-576. 

40. F. G. Schilling, “ The structure of local class field theory,” American Journal 
of Mathematics, vol. 60 (1938), pp. 76-100. 

A. À. Albert, “Normal division algebras of degree p* over fields of characteristic 
p,” Transaotions of the American Mathematical Society, vol. 39 (1936), pp. 183-188. 
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This similarity implies that arı = bradPpa, ie. Di X TUM ~ TUV, The 
preceding argument can also be applied for p == 2 for our assumptions exclude 
that T is a totally real field. Suppose next that TU-1 does not contain 
the p-th roots of unity (4, A==m1,: - -,p. ‘Consider then the algebra 
D, X TO) X TU) (g) over TY) (¢) as groundfield. Since [TE (£):T] 
= p— 1, it follows that TU) (£) is a splitting field of the extended algebra. 
As before we conclude that TU-N(£) is a splitting field of D; X TO (E). 
But this is impossible if D, X TU-N p Uw, 


2. Foundations of local class field theory of discrete complete fields. 

Let © be a field which is complete with respect to a rank 1 valuation p 
and has the field T as residue class field. Since Hensel’s Lemma holds for C 
it follows that the unramified extensions C, of degree n over C are in (1 —1)- 
correspondence with the extensions T, of T. Consequently the generating 
automorphisms F, of the various Galois groups G(C;|C) can be selected such 
that they induce the generating automorphisms of the Galois groups G(T, | T). 
Let Coo denote the maximal unramified extension of C. The Galois group 
G(Coo|C) is an ideal cyclic group. Selecting once and for all an element F 
in G(Co|C) we observe that the infinite cyclic group {F*, À running over 
the additive group of all integers} is everywhere dense in G(Cw|C). Con- 
sequently the element F induces for every n a generating automorphism Fy 
of G(C,|C).° Having fixed the automorphism F, the substitutions F, have 
the same algebraic properties as the Frobenius automorphisms of the classical 
ramification theory. 

In order to derive the local class field theory relative to the field C it is 
sufficient to prove the following lemma. 


Leama 2. All units of C are norms of units in Cn. 


Proof. Let u be an arbitrary unit of C. Then, by Theorem 1, its residue 
class u mod p in T is the norm of an element R of T, i.e. u==NR (mod p). 
Thus we have a first p-adic approximation of u as a norm of a unit VR (modp) 
in C. Since u(NU)-! = 1 (mod p), the customary procedure of p-adic 
approximation yields that u(NU)-! = NH, where H=1 (mod p).¥8 

The usual arguments of local class field theory imply that Lemma 2 yields 
the following theorem. 


THEOREM 2. Every normal simple algebra A over C is similar to a 


120. F. G. Schilling, “Regular normal extensions over complete fields,” To appear 
in the Annals of Mathematics. 

13 E. Witt, “ Schiefkôürper über diskret bewerteten Körpern,” Journ. f. d. T. u. a. 
Afath., vol. 176 (1937), pp. 153-156. 
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cyclic algebra (Cn/C, Fn, +)” where m denotes a fixed prime element of the 
valuation pi ` 


As in the classical theory we define the residue »/n mod 1 as the invariant 
of the algebra A. Having selected F and m every care A is aan 
determined in its class by its invariant. 


3. Algebras in the large. Let k be an arbitrary function field of one 
variable with coefficients in the field T. Now let A be an arbitrary normal 
simple algebra over k as groundfield. Since & is a function field of one variable 
it follows, by a theorem of Tsen, that the algebraically closed field To of T 
suffices as a splitting field of A when adjoined to k.” Hence a suitable finite 
extension kT, of k already splits the given algebra A, 


AX Tak ~ Tok. 


Thus, Le (Tik/k, Fa, a), a0 in k. 

We next want to determine the local invarianis r(p) of the algebra Ajke 10, 
These characters r(p) of A are a a determined. by virtue of Theorem 2. 
First let us observe that 


(kTa/k, Fua) ~ (kTn/k, Fn, ab) for any b 40 in T.’ 
Namely, (Ta/k, Fa, b) ~ (Ta/T, Fn, b) Xk~k. Consequently PET 
ture of A depends only on the divisor (a) -Ú past. Since À ~ CE F,,a), 


we get 
À; om (Tu p/ Ky; F,4, a°) 


where d == (n,f(p)) and e == f(p}d"t. Hére f(r) denotes the absolute degree 
of the prime divisor p. As a consequence of Lemma 2 and Theorem 2 
we find that the algebra A, is completely determined by the invariant 
r(p) = f(p)a(p)n* (mod 1), where pe®™/ (a). We remark that 
r(p) =0 (mod 1) if p{ (a); namely.then a is a unit for the prime divisor p. 
Hence, by Lemma 2, 4, ~ kp- Therefore, the algebra A is ramified at most 
at the prime divisors of (a). ‘As usual it follows that 


à r(p) ==0 (mod 1) for the invariants of an arbitrary algebra A/k, 


1 C. Chevalley, “La théorie du symbole de restes normiques” Journ. f. a. r. u. a. 
Math., ‘vol. 169 (1933); pp. 140-157. 

6 Ch. C. Tsen, “ Divisionsalgebren über APRES Nach. v. d. Gesell. 
d. Wiss. Güttingen (1933), pp. 335-339. 

16 The local invariants r( p) are defined to be the invariants of the limit algebras 
A, ‘of À.” : : 
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for $ fı(p)ai(p) — 0.7 Thus, we established a generalization of the classical 
1 


norm theorem. | 

Suppose now that k is a rational function field T(x). Then the prime 
divisors p at finite distance with respect to x can be represented by irreducible 
polynomials m of degree f(p) with respect to x Moreover, every divisor of 
degree 0 in k = T (x) is a principal divisor, i. e. it belongs to an element of k. 
We then can prove that for every finite set b,,- : :,p, of prime divisors to 
which there are associated rational fractions abs? whose sum is 0, there 
exists an algebra (T,k/k, Fn, a) whose local invariants r(p,) == aib”, 
r(p) =0 if DAP," ` ty Pa 


To prove this assertion we proceed as follows." Put n == Il bif (pı) and 
4=1 


ay = an (bif (ps) 7, t— 1, 2,-- -,8. Then the divisor Il p;** has the order 
: | ii 
È af (ps) = >» ayn (bif (Ds) f (pi) =n À abc: ox Q. 


Consequently, by assumption, fre == (a). Hence the none Fy, a) 


obviously “has all the required ere 

If the genus of k is greater than 0 then one can readily construct examples 
for which there exist no algebras with prescribed invariants. Namely, take 
for k a field of genus > 0 whose defining equation f(x, y) — 0 has coefficients 
‘ in the field of all complex numbers C. There exist then infinitely many 
‘divisor classes whose orders are infinite. Selecting the p; and ab, appro- 
priately one easily. can construct the necessary counter examples. 

We now want to prove that every division algebra 


D~ (kT n/k, Fn, a) is ramified. 
Let a=] m“, 0 < a <n, where the 7; are irreducible polynomials in s 
41 


belonging as uniformizing variables to the (finite) prime divisors py. Then 
the algebra (KT,/k, Fn, a) is similar to the direct product of the s algebras 


Ay == (kT 'n/k, Fm mit). 


Each one of these algebras is at most ramified at p; and bo, where poo denotes 
the denominator of v. The invariants r(p,) of A are the same as the in- 
variants of A, at p, according to the structure of local algebras. Thus, 
a finite prime divisor p, gives rise to a local division algebra, if and only if 








“H, p. 45. 
18 W, Theorem 18, p. 27. 
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af (pi) 5E 0 (mod n). 


Now let us prove that af(p;) = 0 (mod n) implies A; ~ k. 
Let (Tuk/k, Fnyn*) be such an algebra. Denote (f n) by d. Then 


p= Pi Ba 


with prime divisors P, in T,k. All these prime divisors P, == (I;) are 
- principal for Tpk == T(x). We have 


N Pi = pë — (x) "a, 


Next NP, == (3)"*/¢. Hence (m) == N (IL )®. ‘Namely, there exist integers 
a, v, such that ndu + fdv == 1. Now, as a consequence of af == 0 (mod n), 
af == gn. Whence we get afd == gnd. Consequently, 


N (I)? == (x) = (x) 91/4, and 
N(T:) = (a) eri, 

Hence 
N (1) "N (ory) #4 = (p)* == N(,)8. 


Since units are irrelevant for the structure of factor sets in cyclic algebras, 
we get 

a r’ == NU, or 
(Tnk/k, Fn, 7t) ~k if af = 0 (mod n). 


Consequently, the algebras A; for which af (p4) == 0 (mod n) can be omitted 
in the representation of the algebra A. Combining these results we find that 
a cyclic product A which is similar to a proper division algebra over k must 
have at least two ramifications. 

As usual we have 1? 


TueoreM 3. The class group of normal algebras over k—T(ax) is 
isomorphic with a subgroup S of the additive group {r(p)} of all vectors of 
' rational numbers mod 1. The group S consists of all vectors for which: 
= r(p) = 0 (mod 1) and r(p) = 0 for almost all p. 

+ Finally, we remark that the index and exponent of any normal -algebra 
over T(x) conicide.?? | | 
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2° W, Theorem 19, p. 27. 
*°W, Theorem 20, p. 28. 


ON A CERTAIN PARTITION FUNCTION.* 


By Ivan Nrven.** 


1. Introduction. It has been shown by Schur? (and by Gleissberg ? 
with a different method) that the number a, of partitions of an integer m 
with summands of the form 6n + 1 equals the number of partitions of m 
such that the difference between any two summands is at least three, and at 
least six in case both summands are divisible by three. The purpose of the 
present paper is the evaluation of the a, which may be considered as the 
coefficients of the powers of x in the expansion of the function . 

LIGNE) LS en 


as a power series, where. : 
fos} 
(1.2) f(x) -U (Az). 


That am represents the number of partitions of the integer m having summands 
of the form 6n + 1 is immediately verified by expanding F(s). 

The method used is essentially that employed by Professor Rademacher * 
in his investigation of the modular function J(r). The author takes this 
opportunity to thank Professor. Rademacher. for suggesting the problem and 
for advice on its solution. 


2. Transformation formulas. We employ the familiar transformation 
formula * 


. 


(2.1) | $ exp (2ri ett) 


= Ve j 3; (4—+) ts j exp (ri =) 


* Received May 5, 1938. 

** Harrison Research Fellow. 

1I. Schur, “Zur additiven Zahlentheorie,” Siteungsberiohte der Berliner Akademie, 
1926, pp. 488-495.. ; | 

z“ Über einen Satz von Herrn I. Schur,” Mathematische Zeitschrift, vol. 28 (1928), 
pp. 372-382. 

> Hans Rademacher, “The Fourier coefficients of the modular invariant J(7r),” 
American Journal of Mathematios, vol. 60 (1938), pp. 601-512. 

+G. H. Hardy and S. Ramanujan, “ Asymptotic formulae in combinatory analysis,” 
Proceedings of the London Mathematical Society (2), vol. 17 (1918), pp. 76-115, 
Lemma 4.31. | 
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in which ` | 
(2.2) (h,k) —1, hh’ =— 1 (mod k), 
and ows,» is a root of unity frequently used in modular function theory. Hardy 
and Ramanujan * give the values 
(2.3) ors = (—F]h) exp(—ri( 4 (2 — hk — h) 
+ Ma (k—1/k) (2h—W + hh’) }) 
for h odd, and 
(2.4) onx = (—h|k) exp(— mi{4(k — 1) 
+ Kae (k—1/k) (2h —h! + hth’) }) 
for k odd. We wish to obtain a transformation formula similar to (2.1) 
for the function ; 


(2.5) 7 | ep (ami +4) b 


by applying (2.1) to (1.1). There arise four cases, according as (k, 6) has 
the value 6, 3, 2, or 1, and the value of the function (2.5) is, respectively, 


oO ont {exp (ani A), 

(2. 7) One (2) F exp (= Gatt 2 
(2.8) On, xe (2) E exp (2 cait bs and 
(2.9) One (2)F exp (xi ee) a cer) i . 


Tn the last three gases, h’ is a solution of the congruence hh’ = — 1 (mod k) 

such that it is divisible by 2, 3, and 6 respectively; clearly this is possible 

because of the divisibility properties of k in the various cases. Also we have 
(2.10) Oy == SRE, ga (3) — exp j ag (1/42) l for (k, 6) = 6, 


Oh, &/8Wh,k/2 


(2.11) Qype DEDRA e(z) = exp j — Te (1/2 + 22) | tor (k, 6) — 8, 


Oh, k/aW2h,k 


ə, e WA, Bh, k/2 pe! Oo r T 
(2.12) Qax mua Wn (%) = exp SE (a + z) >for (k, 6) = 2, and 
WA, kDsh k 


(2.13) Ore = SRE, yala) — exp À — Ee (—(1/32)+ 26) | tor (k, 6) = 


® Loc. oit., p. 85. 
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3. Applying Cauchy’s Integral Formula to (1.1) we obtain 


1 F(z) 1 F(z) 
3. = ee = d — 
( 1) i mri Jo am | > Rri Jeux TU 
j OSA <ISN 


wherein ¥ means that h is summed over values prime to k. We choose C 
to be the circle | z | — exp(—2rN), so that the Farey arc & x is given by 








T = EXP (— 2 #42 ons), 
where — Wir = D S One; and, if h;/k:, ha/ka are the neighbours on the left 
and right respectively of h/k in the Farey series of order N, 


: 1 r: Teha 
GX Ouen, fus: 


Thus we have | 
D as ih , 
n= S Í F j exp (arr 5) l 
X exp | — m (an + HE Lans) a 


the subscripts being omitted from #14 and 61x. Now set w == N — ip and 
z = kw. 


(3.3). Qm == > exp (— amin) 


m X fr Le xp (754 — =) | ss 


In ordér to make use of the formulas (2.6), (2.7), (2.8), and (2.9), 
we break a, above into the parts dn), am‘, dn, and dm according as 
œ 6) APA 6, 3, 2,1 respectively. Applying (2.6) to an, we have 


(3. 4) Am (9) — x ‘exp (- mm) (0727 $ 
h, 
ORK ESN 
(k,6)=6 





o: e rihn 
x Í Ye(kw) 2 an exp ( ; — = as zamu ) dé. 


Splitting off the first term in the summation in the integrand, we write, 
aking use of the fact that ao = 1, and (2. 10), 





(3.5) Le Š Y ep (— 222) 0, o 


sm k 
aa) z 
- (2 


T rw e 
x fe (és + Drm — 7) ae 
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Hence am® == I; + I, I, being the same as (3.4) with the summation in’ 
the integrand ranging from 1 to œ. 

4. The evaluation of J; Formulas (2.3) and (2.10) imply 


(4.1) (miel mi (Ht we EE KE) | tor (k,6) — 6, 


where h’ has been chosen so that hh’ = — 1 (mod 12k). Using 





i 1 ani 1 
; == = ESS — #" 
7 CE ES TE +H SEW FH) n FE) 
we may write - . 
EES 
0 N œ z 
(4.3) L= X Saf exp j — zy C” — %) + rw (2m — 4) | dé 
(8) =8 poe 1 
KUN +k) : 
1D exp LE ET à Fox 
h mod k l k ; 
` 1 
g 1+1) 
pA N+k-1 
43 Se > ze | AE nha Lowes f 
k=1 n=l kit 
(&,6)=6 ee 
ki 
- 1 
xt 


+3 Sa D exp | — 28 (mh nv) pos ES" 


kl n=l km 





— S, + 8.4 89s, 


where the integrand ig the same in all three expressions. By (4.1) the inner 
sum in S, is the Kloosterman sum 


[4 ri 7 k? 
(4.4) ee P j — aa h (12m — —1— FHA (— 12n + E i L. 
The quantities in parentheses are integers since k is divisible by 6. Since we 
required hh’ == — 1 (mod 12k), this sum may be considered as an incomplete 
sum mod 12k; using a device of Estermann,’ and an estimate of Salié,’ the 
sum (4.4) is 
O (5/3 (12m — 1 — k?/3, k)*/*) == O (ktm), 


ST. Estermann, “ Vereinfachter Beweis eines Satzes von Kloosterman,” Abhand- 
lungen Hamburg. Aath. Seminar, vol. T (1929), p. 94. 

TH. Balié, “ Zur Abschätzung der Fourierkoeffizienten ganzer Modulformen,” Mathe- 
matische Zeitschrift, vol. 36 (1933), p. 264. 
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Using the inequality 


(45) Rfi ey toe} mt 





we obtain 


8,=-0 (3 5 D On EE P \- m+ 1N~*(2m—%) jme) 


kel nel 
oo N 
= 0 G exp (2rmN-*) m1/* F, a, exp a =) > vn) 
N n=l 2 k=l 
= 0 (= exp (2rmN-*) m Nere) 
(4. 6) S, = O (exp (rm N -2 ym N-13+¢), 


. Because of the similarity of S, and Ss, we shall treat only the latter. 
Interchanging the summations with respect to h and } we get 


1 
oo N+k-1 Ki 
BS Lan > ei pry Cr — %) + ww (2n — %) l ag 
ris aly I=N+1 


, _ Bri EL Le 
Z| test l. 
HSS 
In order to interpret the restriction on kz in the inner Lun we recall from 
the theory of Farey series that if 


hy h ha 
Ek ke 


are three neighbors in the Farey series of order N, then 


hk, — hık = 1 = hak — hk: 
so that 

hk, = — hk, = 1 (mod k), 
or, by applying (2.2) 
(4.7) — kı, = kı = k (mod k). 
From this it follows that the above restriction on k, implies a restriction on h’ 
to an interval mod k equivalent to one or two intervals in the range 0 = W < k. 
Tf the Kloosterman sum is considered mod 12k, we have a restriction on both 
h and k’. We proceed to remove the restriction on h, so that the sum may be 
treated as was (4. 4). 
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We shall prove that if the sum 
Rri k? 
(4.8) Las exp | in (ièm—1 À) 
N <EHkaS1 ka | 
: +H —1n+i—2)} 


is multiplied by 


(4.9) Heels i f ak (12m —1— F) | 


+ pt (en 41-2) | S 


where 8 == a or Ta according as (k, 12) — 12 or 6, the product is. ` 
j Rri k? 
(PA nmas exp | tgp | (19m —1— 4 
N CHE ie ) 
where A’ satisfies the congruence hh’ == — 1 (mod 12k). We must prove that 
(h + ak) (h + pk) = — 1(mod 12k), 
or, multiplying by h, a 
Bh? — a + aBhk = 0 (mod 12). 
If (k, w) == 12, we use 8 = a and require 
. a(h? — 1) = 0 (mod 12) 
which is obviously true since h is prime to k which. is divisible by 6. If 
(k, 12) == 6, we use 8 = Ya and require 
a(th? — 1) + Yathk = 0(mod 12) 
which reduces, since 7h? — 1 == 6 (mod 12), to 
a(6 + Yakh) = 0 (mod 12). 


This is true since the factor in parentheses is divisible by 6 when « is even, 
and by 12 when a is odd. The one following (4. 4) shows that (4. 10), 
and hence (4.8), equals 


(4.11) O( (12k)? (12m — 1 — k?/3, 12k) 1) = O (k/m), 


Thus : 
g 0 Nc Ss 1 . ik 
(9 -0(% Sm Sdn TETE) 
atten zt + aN (2m — 16) Finn) 


=0(3 5 >> EW exp (2amN-*) prem”), 
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(4.12) © Sa = O(exp(2rmN-*) m5 Ny (1/8) 4€) , 
Combining (4. 3), (4.6), and (4.12), we obtain . 
(4. 18) I L O (exp (2rmN-2) mN -0/), | 


5. The evaluation of I. We divide the expression I, in (3.5) into 
three parts Qo(m), Qi(m), and Qa(m) by splitting the limits of integration 
into the parts 


, 
? 


(- + 2 , CEE (= BTR = EN) 








and 
(ETE 5:55) 
k(N +h)? k(k:+Ek)/" 
Thus ; 
O(m) =E Bm) fo exp (aes + xu(2m—%) ) as, 
(k,6)=6 ENS 
where 


(5.1)  Bifm)— SY exp (- mtn) Ona for (k, 6) — 6. 
h mod k | 


Let R be the positive circuit of the rectangular path with vertices 





-2 i 
+N GB) * 


Then, using w == N?— jp, the integral in Q)(m) above may be equated to 


(5.2) +f; exp ee L nto (2m — #)) aw 


| : T 4 


4 K i i 
“ty, —————, -N~ n -N-21 —— 
N™ NET N+ LIN) NA EN) 


; j 
— erly (m)— 7 (Ji+d2+Js), 
where all four integrals have the same integrand. 
The integral 
— 1 PAA E a 
Li(m) an), oP (Bs + atv (2m — #)) dw 
may be expressed in terms of well known integrals from the theory of Bessel 
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functions. For the Bessel function of the first kind with purely imaginary 
argument ê we have 





(2/2)? con ( 
(5.3) Ip(2) = ft exp +2 di 
and ; 
2 
(5. 4) ` Ipa (2) —Ipa (2) — Ip(2), 
the latter being used with p= 0. From these we obtain 
1 o) 
5.5 L = — I 
Cu dé em | 3k 
Now, along the paths of integration in J, and Js we have 
— —. N-? << < -3 
w Er ES Ne Sus NT, 
It follows that 
R(w)=u<N3, R(1/w) = — < N (N +k)? S 4k 
7 2 
w + aN bP 


so that the absolute value of the integrand in each of J, and Js is 


S exp (Yr + 2rmN*) 
and hence 
| J; | 2 -2 
jea exp (Hr + 2rmN*). 


In J, we have w = — No ee tv, where 


D eo z) = Saro ky 
It follows that 


—N? 
NT 
E(w) N? <0, LO!) = “Wap <% 
so that the PT vas of the integrand is less than unity, whence 
-1 AT-1 
EARS ENTE < kN. 


Collecting the last few results, and recalling that ` 
By(m) == O (k/m), 
we obtain 


(5.6) Qo(m) = 2x È > Bri(m)lnx(m) M) du a a 


. (%,6)=6 


"G. N. Watson, Theory of Bessel Functions (Cambridge, 1922), p. 181, (1), 
p- 79, (1). 
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We now treat Q,(m), and point out that Q:(m) may be handled analo- 
gously. We had 


Qs(m)— SB esp (= rh, 





k=l kAmodk 
(ky 
1 
TECH 
Sf T 9 a 
X a exp 6k?w + m ( m — w) ẹ. 

1 

“kl 


Interchanging the summations with respect to 1 and h, we obtain 


1 





: 7 KG) 
N N+k-1 
(5.7) Qi(m) = = À, f exp (sect rw (2m — %)) dẹ 
(k,6)=8 i BE: 
ki P Rrthm 
| nei ap (— k ES 
N <kies 


The inner sum may be treated as was (4.8), yielding 


0 (H2/8+emt /3) ; 
Noting that 


R(T 
6k*w) Gk (N + 4°) 
oT: T 


à 
= 6k N? (N= + k- (N + k)-*) = 6 (k? N- + 14) 


m 


and 
R(aw (2m — %)) < 2rmN=?, 
we conclude that 


Q O 5 S E 1 l Irm N-21) k2/3 m17? 
CS (= tk kl k(1+1) exp ( ) m/8) 


N , 
= 0 Y i exp (2armN-*) k?/3+¢mi/8 
ie kN 
= 0 (N-00) + exp (QrmN-*) m). 
Since a similar result is valid for Q2(m), we combine this result with (4. 13) 
and (5.6) to obtain 
N 
(5.8) dy) me 2r X, By (m) Le (m) + O (exp (RrmN-?) mN- 4/8) +) | 
k=1 
(k,6)=6 . 


6. Estimations for an) and a;(%). If the power series expansion for 
à co $ . 
Fo (2) is X baz” then a, is, from (3.3) and (2.11), 
n=0 . . 


9 
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No ` ; 
(6. 1) = > bn 2 exp ex = (2hm — h'n) lans 


kzi n=0 


x fo exp | — i + Ka) + rotèm— 4) | as. 


This differs from am‘® in that the coefficient of 1/w in the exponent of the 
integrand is always negative, whereas in am‘) it was positive for n == 0, which 
forced us to consider this case separately (section 5). It is clear, then, that 
we proceed with (6.1) as we did with J, in section 4. Formulas (2.4) and 
(2.11) imply 

š / / 
(6.2) Qx—— exp | n(m t zE) \ tor (k, 6) =3 


provided that h’ is chosen so that 
hh’ = — 1 mod 3k. 


Corresponding to (4.4) we now have 


; ` _B+38\ K eeey) | 
1650) oof Te a a TR Jr | 
Since (k,6) = 3, (k? + 3) is divisible by 12. Also we may choose h’ to be 
even, restricting it to the interval 0 < W’ < 6k. Then set A” —h’/2, so that 
3k—1 l 
2 


The sum (6.3) is then an incomplete Kloosterman sum mod 8k, since for 
hh” = a mod k, (a, k) —1 








0O<h” < 38k and hh”= mod 3k. 


Sep (2 (uh + vk”) p= SY exp j 2a (uh Fak) l 


where hh’ = 1 mod k. Corresponding to (4.10) we have 


tea Bei aa a 


N < krki 





to which we apply the argument following (4.10). Thus 
(6.5) Om ®) = O (exp (2rmN-*) m¥/2N-(1/8)+e), 
The quantity 
am” = Š D ba SY exp j Pis (3hm — h’n) lan 
k= n0 hmodk 3k | 


(k,8)=2 


x fap} = en (2+ H) + mi(èm — H) | ag, 
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derived from formulas (2.8), (2.12), and (3.3), may be treated similarly. 
From (2.3) and (2.12) we obtain 


TESE p(t c ay? Daos 


T Geer ae 


(k,6) —2 and w= or SEL mod 4k 


80 that the Kissen sum is 











according as k= 1 or 2 mod 3. This sum is incomplete mod 4k, and we 
conclude that 
(6.6) m2) = O (exp (24mN~*) m¥/3N-(/8)+), 


7. The evaluation of am. The quantity 


æ z 
(7. 1) Gin (1) = = = an > exp { TA = (6hm — h’n) l an 
Goa n=0 hmodk 
0” 


x, ord sopp C — 12n) + mu (2m — 4) bag 


resembles 4% (% in that the coefficient of 1/w in the exponent of the integrand 
is positive for n = 0, so that the first term of the inner sum must be treated 
separately, which treatment follows that of section 5. “Evaluating Qs with 
the aid'of (2.4) and (2.18), we have 





2h Wk h 
(7.2) ma exp | T (zak E +E) | for (k, 6) —1. 


We may choose h’ divisible by 6 and satisfying 
hh’ = — 1 mod k 


since (k, 6) —1. The Kloosterman sum is 


e gel TRE 


Note that (k*— 1) is divisible by 12. Then h’/6 may be replaced by an 
integer AV satisfying 


bi; sp 1 


Veo 
AMV = 6 or 6 mod k 
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according as k= 1 or — 1 mod 6. Thus we obtain a result similar to W 8), 


(7. 8) Om) = 2a > oi. ats O (exp (2amN-2) m¥/8N-0/8)+6), 


(k, Pi 
wherein 


S 1 NET +) 
. L fe 
GP sus MES ( 18k 


and By(m) has the same form as in (5.1) but with (4,6) now equal to 
unity, Qx being defined by (7.2) in this case. 

We now collect our results. Formulas (5.8), (6.5), (6.6), and (7.3) 
combine to give 
(7. 5) Am == Em © + Ay ® + Am?) oe Am) 

N 
om 22 D Br(m)Lx(m) + O (exp (2rmN-*) mV8N- 3/846)” 

k=l 
provided we set . : 
(7.6) Lx(m) = 0 when (k, 6) == 3 or 2. 


In (7.5) we hold m fixed aid let N become infinite so that the error term 
. becomes zero, arid dm-is expressed by the convergent series 


(7.7) | dm = 2 È By m) La(m) 


the various quantities in this result being given by formulas (5. 1) with (4.1) 
and (7.8), (5.5), (7.4), and (7.6). 
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FINITE METABELIAN GROUPS AND PLÜCKER LINE. 
COORDINATES.* 


By H. R. BRAHANA. 


1. Introduction. We are concerned with finite metabelian groups whose 
operators are all, except identity, of order p. The metabelian groups are those 
whose commutators are all in the central. If G is such a group and C is its 
central, then G/C is abelian and of type 1,1,---. Corresponding to every 
subgroup of G/C there is a subgroup of G which is either abelian or metabelian. 
Whenever G/C is the direct product of two of its subgroups, T, and Ta, which 
are such that the corresponding subgroups G, and Ge of G are both abelian, 
then G is a subgroup of the holomorph of G, and also a subgroup of the holo- 
morph of G. Conversely, when G is a subgroup of the holomorph of one of 
its abelian subgroups, then G/C is the direct product of two subgroups, Ty 
and T+, which correspond to abelian subgroups of G. A method of classifica- 
tion of groups possessing this property has been given.! Our main interest 
here is in the groups which do not possess this property. 

A group G which is not abelian has a commutator subgroup which is not 
identity. The central C either coincides with this commutator subgroup or 
else is the direct product of it and an abelian subgroup O” which is of type 
1,1,:::. In the latter case G itself is the direct product of C’ and a meta- 
belian group @ which has all the interesting properties of G.” We shall 
assume in what follows that the central and the commutator subgroup of G 
coincide. 

The abelian subgroup C of G is not maximal abelian, for the group {C, s}, 
where s is any operator of @ not in C, is abelian. The group {C,s} may or 
may not be maximal abelian. Whether or not {C,s} is maximal abelian will 
depend in general on the choice of s. The possession of a maximal abelian 
subgroup {C,s} will be a characteristic property of @. We propose to investi- 
gate such groups G as have a maximal abelian subgroup {C,s}.° This class 


* Received December 12, 1938; Revised September 22, 1939. 

1Cf. for references H. R. Brahana, “Metabelian groups and trilinear forms,” 
Duke Mathematical Journal, vol. 1 (1935), pp. 185-197. 

“More specifically, if two groups have the same order and possess the same G’, 
then they are simply isomorphic. 

SCf. American Journal of Mathematics, vol. 56 (1934), p. 496. The theorem 
(5.2) which purports to deal with these groups is incorrect. We regret the incorrect 
theorem and the erroneous proof. The theorem was beside the main line of development 
of the paper and those which follow it. 
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of groups contains most of the groups which have been considered in the 
papers referred to above. However, our present investigation will be greatly 
facilitated by the work which has been done, for once it is recognized that G 
belongs to the holomorph of one of its abelian subgr oups G is quickly ia identified 
by reference to those papers. 
We denote the order of C by p°; we denote the maximal abelian subgroup l 
- {C,s} of order p°* by H. Then we suppose G to be {H, Un Unt - +, Ux}. 
The order of G is p™™*, If H is transformed by U = {U,, Un" : ‘ , Ux} 
there is obtained a commutator subgroup, which lies in C. The fact that H 
ie maximal abelian requires that the commutator subgroup obtained by trans- 
forming H by U be of order p*. Hence we must have c = k. If U were 
abelian, G would be in the holomorph of H. We may therefore suppose that 
U is non-abelian. U then contains a commutator subgroup of order p! where 
}==1. If the commutators of pairs of Ups are all independent, then : 
1—k(k—1)/2 Since C is the commutator subgroup as well as the central 
of G, we have kS cS k(k-+1)/2. We note that G is generated by the k 
Uys and s. The numbers k and c are characteristic for a given G. We shall 
. consider the groups @ for a given k in subclasses according to the values of c. 

When c—%(# + 1)/2 the group is completely determined by the number 
k, since two such groups with the same k may be made simply isomorphic by 
letting generators correspond and letting commutators of corresponding pairs 
of generators correspond. We shall refer to this group as the master group 
for a given k. Such a group contains no abelian subgroup of order p°*. 
For all other values of c there exist groups @ which contain abelian subgroups 
of order p**. The possession of such subgroups is a characteristic property 
of a group and hence any set of invariants which determines the group must’ 
determine the number of such abelian subgroups contained in the group. 
Our method is to examine G for its abelian subgroups. This brings into 
l prominence a certain matrix M whose properties give a set of definitive 
l properties of G for k < 4. The elements of M are linear forms in certain 
indeterminates and the existence of certain abelian subgroups of G implies 
the existence of sets of values for the indeterminates which determine certain 
ranks for M. This investigation is carried out in §2 for k< 4. 

In § 3 the problem of the classification of these groups is approached from 
another direction, and for the case k = 3 is seen to be closely connected with 
Plücker line-codrdinates in a finite three-space. The geometric formulation 
cf the problem gives it an appearance of simplicity which would be misleading 
were we not warned by the intricacy of the considerations in §2. The exposi- 
tion of the close connection between these two seemingly distinct subjects is, 
of course, the important contribution of this paper. 
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| 2. The groups G for k<3. When k—1, then c—1. (G is of order 
p° and there is only one such metabelian group. This is the master group 
described in the preceding paragraph, and is otherwise well-known. 

When k= 2, then c is 2 or 3. There is but one group for c = 8 as was 
seen above. For c= 2 there is a group G which belongs to-the holomorph 
of H. This group is generated by s, Ui, and U, which satisfy the following 
relations with a == 8 —0: 

(1) Uy 180, = 58, ; Us *U U: = U:6,%8.8, 
Ur 1802 = 883. 


H == {8, $1, 82} is of order p° since s is not in C and H is maximal abelian. 

© Any group G with k == 2 is generated by operators satisfying (1) with 
a and £ having suitable values. Two groups generated by operators satisfying 
(1) both having «== 8 = 0 are obviously simply isomorphic. Hence for a 
group G, not in the holomorph of H, not both «0 and B—0. The sub- 
group of the holomorph of H described in the last paragraph contains the 
abelian subgroup {C, Ui, U2} whose order is p*? == pt. If a and £ exist, 
such that Œ contains no abelian subgroup of order p*, then the resulting G 
will be distinct from the one already obtained. An abelian subgroup of order 
p* will contain two independent operators which are not in C and neither is 
in the group generated by the other and C. Every operator of G can be 
written in the form c,s*U,*U2', where c; is some operator in C. The com- 
mutator of this and any other operator of G is independent of c+; hence for 
the purpose of investigating commutators we may assume that c;==1. Let 
V, = 0 MU, 11,2. Then 

ViVi. =e 8, oa Ort: $,01la709h (81482) Palais, 


If yı and V: arą permutable this commutator is identity and we have the 
following congruences, mod p: 


lika — lakı +a (kil = els) = 0, 
le — dal, + B(kils =, kel) == 0. 
These are linear in dg, ka, l3. .The matrix of coefficients is 
M Z ky a—al, — n) 
— h MER bl a, + Bki À 


whose rank-is of course at most 2. Hence there is always a solution ‘of the 
system of congruences. This corresponds to the fact that V, is permutable 


‘This ig the only non-abelian group of order p* which contains operators of order 
p only. It is to be noted that p = 2, since the only groups whose operators are all 
except identity of order 2 are the abelian groups of type 1,1,. - .. 
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with any power of itself, If V exists and is independent of and permutable 
with V1, then for some V, the above system must have two independent 
solutions and the rank of M must be at most 1. This requires that 


Bla + al, —al,? =0, 
akılı ——* Qik, Pre, Bk? = 0. 


Since H is maximal abelian we cannot have k, = l, — 0, and hence the two 
conditions reduce to Bk, + a — al, == 0. Therefore, whatever the values of 
a and f, Ga, kı, lı may be found such that the rank of Mf is 1. Consequently, 
if c = 2, @ contains an abelian subgroup of order p°*?. 

The abelian group {C, V;, V2} does not contain s, for in that case H 
would-not be maximal abelian. Hence G is generated by s, Vi, and Va which 
satisfy the relations (1) with a = 8'= 0. G is therefore a subgroup of the 
holomorph of H whenever c = 2. | 

When k—3, we have 3=c<6. -There is one and only one group 
_ for c—6. The simpler cases are those for which c is large; we consider the 
case where c= 5. The commutators obtained by transforming s by Ui, Us, 
and U, are independent and at least two of those obtained by transforming | 
one U, by another are independent of each other and the three preceding 
commutators. We may therefore suppose that generators of G satisfy the 
` relations: | 

Uy SU, — 581, DU U2 = U181%3 28575 .5356, 
(2) Ut SU = 889, UUU; = U:84, 
Ut sU 5 = 863, Us UU, = Us. 








There exists one such group with a = 8 == y == ò = e = 0. This group 
contains the abelian subgroup {C, Ui, U2} whose order is p°*. Conversely, 
any group G with k =3 and c= 5, which contains H as a maximal abelian 
subgroup and contains also an abelian subgroup of order p***, is simply iso- 
morphic with this group, for the abelian subgroup of order p°** contains two: 
operators V, and FV, such that {C, V.,-V2} is of order p°** and does not con- 
tain H. Then V, and F: may be used for U, and U: in rélations (2). 

Therefore, if any other group exists, it must contain no abelian subgroup 
of order po, Let V, = sU, =U, Up, i= 1,2. The condition that G 
have an abelian subgroup of order p°*? is that there exist V, and V. which are 
independent and permutable. This leads as before to a set of congruences 
bilinear in the two sets of exponents. Considering these as linear congruences 
in the exponents of V: and writing the condition that the system have a 
solution, we obtain the matrix 
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—k, a, — al, akı 0 

— À, — Bl, a+ Bk, 0 
M= | — m — yl, yki a 
0 — m — ôl 8h, k 

0 — € —m + ek, h 


If V, and V; independent and permutable exist it is necessary and sufficient 
that a, kı, l, m, exist such that the rank of M is at most 2. Now a solution 
of the system of congruences is a set of values for Ge, kz, l2, Ms Which define a 
V2 permutable with another operator and hence is a set of values which when 
used in place of a., kı, li, m, in M reduce the rank to 2. Also the existence 
of such a V, implies that any operator of {V:, V.} when expressed in terms 
cf 8, U,, Uz, Us has a set of exponents which will reduce the rank of M to 2. 
Consequently, if there exists a V, which reduces the rank of Jf to 2, there 
exists a V, with m, =— 0 which reduces the rank of M to 2. Letting m, — 0 
and recalling that we may not have a, =k, == l, == 0 also, we obtain 


y + B5— ae= 0 


as the condition that it be possible to reduce the rank of M to 2. Since there 
exist numbers a, 8, y, 6, e which do not satisfy this condition there exist 
groups G with c==5 which contain no abelian subgroup of order p***. 

We now determine a canonical form for a set of generating relations for 
the group with c == 5, and no abelian subgroup of order p°**, and in so doing 
show that these conditions are sufficient to determine the group. This canoni- 
cal form is a particular set of values for &, B, y, 8, e This set of numbers 
determines the expression of the commutator of U, and Uz in terms of the 
commutators of the other pairs of generators. Taking sı, 8e, °° :;,s as defined 
by (2), we see that the commutator of U, and Ue cannot be in {Ss 8s} for 
then the group {U,, U2, U3} would be that metabelian group we considered 
with k == 2 and c = 2 and hence would contain an abelian subgroup of order 
p*. Though this group is of order p*, it would determine an abelian subgroup 
of order p°*? of G, namely, the direct product of the group of order p* and 
{81, 82 Sa}. Therefore the commutator subgroup of {U1, U2, Ua} has a sub- 
group of order p in common with. {s1, S2, 83}. Every operator of the commu- 
tator subgroup of {U,, U2, Us} is a commutator, for otherwise some quotient 
group of {U,, Us, Us} would be a metabelian group with k == 2, c= 2, and 
no abelian subgroup of order p*. We have seen that no such group exists. 
Therefore, U’, and U’ may be chosen in U so that their commutator is in 
{81, 82, Ss}. Let #i'be the commutator of U’; and s. The commutator of U’, 
end U’: cannot be in {s'1, 82}; for then {H, U’;, U’2} would have a commu- 


t 
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tator subgroup of order p? and hence would have an abelian subgroup of order 
pe?, We may therefore denote the commutator of U’, and U’: by Ss, and 
then find in U an operator U’, independent of U’, and U’; which with s has 
.s4 for a commutator. Therefore, any group having the given properties 
has generators which satisfy (2) with a = 8 = ô = e = 0 and y =1. Thus 
_ there are just two groups with k — 3 and c = b, and, they are distinguished: 
.by the fact that one contains an abelian subgroup of order ae and the other 
does not. 

When c = 4 the considerations which we have employed above show that 
G is generated by operators satisfying the relations: 


USU = 581, 027002 = Using, : | 
(3) : | U8 Uz = 882, A Ust U Us = U 181228828371 
Us sU; —. 883, Us UaUs = U84. 


The condition for permutability of V, and V: is that the matrix 


— k, h — l — Gama aik l azk 

M — —= l — Bali — Bam, & + Bik nn Bok, 
— M,  — yh — yM, yıkı ay + yki 

0 0 — M -. L 


be of rank 2. Again, if V, exists such that the rank of M is 2 then there 
exists such a V, with m, = 0. In order that M be of rank 2 when m, = 0 
it is necessary that y, = -0 or else that l —0. The latter possibility requires 
further that 


a,” + a + ye) Qik, + (B1y2 — Bari =o. 


In order that a, and k, rational and not both zero, exist and satisfy this | 
relation it is necessary and sufficient that 


(4) a (Bi ——y2)" + 4Boy1 


be a square, mod p. Since it is possible to select numbers &ı, B1,° °°» y» 
with y: >< 0 so that this condition is not satisfied it follows that there exist 
groups @ with c==4 which contain no abelian subgroup of order p*?. When 
yı = 0, then (4) is a square, and therefore, that (4) be a square, is a neces- 
sary as well as a sufficient can for @ to contain an abelian subgroup of 
order p***. 

Two groups which have k= 3, c—4, and which contain no ein 
_ subgroup of order p°*? are simply isomorphic. One such group is generated 
by operators satisfying (3) with a4.—fi=®—=y—0, 8i = 1, y =r, 
where r is a particular not-square. New generators U’;, U's, U’s: may be 
selected in U so that ee 
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Cig == sing Pig’ Vi 


ao 
( ) Cig = 8,088 Pag”, 3), 


where c'i; is the commutator of U’; and U’, 84 is the commutator of U”; 
and s, and @, 81," - *, yz are arbitrary except for the condition that (4) be 
not a square. Let 

Ua =U I 

Uge Y UU 

U’; = Ọ,4U 3U; ™, 


Tf the commutators are obtained and conditions derived that they satisfy (5), 
there are obtained six linear non-homogeneous congruences in the six unknowns 
kist >t, M. The ranks of the matrix of coefficients and the augmented 
matrix are the same provided Bry: — yı Æ 0, which is true whenever (4) 
is not a square. This completes the proof of the statement at the beginning 
of the paragph. > à | 

For any other group whose generators satisfy (3) we may suppose that 
(4) is a square. Then there exists a set a, kı, 0,0 which reduces the rank 
of M to at most 2. If for a particular set the rank of M becomes 1, then the 
corresponding system of congruences has three linearly independent solutions. 
These solutions define V, itself and two others which we denote by V: and Vs. 
Then every element of {V2, Vs} is permutable with V;. Hence @ contains 
at least p + 1 abelian subgroups of order p°*?. Two such groups are simply 
isomorphic since Vi, V2, Vs may be taken as generators. They satisfy (3) 
with a, == 8, ==: : : == y = 0. Conditions that the rank of Af may be made 
1 are: ya B2 = y1.— 0, since k, = 0 implies a, == 0. 

For the remaining groups we may suppose that M becomes of rank 2 for 
l = m, = 0 and a, and k, satisfying the quadratic which precedes (4). 
Writing this quadratic in the form 


(a 7 Aik1) (ay — ki) = 0, 


we note that a,/k, =A, or À, and unless À = àz there exist two independent 
sets a, kı, 0,0 and a'i, k'a, 0,0 each of which reduces the rank of if to 2. 
When À, À, the system of congruences for the determination of de, ka, l2, Ma 
becomes 
— Oe + Mkr + aile + Qam = 0, 
(Ar — Bi) la — Bam: = 0, 
ile + (AG = ye) Me = 0, 


‘where the last two are dependent. ane following sets reduce M to a matrix 
of rank 2: | 

à 1,0,0 Ge, ke, — Bo, Ar + Bi 

À, 1, 0, 0 das Was — Bas M + Bi. 
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The sets in the same row determine permutable operators V,, V2 and V'i, V'a. 
If Ba 0 and M À, then these four operators are independent and they 
generate G. G then contains two abelian subgroups of order p°* and is a . 
subgroup of the holomorph of {C, V:, V2}. Such a group is completely deter- 
mined by the above mentioned properties.’ It is not simply isomorphic with 
any of the groups determined previously. The assumption that 8: 34 0 is not 
essential, for if Ba == 0, we solve the first and third congruences and obtain 
the same result so far as abelian subgroups of order p”? are concerned and. 
these abelian subgroups determine G. a | 

If A == àa then considerations similar to those used above show that G 
contains but one abelian subgroup of order p*?. Under the given conditions 
cn the exponents @,,: > +, y: new generators U’;, U’s, U’, in U may be found 
such that Bei and g'i == By = y, = a = y2 == 0. This shows the 
existence and the uniqueness of the group with one abelian subgroup of 
order p*?, l 

‘Hence for k==3 and c= 4 there are four groups having respectively 
0,1,2, and p + 1 abelian subgroups of order p°*?; any two groups with the 
same number of abelian subgroups of order p°* are simply isomorphic. 

For c == 3 generators of @ satisfy the relations: 


U sU = S81, UU WU — U 18188, 
(6) U27sU 2 _— 882, UUU; = U 18192892833, 
U,738U — 883, Us UaUs — U'2s,%s°88,78, 


Conditions for the permutability of y , and V; require the rank of 


— k, a — Gil — am, ok, — asm, i akı + aal 
M =| —h — ıı — Bom, 4 + Biki — bm Bok, + Bali 
— M — Yılı — YM yiky — ysm Ay + yeka A ysl 


to be at most 2 for a proper choice of V,. For certain groups G, in other 
words for certain sets æ, 8, y, it is possible to choose V, so that A7 has rank 1. 
_ In such a case V, is permutable with V, and Vs and Vi, Vo, Va are inde- 
pendent. Since H is maximal abelian, s is not contained in {Vi, V2, Vs}, 
each operator of which reduces the rank of M to 2 or 1. Hence G is generated 
by s, Vi, Va, and Vs. We may take U; to be F, and assume G to be gen- 
erated: by operators which satisfy (6) with a, = f, = yı = %2 = f: = y: = 0. 
If in addition a3 == fs == ys = 0, then U2 and Us also reduce the rank of M 
to 1. This group is generated by the two abelian subgroups H and Ọ, and is 


3Cf. “On the metabelian groups which contain a given group H as a maximal 
invariant abelian subgroup,” American Journal of Mathematios, vol. 56 (1934), p. 510. 
This is the first group in the table. : 
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therefore. a subgroup of the holomorph of H.: It is completely determined by 
its order and the order of its commutator subgroup.® It contains p? + p +1 
abelian subgroups of order p*? corresponding to the same number of subgroups 
of order p° in U. | 

If os, Bs, ys are not all zero, we note’ that every element of {U2, Us} 
reduces the rank of M to 2, since every such element is permutable with Uj. 
G therefore contains at least p + 1 abelian subgroups of order p**?, one for 
each subgroup of order p in (Uz, Us}. It will be convenient to write M in 
the more special form 


| — k, ay = CELL Gl, 
W =| — l 0 a, — Bam, Bsh |. 
SE Mi 0 ae Ya ay + ysl 


The choice @, ki, l, m1 = 0, as, Bs, ya reduces the rank of M’ to 1. The 
corresponding operator V, is permutable with U, since every operator of U 
is permutable with Ui; it is permutable with V, determined by B3, 0, 0, 1 
which is not in U if B;5£0; and it is permutable with V, determined by 
— ys, 0,1, 0 which is not in U if y0. If neither Bs nor ys is zero, the 
operator V, is in the group {U,, V;, V2} since 


0, as, Ba, Ys 


Bs, 0, 0, 1 
— Ya 0, 1, 0 
0, 1, 0; 0 


are linearly dependent. If not both Bs and ys are zero, then G is generated 
by U,, Uz, Vi, and Vz or V3. The pairs Ui, Uz and Vi, Va (or Vi, Va) are 
permutable. Hence if not both 8, and ys are zero G is generated by two of 
its abelian subgroups and hence is a subgroup of the holomorph of {C, U1, U2}. 
This group is generated by operators which satisfy the relations 


Uts U = S183, Uri Ua — $185, 
U; AU, = 8284. 


The group is unique,’ it contains 2p + 1 abelian subgroups of order p°*?. 
If Bs = ys = 0 and a;5£ 0, then the rank of M’ is 3 unless a, = 0; and 
if a, —0 the rank is 2 unless l, — m, = 0. Therefore U; is the only operator 
of G which reduces the rank of W’ to 1. The only abelian subgroups of order 
pe? of G correspond to subgroups of order p? of U which contain U;. Hence 


"Cf. the preceding reference, p. 495, Theorem 5. 1. 
7 Cf. loo. cit., p. 510. This is the group of order p™* with K of order p? and one 
subgroup of type 1. 
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G contains p-+ 1 such subgroups, which distinguishes G from the groups 
which have been discussed. These p+ 1 abelian subgroups of order p°? are 
all contained in a non-abelian subgroup of order p°**, generated by C, Ui, Us, 
and Us, which will distinguish it from another group containing p +1 
abelian subgroups of order p*? which will follow. In the present case an 
obvious change of generators will make a, = 1, which shows that any two . 
such groups are simply isomorphic. G is not in the holomorph of any of its 
` abelian subgroups since all of its abelian subgroups are in H or in {C, U} 
and none of the latter contains both UV, and .U3. 

For none of the remaining groups with c = 3 can, V, be selected to make 
the rank of M smaller than 2. We consider those for which the rank of M 
` may be reduced to 2. For these we may assume a, = 8, == y, = 0. Then 
both U, and U, reduce the rank of M to 2. If there exists a V, not in {U;, Ua} 
which also reduces the rank to 2, the corresponding Va will not be in {U:, U2} 
for otherwise V, would reduce the rank to 1. Every operator of {Vi, V2} will 
reduce the rank to 2. Hence there exists a V, with m, =0 and not in 
{U,, U2} which reduces the rank to 2. Under these conditions M takes the form: : 


—k a 0 gk, + asl, 
MY” =m | — h 0 Ay Beky + Bal: . 
—0 0 0 a + yok: + Yalı 
Now V;, being not in {U:, U2}, must have a, 40. Such a V, exists only if 
not both y, and ys are zero. Hence if y: =y = 0, @ contains but one abelian 


subgroup of order p™*?. The condition that no operator of {U:, U2} reduces 
the rank to 1 requires that (a2.8;—- 82) be different from zero which implies 


`, that (Bs — a2)? + 4a382 be not a square. The existence of such a group is 


obvious; we omit for the moment consideration of the question of uniqueness. 
If not both y: and ya are zero, we may suppose that yz5£0. Then — ye, 1, 0,0 
determines an operator V, which reduces the rank of M” to 2. In this case 
G contains at least two abelian subgroups of order p°*?, and since the rank of 
M cannot be made 1 the corresponding Va is not in {U:,U2, Vi}. Hence 
such a group, if it exists, is generated by the two abelian subgroups {C, Ua, U2} 
and {C, Vi, Va}. It is therefore a subgroup of the holomorph of either. It is 
identified as the group with commutator subgroup of order p* and no sub- 
group of Type 1. The existence is established by showing that the group 
described in the paper referred to contains a maximal abelian subgroup of 
order p°™. It contains p + T abelian subgroups of order p**. Since it is 
generated by two of these abelian subgroups it is distinguished from the group 


5 Cf. Loc. cit. 
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. with p-+ 1 abelian subgroups of order pes all contained in a subgroup of 
order p%**. 

We shall now see that the next to the last group is a determined | 
by the property of having one abelian subgroup of order p™?. This group is : 
obviously not in the holomorph of any of its abelian subgroups, since it con- 

_ tains no abelian subgroup of order p% and but one of order p*?. We then 
fix attention on the characteristic subgroup {C, U1, Us} which we denote by 
H’. With this change of notation 


Si = U, 8/2 = U2, U = 87, U’; = U;7, Sa = 81, ga = 82, 5 cme $3 
we have the following relations satisfied : 
Ds Um dite, UL Ur os (99), 
0718.0", = 08" 4, USU = #3 (8 9229’ e), 
U's U’ U's var U':55. 


The commutator subgroup arising from transformation of H’ by {U”,, U's} 
is of order p?. U’, and U’: determine two permutable operators in the group 
of isomorphisms of H’ which with H’ give the particular subgroup ° of the 
holomorph of H’ which has no subgroup of Type 1. A. choice of generators 
to give the canonical form of generating relations of this subgroup of the 
holomorph of H’ gives a canonical form for generating relations of G. This 
form is the above set with &: = 8: = 0, B2==—1, a, = — r, where r is a 
particular not-square. The possibility of doing this is a consequence of the 
` fact that (8; — a2)* + 48: is not a ‘square. The canonical form contains 
no arbitrary constants and hence the group is uniquely defined by the fact 
that it contains just one abelian subgroup of order p%?. i 
For arbitrary a, 8, y there exists V, such.that the rank of M reduces to 2, 
when the corresponding 4, kı, h, Mm, are substituted. A proof of this will 
establish the fact that every group G with c = 3 contains at least one abelian 
subgroup of order p*?, and hence is one of the five groups determined above. - 
If there exists a V, which reduces the rank of M to 2, there exists one 
with m, —0. Suppose that a, = 0 also for this Vi. The condition that such 
a V, exist is that F (k, L) = 0, where F is 


(Biy2— Bayı) ki? + (Brya — Bai — ve + aay1) krh + (ayı — Grys) L”. 


There exist quantities a, £, y such that this congruence is irreducible; we may 
suppose that such is the case, for otherwise we have the existence of the required 
V,. Hence we may assume that a,540. When mÆ 0, the first column of M 
is expressible linearly in terms of the last three columns, and hence the rank 


° Cf. loc. cit., p. 510 and p. 500. 
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of M is the same as the rank of the matrix composed of the last three columns., 
The determinant of this matrix, with m, = 0, is 


m[a? + {(Bs + ve) ki + (ys — m)h}a + F (k, 1,)]. 


“Thd quadratic factor can always be made zero by a proper choice of a, ki, li, | 
and since F(&:,1,) has no linear factors a, will not be zero. This completes 
the proof. 

We give below a table of the groups for k < 4. For given k and c the 
different groups are arranged in the order of their appearance in this paper. 
k c number of abelian subgroups 
of order p°*?, 
D Bo 
0 
1 
0 
1,0 
0,p +1,2,1 
pP+pt+i, 2p+1 p+1*, Lp+1* 


* These two groups are distinguished by the fact that the first contains an 
abelian subgroup of order p°*° and the other does not. 


1 


wo 
Oran wor 


3. A geometric description of the groups for k—3. It has been. 
convenient in the preceding pages to single out the maximal abelian subgroup 
H, and consequently to distinguish. between s and the other generators. In 
the sequel we shall drop this distinction and consider the same groups gen- 
erated by the operators U;, U2, Us, and U,. Every such group is defined by 
a set of relations on the U,’s. Each set contains six relations which define 
commutators of pairs of Ups. If there are no more relations, aside from those 
expressing permutability of the commutators, that is, if the six commutators 
are independent, then the group G is the master group described in Section.1. 
Any other group generated by four U,’s is defined by additional relations 
among these six commutators and hence is a quotient group of this group of 
order p!° with respect to some subgroup of the commutator subgroup. The 
existence of two kinds of group with c = 5 shows that the commutator sub- 
group of G, of order p°, contains two kinds of cyclic subgroup. Distinguishing 
properties of these two groups with c= 5 are that one contains an abelian 
subgroup of order p°? and the other does not. . G itself contains no abelian 
subgroup of order p™? and consequently if the group with c = 5 contains 
such a subgroup the process of taking the quotient group introduces permuta- 
bility among operators of the form U,U,%U,"U,"" where none existed in G. 
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This means that the cyclic group which is set equal to identity contains com- 
mutators. In the other case for c = 5 the cyclic group contains no commutator 
except identity. We therefore examine the question of commutators and 
non-commutators in the commutator subgroup of the master group G. 

The group G is defined by the relations: 


UUU; = Vary, . (<j, j =32, 3,4), 
TijTkl == Tkif'tj, (t, Í k, l = 1, 2, 3, 4) . 


If the r1°8 are ordered then every operator of the commutator subgroup of G 
is determined by the set of exponents of the 7:78 in the expression for it; two 
operators belong to the same cyclic group if and only if their sets of exponents 
are linearly dependent. The set (0,0,---,0) corresponds to identity. Hence, 
if the sets of exponents are taken to be the’ codrdinates of points in a finite 
projective space À of 5 dimensions, every point in R will correspond to a cyclic 
subgroup of the commutator subgroup of G. We determine the condition that 
the point @ == (a, &,°:-,@) which corresponds to 112% 1139? 114% Tas™ Taa roue, 
represent a cyclic subgroup which contains a commutator.° If a represents 
a commutator then there exist two operators 


Vi UU U a 4 
Po V,U 0 Un 


which have the element corresponding to @ for their commutator. 


Var V2ViV 2 pose fy g Ma Path, Osa + + + org Pair dits, 


This is exactly the problem of the Pliicker line codrdinates in a projective 
three-space. We then have the following theorem: 


A point a in the space R corresponds to a commutator if and only tf it 
hes on the four-dimensional spread S defined by 


Qio — Bohs + Oya, = 0. 


A point in the space R corresponds to a cyclic subgroup in the commu- 
tator subgroup K of G; a line in R, being the set of points linearly dependent 
on two points, corresponds to a subgroup of order p? in K; and a plane corre- 
sponds to a subgroup of order p° in K. The effect of taking the elements of 
a particular subgroup of K to be identity, so far as abelian subgroups of order 
p°? of the resulting quotient group are concerned, depends on the relation of 
the corresponding point, line, or plane to the quadratic spread S. If the point, 
line, or plane has a point in common with S, the resulting quotient group will 


19 Since Vr’ èV, = Vir, if r is the commutator of F, and V;, then every element 
of a cyclic group is a commutator or else none (except identity) is a commutator. 


10 
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contain two permutable elements V, and V2 which with C give an abelian 
subgroup of order p**. 

The points of R constitute two classes with. respect to 8, a point being 
either on S or not on 8. If the point is not on § the corresponding quotient 
group contains no abelian subgroup of order p**. The fact that there is but 
one such group for c= 5 means that all: points not on S are alike. The 
canonical form obtained for generating relations of this group in Section 2 
has the commutator of U, and Us the same as that of U, and Uy. Thus. by 
_ putting equal to identity the elements of a cyclic group corresponding to a 
point not on S, a group of order p? determined by two points on J is reduced 
to a cyclic group of order p. Hence every point of RÈ not on si is on a line 
joining two points of £. 

A line in À may have no points on S; it may have one point on S; it may 
have two points on 9; or it may lie wholly on S. These possibilities corre- 
spond to the groups with c= 4 with 0, 1,2,p +1 abelian subgroups of 
order p°*. 

A plane in À has at least one point in common with S. It may cut S-in 
one point, in a line, in a proper conic, in two lines, or it may lie wholly in J. 
These possibilities correspond to the groups with c = 3 and 1,p + 1, p +1, 
2p + 1, and p? + p + 1 abelian subgroups of order p°*? respectively. 

Each of the last three groups is a subgroup of the holomorph of one of 
its ‘abelian subgroups. A geometric criterion for this possibility involves a 
consideration of the three-space (21, To; Tay ta). Let us consider the case 
c—3. A particular group is determined by a plane in Æ which has a certain 
set of points on S. Each point on 8 determines a line in X. ‘Each line in X 
is determined by two'of its points. Two skew lines in X will be determined - 
by four points in terms of which every point of X can be expressed linearly: 
If two lines in X are not skew, then the line joining the corresponding points 
in È lies wholly on S.** Thus in the case of the third and: fourth groups 
above, where the plane cuts J in a proper conic and in a conic consisting of a 
pair of lines, it is possible to select two points on the intersection which 
represent skew lines in X. In those two cases the groups are subgroups of the 
holomorph of the abelian group of order p%?. For the first two cases it is not 
possible to make such a selection. A plane wholly on S determines a set of 
lines in XY which are also on a plane. The corresponding points on X repre- 
sent an abelian group of order p** in G. Any metabelian group with operators 
all of order p which contains an abelian subgroup of index p is in the holo- 


u Cf. for example, Veblen and Young, Projective Geometry, vol. I (1910), p. 329, 
Theorem 30. 7 
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* morph of that abelian subgroup. For c = 4 the group is determined by a line 
in R. The only one of these groups which is in the holomorph of one of its 
abelian. subgroups corresponds to a line in R which has two and only two 
points on 8. In the case where the line is wholly on S each of its points 
determines a line in X but all of these lines in X are in a plane. Since not 
every line in this plane in X is represented by a point on the intersection of S 
with the given line in R, this plane in X corresponds to a non-abelian subgroup | 
of order p°*? in the given group. 

In general, for an arbitrary number & of generators, the condition that 
a group @’be in the holomorph of one of its abelian subgroups may be stated’ 
in geometric form. - If @ is in the holomorph of one of its abelian subgroups, 
then the points of the space X can be expressed as linear combinations of 
points of two of its subspaces each of which determines an abelian subgroup 
of G’. An abelian subgroup of order p°* in @’ determines a (k, — 1)-space 
in X all of whose lines determine points on the intersection of S with the 
m-space in À which determines G” as a quotient group of the master group G. 

The geometric aspect of the solution of the problem of classification of 
metabelian groups with & independent generators and elements all of order p 
. is now clear. It involves the extension of the theory of Plücker line codrdinates 
to a space of & — 1 dimensions. These codrdinates determine a space R of y— 1 
dimensions, y==k(k—1)/2. In R the points which correspond to commu- 
tators are on a subspace S of 2k — 4 dimensions defined by (k —2) (k—8)/2° 
quadratic congruences, the conditions that a point of R represent a line of X. 
It then involves the determination of the relations of points, lines, planes, 
three-spaces, etc. in R to the subspace S. These relations determine the possi- 
ble types of quotient groups of G. In determining the types of relations to 9 
of flat m-spaces in R it is necessary to.separate the m-spaces of R into classes, 
all the members of a class being conjugate under the group of collineations 
of R which leave 9 invariant? This group is closely connected with the 
group of collineations in À. Of course the transformations are “ rational ” 
end the geometry is finite. 


12 It is this part of the problem that made necessary most of detail of section 2. 


AN EXTENSION OF ANALYTIC FUNCTIONS TO MATRICES.* 
By R. W. WAGNER. 


The analytic functions of a complex variable have many interesting 
properties. The property of analytic continuation is made the basis for this 
extension of such functions to matrices. The extended function is then a 
mapping of a subset of the matrix space on to another subset of the same 
space. The procedure followed here is to replace the complex variable in a 
power series by a variable matrix, show that the resulting matrix function can 
be reduced. to the original function of several complex variables, and apply 
the process of analytic continuation to each variable. The most interesting 
results of this paper concern the singularities of the extended function which 
are introduced by the extension. The last part of the paper shows how this 
approach may be applied to the solution of certain matrix equations. 

The notational scheme is as follows: Capital script letters indicate 
matrices, small letters indicate ordinary complex numbers, and subscripts are 
used for enumerative purposes. 


1. Let M denote the matrix space, the space of all square matrices of n 
rows whose elements are complex numbers. One can make this a metric space 
by defining the absolute value of a matrix and then defining the distance from 
X to F to be the absolute value of X — F. The absolute value of XY will be 
taken to be? 


| X | = Vtr XX’ = È tutuy. 


A similarity transformation applied to M is just a change of codrdinates 
in Mm. Unfortunately, such a transformation does not leave the distance 
invariant, but it is a homeomorphic transformation of Mm. Therefore, limiting 
relations will be independent of the codrdinate system, and it will be per- 
missible to use the most convenient codrdinate system for investigating these 
limits. 

It is convenient to distinguish several subsets of M for future reference. 


I. (x), the set of matrices which have a for a characteristic root. 


* Received November 28, 1938; Revised August 19, 1939. 
1Compare with Wedderburn, [1], page 125. The numbers in square brackets refer 
to the bibliography. 
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fe (x) has dimensionality two less than m. For it is defined by the complex 
equation 
det (X — x) — 0. 
II. , the set of matrices whose characteristic equation has distinct 
and simple roots. 

III. 9, the set of matrices whose reduced characteristic equation is of 
lower degree than the characteristic equation. 

IV. &, the complement of D in M— MN. 

The set D + $ is also (2n? —2)-dimensional. For the codrdinates of 
the matrices which belong to it satisfy the complex equation obtained by 
setting the discriminant of the characteristic equation equal to zero. ‘These 
equations define an algebraic locus, so that the following topological theorem 
is valid. 

THEOREM 1.1. ‘Any matrix of & or D can be approached by matrices 
which belong to N. 

2. Corresponding to each elementary divisor of X is a pair of matrices 
called partial idem-potent and nil-potent elements of X. If X does not belong 
to D, these matrices are uniquely defined. If X belongs to D, they can be 
found, but not uniquely. In case they are unique, they are called principal 
idem-potent and nil-potent elements of X.? 

If X belongs to N and has the characteristic equation 

g(t) = (@—~A,) (2 — àa): > > (U—An) = 0, 
then the idem-potent elements of X are the matrices ` 
(2.1) Pi(X) = (X — M) + (X -= ħa) (Z — ia) o (An). 


The nil-potent elements are all zero in this case. 
But in any case the partial idem-potent elements, P;, and the partial 
nil-potent elements, Qi, satisfy the equations 


PiP; == PP = iP 
(2. 2) j P:Q; = Q;P; = åQ; 
QiQs = Qi = bQ’. 


In addition, the important identity, 
La 
(2.3) X= z (Pi + Qi) 
3 =i 
is also true, v being the number of elementary divisors. 


*See [1], pages 27-29, 42. Also [2]. 
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Let f(z) be an analytic function of the complex variable z. Itis assumed 

. that everything is known about the function f(x), so,that the only problem is - 

~ to extend the function to M. Assume that the origin is a regular point of 

f(z) and that it is represented in a neighborhood of the origin, | «|< c, 
by the power series 


as 
px) = Sage. 
k=0 
„ +-The expression 


(2. 4) $(X) = Sax 


defines a mapping of the part of Mm for which the right member converges on 
. toa subset of M. Therefore it is considered as a part of the extended function. 


THROREM 2.1. (X) is an absolutely continuous function of X on 
{|X| <e. 

The proof of this theorem is exactly the same as that for the corre- 
sponding theorem in function theory. Replacing X by its absolute value 
reduces 2.4 to a power series in | X |. l 


Tagorex 2.2. If |X| <e, then it is true that | 
-> ott 1 
$2) = 5 [sanr + Zor aners |. 


The first step of the proof is to show that each characteristic root of X is, 
in absolute value, less than | X |. X transforms the unit sphere in a vector 
space of n dimensions into an ellipsoid in this space. The absolute value 
which has been chosen is the square root of the sum of the squares of the 

. principal semi-axes of this ellipsoid. Corresponding to each characteristic 
root, Ax, there is a vector, vs, such that Xos == Aste. Therefore | As | is less 
than the semi-major axis of this ellipsoid, and thus [M |<|X|. > 

If X is such that | X | <c’ < c, for any « > 0 there exists an m (which 
we take greater than n) such that for [A | < c 


(2.5) | @ (A) — Sapp(p—1)- + = (po + 1) | <e 
p 
l [o == (0,1,:--,7)] 
and : : 
(2.6) | #(2) — È ax) <e | 


From (2.2) and (2.3) one gets 


por eo  (p— o)! 


L 


(2.7) Žar- 3 ss ap 2 are + Qa Pr 


* 
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But Qs is nil-potent. Therefore, powers of Q: higher than n may be omitted. 
Thus, by combining (3:5) and (2.7), one gets 


i 


| à as XP — È =; = Lgo (Ax) PQr | < (n+ De 


Combining this with (2. 6) leads to 


| $(X) -$ À O (Ae) PQ | < (n +2)e é 


Since € can be taken arbitrarily small, the theorem is proved. 

The statement of this theorem is an identity on the region |X |< c. 
The analytic continuation in M is based on this identity. The region on. 
which f(X) is defined is extended by assuming that a similar relation is valid 
in M. | 


DEFINITION. If X has the partial idem-potent and nil-potent elements 
Py and Qu associated with the roots Ax, the matrices of the form 


(2. 8) | 5 Oe TO Qu) Paar 


will be considered as values of the function, —images of X. Moreover, if Y - 
approaches X, any limit of f(Y) is to be admitted as a value of f(X). 


The above definition reduces the matrix function to a single function of 
n independent variables on N. But the matrix X can be changed in two 
distinct ways. One can change the characteristic roots, or he can change the 
idem-potent elements, Px. The equation (2.8) shows that, on 71, the Px 
are not changed in passing from the argument to the value. 


THEOREM 2.3. If X belongs to N, and if each À is a regular point of 
f(x), all values of f(X) are gwen by (2. 8).. 


If X belongs to N, the Pr are given by (2.1). The-Ax are continuous 
functions of the codrdinates, so that the same applies to the Py. By hypothesis, 
f(x) is continuous in! the neighborhood of each A4. Therefore no limiting 
process can produce limits not of the form of (2.8). 


THEOREM 2. 4. If W is noninar f(WOXW) = WY (X)W. 


Proof. It is easy to verify that the similarity transformation can be. 
applied to the power series (2.4) and to the matrices of the form (2.8). 
‘ All values are obtained from these, or by applying limiting processes to such. 
values. The similarity penerormialion is continuous. Hence the theorem is 
MAR " 
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_ THeoren 2.5. If X has an elementary divisor of degree m associated 
with À and if À is such that the first, second, + + (r—1)-th derivatives of 
f(x) vanish for z =A, but f(A) 40, then f(X) will have s elementary 
divisors associated with f(A), where s is the smaller of r and m. These 
clementary divisors will be of degree [m/r] or [m/r] + 1. 


The proof of this theorem depends upon the identity (2.8) and the ` 
consideration of the rank of powers of the Qs associated with this elementary .. 
divisor in X. The details are omitted. It was stated above that on Tl the 
elementary divisors were changed only in the root associated with them in 
passing from. the argument to the variable. This theorem states that on & - 
an elementary divisor of the argument may be broken up by passing to the 
value of the function. Other changes will appear later. 


3. This section is devoted to a discussion of the singularities of the 
extended function. It will appear that the extended function reflects the 
singularities inherent in the function and that the extension introduces some 
singularities if the original function is multiple-valued. 

If f(X) is single-valued in a neighborhood of X = A, but discontinuous 
at A, A is called a singular point. A is called a pole of the function if the 
limit of f(X) is not finite no matter how XY approaches A. - 

If f(X) is multiple-valued, the point A will be called a branch point of 
f(X) if the number of values of the function is different for the point A and 
for points in every neighborhood of À. 


THEOREM 3.1. The matrices of & are singular points of f(X) if, and 
only tf, f(x) is mulitple-valued. These points ara poles of some branches of 
f(x). 

In view of Theorem 2. 4, it is permissible to give the proof in the most 
convenient coôrdinate system. Another simplification is accomplished by using 
matrices with only the essential parts appearing. Since the elementary 
divisors enter into the function independently, additional elementary divisors 
may be added later. | 
© Let Y =M +J, where À is a regular point of f(x) and J is the matrix 
all of whose elements are zero except those in the diagonal above the main 
one, which have the value one. Let X[h] denote the matrix 


À 1 D” xs 0 . 0 
O Ath 1 >’ 0 0 
0 O A+2%--- 0 0 
>- X[k] = oe 
‘ 0 0 0 st A+(m—2)h 1 
0 0 0 0 . A+ (n—1)A 
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The plan is to compute PEN, its limit tION, f(¥) and to compare 
the last two matrices. 

Note that X¥[h] is a matrix in N. Therefore its principal idem-potent 
elements are given by (2.1). Making this computation, one finds that the 
elements of Pr associated with the root À + (k—1)4 of X[k] are either 0 or 


Pra[h] = it ‘ a Le ome rskSs. 
Putting these into (2.8) one gets 


f(X[h]) = > Pilà + (6 — 1h] 











way 


where ¢;, denote the various branches of f(x) in the neighborhood of A. The 
above can also be written 





sun =| ST) nent nul. 


k=r 

In case all the by, are the same, the terms added represent the (s—r)-th 
difference of ¢; with respect to the increment h. Because the limit of the 
ratio of the r-th difference of a function to the r-th power of the interval is 
the v-th derivative, one gets in this case 


(3.1) f[X(0)] = 


When f(x) is single-valued, this reduction is possible. 
But in case the $j, are not all the same, one of the elements of f(X[h]) 
with s =r +- 1, namely 














Goal oer (A) 


= Len (à + rt) — eA + (r-—1)4)], 


has different values of f(x) in the numerator. Hence, in this case, one gets 
F(X[0]) = ©. 

To complete the proof of the theorem, it is necessary to show that similar 
limits are obtained by using other paths of approach. It was assumed that A 
is a regular point of f(x). Therefore, changing the manner in which the 
roots of [hk] become equal can have no effect on the limits as long as the 
path lies in N. The path can also be deformed in this way: let V[h] be a 
non-singular-matrix valued function of À which is continuous for O=h < 1. 
Then VÆV-1 is a continuous function of h. In order for this matrix to 
approach Y it is sufficient (and necessary) that V[0]Y =F V[0]. But, by 
Theorem 2.4, one has f(VXV) =Vf(X)V for h different from zero. 
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Also, a similarity transformation is continuous. Therefore, the limit of 
f(VXV") —V[0]f(X[0]) V[0]+. In case f(X[0]) is finite, equation (3.1) 
shows that there is a polynomial such that p(Y) —f(X[0]). Therefore, in 
this case V[0] commutes with f(X[0]), and the limit exists independent of : 
the path in N. On the other hand, if f(X[0]) is not finite, the similarity 
transformation cannot change this property. Therefore since any path in A 
can be obtained from the original path by a combination of the above dis- 
tortions, all paths in Ÿ lead to the same limit. 

It can be proved by induction that paths in & lead to values of the type 
(2.8) for arguments in & also. The first step is the above proof for two 
roots becoming equal. The inductive step can be carried out by using an 
approximation involving matricos in N. Let Zm be a sequence of matrices 
with elementary divisors of degrees less than n. Choose Xm so that f(Xm) 
approximates f(Zm) within an amount e/m. Then the limit of f(Xm) is the 
same as the limit of f(Zm). Using the above result concerning the limit of 
f(Xm), one arrives at the theorem. i 


. THEOREM 3.2. The points of D are singular points of f(X) if, and 
only if, f(x) is multiple-valued. The singularily is of this nature: if X 
approaches a point of D along some path the limit of f(X) exists but depends 
upon the path. | 


As before, let X[h] be a point in N + $ and let its limit, X[0], be a 
point of D. Let $; denote various branches of f(x) in the neighborhood of 
À. Then a value of f(X) can be written in the form 


(3.2) (ZRI =$ pna HEP À file) Pe 
Let X[0] have the form 
Mol ÈP + S hPe 


k=r+1 


Then, on applying (2.8), a value of f(X[0]) has the form 
(3. 3) $n (Ar) B Pet 2 fx) Pe 


Note that, if the ¢), are not identical, the limit of f(X[A]) is not the ex- 
pression in (3.3). However, if the j, are the same (necessarily true for a 
single-valued function), the limit of f(X[h]) is given by (3.3). Thus the 
points of D are singular points of the matrix function. 

Now let U be a matrix which commutes with X[0] but not with the 
individual Pr, (k = 1,2,---,1). Moreover, U can be chosen so that it will 


` 
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commute with a linear combination of these Pr only if the coefficients are all . 
the same number. A similarity transformation is continuous. Therefore 
one gets | an l 
lim f(UXTR]U +) = U lim f(X[h]) U=. 
k0, . k0 ` 


Tn case the ¢;, are not all the same, the coefficients of the Pz will not all be 
the same number, and U cannot commute with the limit of f(X[h]). But 
notice that X and UXU- approach X[0] along different paths. Therefore 
the theorem is established. 


COROLLARY. If f(x) is multiple-valued, and if X is a point of D, the 
matrices admitted as values of f(X) are the transforms of the values of form 
(2.8) by the group of non-singular matrices commutative with X. 


THEOREM 3.3. Unless X belongs to D, all values 4 f(X) are of the 
form (2.8). | 

This result is a combination of Tiei 2. 3, the proof of Theorem 3. 1, 
_ and the définition of the sets N, &, and D. The corollary describes the 
situation otherwise. 

The above discussion concerns the singularities which arise from the 
extension of functions to matrices. The following theorems concern the 
inherent singularities of the function. 


THEOREM 3. 4. Ifz—atisa singular point of f(x), the points of R (a) 
are singular points of f(X). 


This theorem is important because it states that the point singularity of 
f(x) is exploded into a (2n? — 2)-dimensional singularity for f(X). The 
possibility of carrying the variable “around” a’ singularity is preserved. 
The values obtained by a limiting process applied to (3.2) can also be obtained 
by carrying a value (3.3)! around the proper branch locus of f(X), keeping 
the argument in D. 


‘THEOREM 3.6. If X has an elementary divisor of degree r associated 
with À, and if f(A) = œ for some s less than r, then f(X) = œ. 


This theorem is proved by substituting into (2. 8); and then applying 
Theorem 3. 3 and the corollary. This theorem can be applied to show why a 
nil-potent matrix with two rows has no square root. Such a matrix js a 
singular point of the function. ; 


4 Let F(X) be defined to be any matrix which satisfies the equation 


(4.1) p(l(X)) = Ê REP. 


- oy 
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In this section, the function F(X) defined here will be compared with the 
corresponding function f(X) defined in Section two. It will appear that 
f(X) is identical with the primitive solutions of (4.1). The primitive 
solutions are those solutions of (4.1) which are not solutions of both (4.1) 
and a lower degree equation. In making this comparison, certain results of 
Roth [3] and of Franklin [4] concerning the function F(X) will be used. 


THEOREM 4.1. If f(z) is defined by the equation 


p(f(z)) =z, 
then the matrices f(X) defined in Section two are solutions of 
(4.2) p(f(X)) = X. 


Proof. The values of the form (2.8) satisfy (4.2). Also the values of 
the form of the corollary of Theorem 3.2 satisfy (4.2). All values of f(X) 
are of one of these types or limits of these types. The operations of addition 
and multiplication are continuous. Therefore, any limit of matrices which 
satisfy (4.2) will also satisfy (4.2). 

In order to prove the converse relationship, it will be convenient to trans- 
late some of Franklin’s results into the language of this paper. The solutions, 
F(X), of (4.1) can be put into three classes: 


A. Solutions which are in turn polynomials in X. 


B. Solutions similar to one of form (2.8) by a matrix commutative 
with X. 


C. All others. These solutions must have elementary divisors whose 
degrees differ from the degrees of the divisors of X. 


Roth showed that all solutions are of Type A unless X belongs to D. 
Franklin showed that, in general, solutions of Type B exist whenever X is 
derogatory. Furthermore, he showed that solutions of Type C exist only if 
X is a point of D and if X has a root A, such that p’(f(A)) = 0, associated 
with several elementary divisors.. ` 

The solutions of Type A are given by (2.8). Solutions of Type B are 
found in the corollary to Theorem 3.2. Hence it remains to show that the 
solutions of Type C can be obtained by a limiting process applied to values 
of the form of (2.8) or of the corollary. 

The condition p’ (f(A) ) — 0 is equivalent to the condition that f’ (à) = œ. 
Therefore, solutions of Type C can exist only when X has a root which is a 
branch point of f(x) associated with several elementary divisors. 

The branch point of the function æ/" is typical of any branch point of 
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order m— 1. Therefore, the branch point of this function will be investigated 
instead of the various branch points of the more general function. Let X[h] 
be the matrix 








hls + ds - > 0 Oo: Bee 0 | 
0 > + ki tds 0 siti 0 
AS oy Es dr Eddie g 
ee | 
0 eee 0 0 OERE 
where J, is the identity matrix of r rows, s =r + 1, and Jr is the matrix of 
r rows with all elements zero except for 1’s in the diagonal above the main one. 


Let Y[h] have the form 


whl, Is +++ 0 0 

0 wb, >> 0 0 -:- 0 
Y[h] = 0 -wbl, E, --- 0 

0 0 +++ 0 wb 0 

E, 0 see 0 0 ns ris wb, 














where w is a primitive m-th root of unity, b is an m-th root of h, and F, and 
E, are matrices of the form 





fay 00 01000 
0 1 0 0 eee 

ae ee | BI an 
ne lo oo 0% 
[ooo 0 


By a direct expansion it is possible to verify that the m-th power of Y[h] 
is X[h]. The blocks along the diagonal come out very simply, and the 
coefficients of the other blocks are sums of powers of w which are zero. Since 
YTh] is a solution of the equation Y" = X it is of Type B for all values of 
h different from zero. Moreover, the equation connecting XY and F will be 
valid in the limit. The rank of Y[0] is n— m, but the rank of Y[0] is 
n— 1. Therefore ¥[0] is an m-th root of Type C. Thus, by changing the 
equation slightly, a solution of Type C was obtained as a limit of solutions 
of Type B. 

X[h] is a matrix in D. So, for each h, there are continua of matrices 
which satisfy the equation. One of them was selected and called [hk]. Note 


` 


% 
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that Y[}] is not in D. The limit of Y[h] is in 4. In taking this limit 


two things vary; both the continuum from which Y[h] was selected and the 
relative position of Y[h] in the continuum. 


THEOREM 4.2. If À is a branch point of f(x) of order m— 1, f(A) 
finite, and if X has m elementary divisors associated with À (k of them of 
degree r +- 1 and the rest of degree r), then there exists a value of f(X) in 
which f(A) ts associated with an elementary divisor of degree .mr + k. 


The ibose theorem states that some of the solutions of Type C can be 
obtained as values of the matrix function f(X). But the only solutions so 
obtained are those which utilize the complete symmetrÿ of all the values of 
f() which merge at the branch point. Hence the above process can lead 
only to primitive solutions. However, the non-primitive solutions are primi- 
tive solutions of a lower degree equation. Therefore, if the above ‘process 
yields all primitive solutions, it can be used to get all solutions. í 

Recall that X is a function, namely a polynomial, of f(X). From this 
view-point, Theorem 2.-5 states that all primitive solutions will have the form 
specified in the hypothesis of Theorem 5.1. Each continuum of values of 
f(X[h]) leads to a continuum of values of Type C for f(X[0]). The various 
primitive solutions of Type C differ only in the values of f(A) associated with 
the elementary divisors. But one can achieve this same result by properly 
choosing the continuum from which Y[h] is chosen. Therefore, any, solution 
of Type C is a value of f(X) as defined in Section two. 


Tusonm 4.3. The function f(X) ts the same as the function defined 
as the POENE solutions of CE. 1). 
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LINEAR DIFFERENTIAL INVARIANCE UNDER AN OPERATOR 
RELATED TO THE LAPLACE TRANSFORMATION * : 


By Earr D. RAINVILLE. 


1. Introduction. The Laplace integral transformation ? 


(1) BR) = f RO=), 


is one which associates with each function F(t) of sufficient regularity another 
function f(s). Elementary known ® properties of the operator ¥ include 





(2) gl a | — s"f (8) -3 — (i). 
and 

| 2 . dE ie 
(3) Sf {RF (t)} = (—1)* a f(s). 


The Laplace transformation has important applications * to the solution 
of boundary value problems in ordinary and partial linear differential equa- 
tions. The operator £ often transforms one differential equation into another 
which is more readily solved, one which, indeed, may even be algebraic. The 
transformed equation may be of higher order, or otherwise more complicated, 
than the original. Finally, we see that many equations do not change form 
in any essential way when subjected to the operator £. One such equation is 


OF | pe 





* Received August 15, 1939. 

1 Presented to the Society Nov. 26, 1938 under a slightly different title. 

3 For an extensive treatment of this transformation see G. Doetsch, Theorie und 
Anwendung der Laplace-Transformatton, Berlin, 1937. 

3 Enzo Levi has shown that the case n = 1 of equation (2) above, together with 
certain conditions on F(t) and its transform is sufficient to characterize the operator 
completely. For the precise result see his paper, “ Proprieta caratteristiche della tras- 
formazione di Laplace,” Rend. Aocad. Lincei, (6), vol. 24 (1936), pp. 422-426. 

t See, for example, R. V. Churchill, “ The solution of linear boundary value prob- 
lems in physics by means of the Laplace transformation”: I, Mathematisohe Annalen, 
vol. 114 (1937), pp. 691-613; II, Mathematische Annalen, vol. 115 (1938), pp. 720-739. 
See also his paper, “On the problem of temperatures in a non-homogeneous bar with 
discontinuous initial temperatures,” American Journal of Mathematics, vol. 61 (1939), 
pp. 651-664, in which the Laplace transformation is used to establish a uniqueness 
theorem. 
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for which the transformed equation is 


(5) DE + fms + (TE) 


i=0 


as may be seen from (2) and (3). 

Our fundamental problem is suggested by the fact that (4) is essentially 
invariant under £. In order to use only those properties of £ which are 
concerned in that invariance we introduce another operator o and study o 
instead of &. 


DEFINITION 1. Let D == d/dx be the usual symbol for differentiation 
with respect to v; let D° = 1. Then any polynomial in D and + will be called 
a linear differential operator of type P. 


DEFINITION 2. Let k,n, ka na; s—1,2,: : -, be non-negative integers. 
We define ° o as a linear operator on linear differential operators of type P by 
(6) ot D* = (— 1)*D*ar, 
and | 
(7) | of £ aD] = $ uo (at D), 
a 8 


where the a, are any constants. 

It should be noted that by D*2" we mean that differential operator which, 
acting upon a function F, yields the k-th derivative of the product 2” F. 

It is of some value to keep in mind one aspect of the nature of o. Con- 
sider a given function f(x), taken to be single valued for the present purpose. 
We may associate with f(x) an operator f which transforms each number z 
of a certain set of numbers into another number f(x) in another, or the same, 
set of numbers. We call f an operator of class one. Next consider D. The 
operator D transforms each function of a certain set of functions into another 
function of v. Further, if D operate on numbers, the result is trivial; i. e., 
D transforms every number into the same number, zero. Hence, we call D 
an operator of class two, noting that in.a sense D must operate on operators 
of class one to give non-trivial results. Now consider e. This operator is 
defined above in such a way that it transforms each linear differential operator 
into another linear differential operator. Essentially o needs to operate on 
operators of class two to give non-trivial results. We call o an operator of 
class three. 

An adjoint operator may be defined such that it changes a linear dif- 


5 Essentially this definition is to be found in 8. Pincherle and U. Amaldi, Opera- 
gioni distributive, Bologna, 1901, p. 361. 
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ferential operator, not necessarily of type P, into its adjoint linear differential 
operator. This adjoint operator ® is of class three in the above sense. 


2. Results. Some useful, not all new, properties of o are obtained. 
Two linear bases are found for the set of linear differential operators invariant 
under o. Two invariant second order differential operators are found to form 
a fundamental system of invariant operators; i. e., any invariant operator may 
be expressed as a polynomial in these two operators. A linear basis is ex- 
hibited for what are called ¢-variants (Definition 3) with respect to o. 

Linear operational equations in o are completely solved in the case of 
constant coefficients. This is done with the aid of two theorems on the repre- 
sentation of linear differential operators in terms of ¢-variants or of invariants 
and pseudo-invariants. The same tools are useful in the solution of linear 
operational equations in o with variable (linear differential operational) 
coefficients, as is demonstrated in the example worked out in Section 10. 

In Section 9 certain results are specialized to yield a classification of the 
differential equations, such as (4) above, invariant under øo. 


3. Preliminary definitions and lemmas. Since the only linear dif- 
ferential operators to enter this study are of type P, we shall often hereafter 
omit mention of this restriction. 


DEFINITION 8. If y is a linear differential operator such that oy = ty 
where tt — 1, then y will be called a linear differential t-variant with respect 
to o. A 1-variant will be referred to on occasion as an invariant and a 
(—1)-variant may be called a pseudo-invariant. 


DEFINITION 4. The degree and the order of a linear differential operator 
are respectively the highest power of the independent variable and the order 
of the highest ordered derivative appearing explicitly in the operator. 


Lemma 1. If y is a linear differential operator, then the degree of 
oy = the order of y and the order of cy — the degree of y. 


This lemma is an immediate consequence of the definition of o. The 
application of Lemma 1 leads to 


SE. D. Rainville, “ Adjoints of linear differential operators,” American Afathe- 
matical Monthly, vol. 46 (1939), pp. 623-627. For relations between o and the adjoint 
operator, see L. Schlesinger, Handbuch der Linearen Differentialgleichungen, Leipzig, 
1895, vol. 1, p. 426 and E. D. Rainville, “A discrete group arising in the study of 
differential operators,” as yet unpublished. 
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LEMMA ©. For a linear differential operator to be t-variant with respect 
to o tt is necessary that its degree equal its order. i 


Lemma 3. The linear differential operators A, = D? + 2? and A, == 2° D? 
+ 2xD, are invariant with respect to o. 

We prove Lemma 3 by direct evaluation of cA, and cz. 

oA, = 2° + D? A. 
cå: = Dr? — 2Dr = z? D? + 4D + 2 — 2aD — 2 = Ap. 
THEOREM 1. If A and B are linear differential operators, then 
o(AB) = (cA) (oB). 
Let v == 2*D*, then 
o (Tv) =e o (at D") = (— 1) Drg = — Do(zt Dr), 

so that we have 


(8) o(zv) = (ox) (ov). 
Next, 


o o(Dv)= o ($D 4 k D") == (—1) Dre 4 (—1) kD Eg 
= (—1)*eDter + (— 1)*k Dig” 4 (— 1) kD Ea" = (—1)*x DEN = (ov) 


Then: 

(9) ` o(Dv) = (cD) (ov). 

Theorem 1 follows directly from (8) and (9). Further, 
(10) o*(AB) = o[ (oA) (oB)] = (o*4) (0°B), 


and, for any integral k = 0, o*(AB) = (o*A) (o*B). 

LEMMA 4. If v= aD", then ow = (— 1). 

By (10) above 
oy = (072) (o° D”) = [o(—1)*D*] [or] = (— 1) *a*(— 1) "D* = (— 1) 9, 
Lemma 4 itself leads at once to? 


THEOREM 2. If y ts a linear differential operator, then oty = y. 


4. First classification of invariants. Direct application of Theorem 1 
and Lemma 3 yields 


THEOREM 3. Any linear combination of terms of the type 


Ay™ ga À," - = + Am A g™ 


‘Theorem 2 appears in Pincherle and Amaldi, loc. oit., p. 357. 
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in which mı; i= 1,2,- <- ,$, are non-negative integers, is a linear differential 
invariant with respect to o. 


Next we obtain a simple necessary condition for invariance under o. 


LEMMA 5. A necessary and sufficient condition that a linear differential 
operator be invariant under o? is that, for each term a,x*D" of the operator, 
k =n mod 2. 


This follows at once from Lemma 4. Noting that invariance under o 
implies invariance under o*, we have a necessary condition for the former. 


THEOREM 4. For each term dynt*D* of a linear differential invariant 
with respect to o it is true that k =n mod 2. f 


DEFINITION 5. The leading term of a linear differential operator is the 
non-vanishing term of highest degree among those terms of highest order in 
the operator. | 

It will prove useful to note that, since the order of the operator is the 
order of its leading term, we have 


Lemma 6. In a linear differential t-variant the degree of the leading 
term does not exceed its order. 


DEFINITION 6. By linear differential invariants of type H we mean the 
set of invariant operators — | 
(11) - AA}; OSkSn, 
and 


1 ; 
(12) 4(n—k) (41) [AFA — AFA] ; 0 < k <n. 


LEMMA 7. The leading term of A" *4¥; 0 S k Sn, is ED. 
This follows at once from the definitions of A, and Ao. 


Lemma 8. The leading term of 
1 
ten n-k A &+1 kt n-k]. 
E(n—k) EFI AA e 
4S gr Pent 


In the proof of Lemma 8 we shall use the convention that 
y — aD? + apr DE 4- 


means that y is a linear differential operator with leading term asxD° and 
that the leading term of (y — as? D°) is ayez? D7. 
With the above convention note that the formula 
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(13) Ak me EDR 4. Qh2gthipwi4.... 


holds for k = 1 and in a trivial sense for k—0. Assume (13) to hold for 
some k. Then 


A+ —_ gtk+? J) 2k+2 + (4k + Qh? + 2) piel pien -+ e. 
== gtk+? [)2k+2 + a(k + 1) 2244244 + tees F: 


and it follows by induction that (13) is true for any k = 0. Now 
(14) AFA m FED? 1 2h (2n — hk) cI Dmi 4... - l 


holds for n = k Z 0. Assume (14) to hold for some pair of numbers n, k. 
Then 
(18) Ayr ett 4 ok = g’k])inr2 + [4k + 2k (2n — k) |z- pm + Si 8 

== 2k J) 2n+2 + 2k[2(n + 1) — k] 1p? + A 


so that by induction (14) holds for any n Z k Z 0. 
In view of (18) the application of A,* to A,"** is seen to yield 


(16) ARAT == EDM 4 krapo 1... 


for n= k= 0. Combining (15) and (16) with k replaced by (k + 1), we 
have as the leading term of [4,**A,***— AFA" +]; 0S k <n, the ex- 
pression 4(k + 1) (n — k)r™! D?™, 80 that Lemma 8 is established. 


THEOREM 5. A necessary and sufficient condilion that there exist a linear 
differential invariant with respect to o with leading term zD" is that either 


è = e= 0 mod À, osise 
or 
S=ce=_lmod2, 1S8<e i 


Lemmas 7 and 8 exhibit linear differential invariants for each leading 
term indicated in Theorem 5. We proceed to show that no linear differential 
invariant can exist with leading term not proportional to one of those indicated 
in Theorem 5. By Theorem 4 we must have == e mod 2. By Lemma 6 we 
must have Se. We have left the one case 5 e= 2h -4-1 and we con- 
sider that now. If a linear differential operator y had for its leading term 
ga pmi, then oy would have for its leading term (— r™D™:), and y could 
not be invariant. This concludes the proof of Theorem 5. We proceed to 
the main result of this section. 


.Taronex 6. Any linear differential invariant with respect to o is a 
linear combination of linear differential invariants of type H. 
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Stated in another way, we show that the invariants of typé H form a 
linear (infinite) basis for the algebra whose elements are the invariants with 
respect to o. 

Let y be any linear differential invariant with leading term ax D°, where, 
of course, § and e are subject to the restrictions of Theorem 5. By Lemmas ? 
and 8 there exists a linear differential invariant of type H with leading term 
D1. Hence we see that there exists a linear combination of y and a linear 
differential invariant of type H (with coefficient of y not zero) which is in- 
variant under o and is such that its leading term is either (a) of lower order 
than the leading term of y, or (b) of the same order and of lower degree than 
the leading term of y. Repetition of this argument shows that there exists 
an identically vanishing linear combination (with coefficient of y not zero) 
of y and linear differential invariants of type H. Thus Theorem 6 is 
established. 

Next we note that, since no two of the linear differential invariants of 
type H have proportional leading terms, it follows that the linear differential 
invariants of type H are linearly independent. 

The preceding work, particularly Theorem 6, shows that A, and A. form 
a fundamental system of invariant differential operators in the sense of 


THEOREM Y. Any linear differential operator invariant with respect to o 
may be expressed as a polynomial in Ay and Az. 


Of course, Theorem 3 has already stated that any polynomial in A, and 
A, is invariant under ø. Since A, is not commutative with A», the word 
polynomial is used here in the sense of linear combinations of operators of the 
type exhibited in Theorem 3. 


5. Second classification of invariants. 
LEMMA 9. A necessary and sufficient condtlion for 
Ten = OD" + o (a*D*) ; 0Sk,n, 
to be a linear differential invariant with respect to o is that k == n mod 2. 


Noting that 
olki = o (2 D") + o° (a*D*), 


and recalling Lemma 4 we see that Lemma 9 follows at once. 


DEFINITION 7. By linear differential invariants of type J we mean the 
set of invariant operators 
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Len; k=n=0mod?, 0Sk =n, 
and | 
Ikn; k=ne=lmod2, 1Z<k<n. 

Note that in the set of linear differential invariants of type J whenever 
kn the leading term of Ign is 2*D*; if k == n, then k and n are even and 
the leading term of In, is 2s" D*. Hence it is evident that the linear differential 
- invariants of type J are linearly independent. 

Since all leading terms permitted by Theorem 5 are included in sine J, 
we may follow the line of reasoning used to prove Theorem 6 and thus 
demonstrate 


THEOREM 8. Any linear differential operator invariant with respect to o 
may be expressed linearly in terms of linear differential invariants of type J. 


See also the remark directly below Theorem 6. 


6. A classification of t-variants. We shall briefly indicate a classifica- 
tion of {-variants similar to the above second classification of invariants. This 
done, we may consider t-variants completely specified and may proceed to two 
representation theorems with the aid of which we solve linear operational 
equations in o. 

From Lemma 4 of Section 3 we get 


Lemma 10. A necessary and suficient condition that a linear diferential 
operator be pseudo-invariant with respect to o? is that, for each term asna D” 
of the operator, k==n-+ 1 mod 2. 


Let i= V—1. If ét or if t=", then &——1 and any corre- 
_ sponding linear differential t-variant with respect to o is pseudo-invariant with 
respect to o”. If £ = — 1, then we have actual invariance with respect to o”. 
Hence Lemmas 5 and 10 lead to ~ 

THeoREx 9. For each term aux D" of a linear differential t-variant with 
respect to o tt is true that k =n + 4(1— t) mod 2. 

Lessa 11. A necessary and sufficient condition that 


| LO =D" + Bo(aD*), t= 1, 
- be a t-vartant with respect to o ts that k = n + $(1— t) mod 2. 
Since 

OL = o(æ D") + t9o?(akD*) 


tI = taD" + o(a*D"), 


and 


a necessary and sufficient-condition for the equality of ol (H and é7(9 is that 


4 
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a’ (2*D") = 2*D*. By Lemma 4 this is equivalent to t? = (—1)** or to 
k=n + 4(1—?#*) mod 2. 

Considerations similar to those used in we proof of Theorem 5 yield a 
proof (omitted here) of 


THEOREM 10. A necessary and sufficient condition that there exist a 
linear differential t-variant with respect to o with leading term èD" is that 
either 


S=. + 4(1— P) =0 md? 058 <ce4+F(14 #) (142), 
s=. 441 — t) =1 md? 1<8<et+H(1+#)(1—+). 


or 


DEFINITION 8. By linear differential t-variants of type J we mean the 
set of t-variant operators 


I); k=n+4(1—t)=0mod2, 0Sk<n+H(1+H)(1+1), 
and 
ITO; kæ=n+i(i—é#)=imod?, 1Sk<n+4(1+4+%)(1—%). 


It can be seen that the linear differential ¢-variants of type J are linearly 
independent. Reasoning parallel to that used to prove Theorem 6 will 
demonstrate 


THEOREM 11. Any linear differential t-vartant with respect to o may be 
expressed linearly in terms of linear differential t-variants of. type J. 


See also the remark directly below Theorem 6. 


7. Representation theorems. We shall prove the following two theorems 
on the representation of linear differential operators of type P. 


THEOREM 12. Any linear differential operator of type P may be repre- 
sented in one, and only one, way in the form 


(17) I+P+Q+W 


where I ts an invariant, P a pseudo-invariant, Q an i-variant, and W an 
#-vartant. 


THEOREM 13. Any linear differential operator of type P may be repre- 
sented in one, and only one, way in the form 


(18) ht Pollet Pe) DU + Po 


where IIs, Ia are invariants and P,, Poe, Ps are pseudo-invariants. 
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In order to picture more clearly the relation between Theorems 12 and 
13, let us consider the operator 22°D. We may write 


20D == (— ir D? + £D — AD) + (iD? + rD + D), 


where Q == — tD? + °D — AD and W = izD? + xD + iD are respec- 
tively an i-variant and an <*-variant. Here, though the operator ®x°D is real, 
the representation (17) introduces the imaginary unit. We may, on the other 
hand, write 

2a*D = 2[(—1) + (22D + 1)], 


where I, == -—1 and P,»-22D-+1 are respectively an invariant and a 
pseudo-invariant. Hence, using (18) the representation of 227D is “ real.” 
The representations (17) and (18) play roles corresponding to the two solu- 
tions F == 4,645 + a,e-** and F = c, cos w + casin g of the differential equa- 
tion (D? + 1)F =0. 

The example in Section 10 illustrates the fact that ( 18 ) may on occasion 
have considerable advantage over the apparently simpler and more natural 
representation (17). | 

The representation (17) is essentially a result of the fact that o satisfies 
the operational equation o*t == Æ, the identity. 

Proof of Theorem 12. First we give an explicit expression for any term 


a*D* of a linear differential operator in the manner desired. Let k,n be non- 
negative integers. Then, using the notation of Lemma 11, we have 


(19) AD =i (D) TE +19] + 4 — (— 1 + 19]. 


Because of the linearity of the operators uniqueness of (17) will follow 
if we show that 
(20) = I4P4Q+W=0, 


with the notation of Theorem 12, implies I = P == 9 = = W= 0. 
If we operate on (20) with o* we find 


(21) I PQR. 


From (20) and (21) we have Z + P= 0. Operating on this with o, we get 
I— P = 0. Hence I = P = 0. But (20) and (21) also lead to Q + W = 0, 
from which it follows that iQ —iW = 0. Hence Q == W = 0 and the proof 
of Theorem 12 is complete. 


i 
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12. Then there exists one, and only one, ie differential operator which 
satisfies the A ci y : | 


(81) — go"y + moy + moy + ay =1+P+Q+W, 
namely i i ; 
(82) | Fae ewe eu 





‘That (32) is a solution of (31) may be seen by direct substitution. If 
there were two distinct solutions of (31), the non-vanishing difference of those 
solutions would satisfy (28), the homogeneous equation. In view of the in- 
equality of Theorem 15 and the necessary condition in ‘Theorem 14, this is 
impossible, 

Equation (32) is readily altered 7 fit the case where the homogeneous 
equâtion also has a solution. Let us suppose, for example, that in (31) we 
find A; = A, = 0, Aâ 540. Then there exists no solution of (31) unless 
P—0and W == 0. If these conditions are satisfied, the general solution of 
(31) is ; 


IZ th HR +Y, 


where P, is any pseudo-invariant, W, any t’-variant and ihe other symbols are 
as in Theorem 15. There is here a noticeable resemblance to the general 
solution of a non-homogeneous.ordinary linear differential equation. 

If in Theorem 15 we use the representation-of Theorem 13, instead of 
that of Theorem 12, we need only to ‘replace (31) by 


(31) azoty + a0°y + moy + doy = I + Pa + (Ia + Pa) + D(L + Ps), 


and (32) by - 
O P, ads — 


(32’) f TER Na, Ake & D, Pa — al, — P3] 


= a Le( + Pa) + D(Is+ Ps]. 


9. Linear differential E R invariant under o. We have inci- 
dentally solved the problem of determining what linear differential equations, 
are invariant under o. Theorem 14 and the remarks following it yield at once 


THEOREM 16. i Let y be a linear diferential operator of type P and.let 
F be an undetermined function of z.: Then a necessary and suficient condition 
that yF = 0 be invariant under o is that y be a t-variani with respect to a. 
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This is an interpretation of the fact that for oy = cy to have a solution 
for constant c it is necessary and sufficient that c* = 1. 

-By means of the representation (18) of Theorem 13 we are able to state 
this result in another form sometimes more useful. | 


THzorem 17. Lei y be a linear differential operator of type P and let 
F be an undetermined function of z. Then a necessary and sufficient condition 
that yF = 0 be invariant under: a te that y be expressible in one of the four 
forms (83) (36) ; 


(33) y=, 
(34) : y = Pi, 
(35) © y= e(l + Pr) +iD(I;— Pi), 
(36) ` y = z(I: + Pr) — +D (I, — Ps), 


‘where Iu, I, are invariants, Pı, Pa are pseudo-invarianis, and i= V— 1. 


10. Linear operational equations in a: variable coefficients. We may i 
' generalize equation (31) to the case where the -coefficients are themselves 
linear differential operators. We consider 


(37) ag (oy) Bs + aa (oy) ba + a (oy) bs + doybo—= A, 


where the a, b4; i — 0, 1,2,3, and A are known linear differential operators 
and y is to be determined. We shall illustrate our two methods of attack on 
(37) by means of a numerical example. Consider 


(38) roy — Dy = 0. 


If we use the representation or +P ma Q + wW as inated in 
Theorem 12, we find that 


(89) 2(I—P) + i2(Q— M — D+ P)— D(Q +W) =0. 


Operating on (39) with o, o°, o%; and combining the resulting equations, we 
readily obtain { == 0, P == 0, and 


(40) © (D—iz)Q + (D +i) W — 0. 


Further, if we substitute y == Q + W into (38) we get (40). Hence, the 
general solution of (38) is y = Q + W where Q and W are respectively any 
i-variant.and any #°-variant subject to the restriction (40). 

* Let us now attack (88) with the representation made available by Theorem 
13. In this case the solution appears in a more satisfactory form. Let 
y= L + Pı + (la + Pa) +P + Ps). - Then from 42) we get 
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(41) (e—D)I,—(e+D)P, 
i — (22D + 1) — P: + r° (I — Pa) — D? (I; + Ps) = 0. 
Using o on (41) we arrive at 
(42) — (z+ D) — (D —1)P, 
+ (21D + 1)Ze + Pa + D?(I3 + Ps) — zr’ (13 — Ps) = 0. 
Equations (41) and (42) combine to yield 2D (I, + Pı) = 0, hence I, = P, 
— 0. We return to (41) which has become 
(43) (22D + 1) + Pa— z? (Is — Ps) + D? (I + Ps) = 0. 
Substituting for P, from (43) into the assumed expression for y we get 
y = — DI, — (1D? — D — t?) I, — (1D? — D + zè) Pa. 
Since J, and J, may be any invariants and P, any pseudo-invariant, we shall 
write i 
(44) y = @ DI, + (D? — D — r?) Is + (1D? — D + r) Ps. 
Then 
roy = z( D°x) I, + «(— Dr? — x + D*)I,;— 2 (— Dr? — q — D) P, 
= (1D? + 2D) + 2(D? — r° D — 3x)]s + «(D? + rD + 3r) Ps, 
and A 
Dy = (£D? + 2xD)1, + (1D? — D — 32°) Is + (1D + D + 3r) Py. 


Thus we have the result: the general solution of (38) is (44), where J, and 
I, are any invariants and P, is any pseudo-invariant with respect to o. In 
this case the representation in Theorem 13 is seen to have a considerable 
advantage over that in Theorem 12. 
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ON THE MINIMUM NUMBER OF POLYGONS IN AN 
IRREDUCIBLE MAP.* 


By C. E. Winx. 


. In a recent paper Franklin‘ proved the number of polygons? in an 
irreducible map M to be at least 32. It is proposed here to shew with the help 
of certain new reductions that the number is at least 36. 

Our main object is to set an upper limit on the number of pantagons 
touching a ‘given polygon of = When the contacts are consecutive, we use 
the fact that 


A. A polygon of 5, 6, Y or n> sides is reducible when in contact 
respectively with 3, 3, 4 or n—2 adjacent pentagons? 


_ When a pentagon has separate contacts with other pentagons, we note its 
reducibility * if it touches the chain 5665. And, combining with A, we find 


B. A pentagon is reducible when in contact with 4 minor polygons of 
which the extremes are pentagons. 


Using the fact that 


C. A hexagon in contact with the chain 5565 or 66665 is reducible,” 
we shall prove a.result analogous to B, namely 


D. A hexagon is reducible when in contact with 5 minor polygons of 
which the extremes are pentagons. 


Tbis still allows the possibility of a hexagon of M touching two separate 
pairs of pentagons and two major polygons. But in this case we observe that 


E. A hexagon touching two separate pairs of pentagons is reducible when 
both patrs are in triad with another pentagon.® 


* Received June 24, 1938. 

1“ Note on the four color problem,” Journal of Mathematics and Physics, vol. 16 
(1938), p. 172 (published at Mass. Inst. of Technology). 

*Tn an irreducible map overy region is either a minor polygon of 5 or 6 sides, or a 
major polygon of more than 6 sides. 

3 The only recent case is the third, given by the author, “ On certain reductions in 
the four color problem,” Journal of Mathematics and Physics, vol. 16 (1938), p. 159. 

#C. E. Winn, “ A case of coloration in the four color problem,” American Journal 
of Mathematics, vol. 49 (1937), p. 515. 

5 Loo. cit.*. Unfortunately the claim in footnote 17 turns out to be unfounded. 
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As regards separate contacts with a heptagon, it is known that 


F. A heptagon touching 4 pentagons and 3 hexagons in any order is 
reducible. 


We supplement this result by proving that 
G. A heptagon in contact with the chain 55655 is reducible. 


The details of the new reductions appear at the end of the paper, as 
well as those of a few configurations not employed here. The latter are as 
follows: 


A pair of pentagons in triad with a heptagon and touching no other 
major polygon. 
A pair of hexagons in contact with 55655 or 556655. 


The next configuration is obtained by introducing into the ring-of Errera ° 
the triad 575 occurring in a recent reduction of Franklin, an odd number of 
hexagons being allowed. 


Any ring formed of pairs 57 and, optionally, pairs 55 and hexagons in 
any order, the pairs 57 being oriented in one direction and each in triad with 
a pentagon that touches no other polygon of the ring. 


As in Errera’s case an isthmus in the reduced figure implies a Birkhoff 
ring in the original map when the ring encloses a single polygon, a pair or a 
triad—otherwise it may invalidate the result. Simple instances of unre- 
stricted reducibility are those of 5(5)7666 about a pentagon and 5(5)765(5)76 
about a hexagon, the digit in brackets denoting the pentagonal ‘cap.’ The 
final configuration is a modification of this type, namely 


The ring 5(5)7665 about a pentagon. 


If as be the number of pentagons A, in M, and jon be the number of their 
contacts with higher polygons An, we shall have 


(1) jos + 2 D fon == Sas. 
nT 


For the contribution of A, to the left member (which cannot exceed 10) 
is at least 4 when it touches one or no other pentagon. Also in view of the 
reductions A, B and the fact that an irreducible pentagon touches at least 


e“ Une contribution au problème des quatre couleurs,” Bulletin de la Société 
mathématique de France, vol. 53 (1925), p. 42. 
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one major polygon,* the possible contacts of As with more than one pentagon 
are BONSN, binnN, 55nNn and 5nN ön, where n = 6 and N = 1, as hereafter. 
So in general the contribution is seen to be at least 4. 

Let us now denote by 4,‘ an À, contributing 4 + r to the left of (1). 
Then as") being the number of such pentagons, we get 


$ 8 
(2) jse + 2 D fon = 405 + X ras”. 
5 f “net ral 


4 


Further, let An“) be an A, touching r pentagons. Their number being 
än”), it follows from the last case of A that 


| n-3 
(3) ; jon = X, Tar”. 


Tal 


The combination of (2) and (3) leads to 


È Th +2 = 5 Taa) = As + bY ras", 


nT r= 
whence, seeing that an = + an, we get 
6 i 
2 J, (8n—17)an + ag + Ra) = 4as + E ras” ! 
nee r=1 


HE E (3n—r— 17m, 


net r=1 
Consequently, if we shew | 
6 — 
(4) de® + 2ag + 2a," <= S ras) + 2 > 5 (8n —¢r—17)a,", 
r= nT rl 
the negative term, given by n — 7, r= 5, being omitted from the double sum, 
it will follow that 
D (8n —17) an = Ras. 
nae 


Then we shall obtain by Euler’s relation, as required, 
(5) ds + ZE An = 8a; — 3 À (n — 6) an == 36. 
: n=6 427 


It may be remarked incidentally that, if no two pentagons of an irre- 
ducible map are adjacent, then at least 18 pentagons touch 3 or 4 hexagons.’ 

In fact, denoting the number of pentagons required by a,’ and a,” 
respectively, we have, since the contracts jsn are separate, 


“Cp. Reynolds, “On the problem of coloring maps in four colors,” Annals = 
Mathematics, vol. 28 (1926), p. I. 
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ZE [ar ]an = E jon = 305 — ay’ — Las” 
n=7 
= 36 + 3 > (n a) 6)an seme as TT, Ras”, 
n28 
whence 
ds’ + 2a,” = 36. 


To establish (4), we shall set against A,‘*), Ag") respectively one or two 
polygons At"), A,“ adjacent to them as compensating elements which con- 
tribute to the right-hand side; and against 4:(%) one or two such elements 
As"), Tt will then be necessary to verify that the number of sources of a given 
element, after reckoning twice the source A;‘*) yielding a single element, is 
at most equal to the corresponding coefficient on the right of (4). 

From C and D we deduce that a hexagon of W that makes separate 
contacts with 3 pentagons must touch at least two major polygons. Thus 
A,‘®) is bounded by either 5n5N5N, 55N5Nn or 5N5nN5. In the former two 
cases we take as our element the last pentagon a adjacent to Ag‘), which is 
bounded by 6NmmN, where m = 56, as hereafter. 

In the last case let bede be the last four polygons about 4,(%), and let 
f be the outside polygon touching de. Then, if c > 6, we choose the element 
b(6NmmN). If c= 6 and f= 5 or N, we choose e, which, on account of A, 
is bounded by 65N5N or 65mNN respectively. Finally, if c= f= 6, our 
element is d(66- - -65). If d is an A, we infer from F that r < 3, so that 
A, does not appear as an element. 

The contacts of A, are 55N55N in view of A and C. Moreover, we . 
conclude from Æ that one of these pairs of pentagons g, g’ are not in triad 
with a third pentagon nor, by A, with another hexagon, when g, g’ are in 
chain with a third pentagon. We here select two elements, namely g, g (65NmN) 
or (656n N). 

On account of A and G the ring round A,‘ is 555N55N. If the fourth 
(or last) polygon is also an A,“*), we take the two pentagons h, h’ touching 
both A,™’s, These are both A,‘?)’s, their contracts being 75N57 by A. But, 
if there is no adjacent A,‘*), we take the extreme pentagon i of the first three 
which, in virtue of B, touches an outside major polygon. We have then a 
single element bounded by 75NmWN’, where N° is not an A,(5). 

In each of the above rings about an element we have placed first its 
source. Consequently, a polygon with such contacts may occur as an element 
as often as one of its adjacent polygons fits into the first place of the ring 
(allowing for a reversal). We have thus to examine the possible occurrences 
of Ay‘), where J = r < 5, and of A, where I Sr S n— 2 or 3, according 
as n is greater than or equal to 7. 


12 
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The incidence of A,‘ is at 
a, b(6N55N); e,g,g'(GENEN); 6(655NN); g,g'(6566N). 


There is no repetition here, since the two adjacent hexagons touching 
g or g’ cannot come first in any of the four rings. ` 
The incidence of A; is at 


a, b(6N56N); ¢,9,9'(G56NN); h,h’(Y5N57) ; i(75N6N’). 


We observe that this element occurs twice at most in the first two rings, 
which contain only two hexagons; also these rings are distinct from the last 
two. Now, by supposition, the last polygon of the third ring is an A,“, 
whereas the last in the fourth ring is not. Hence an element A,‘) can only 
appear twice in the third ring and once in the last, but not in both. This 
yields altogether a maximum of 2 occurrences, counting that at + twice. 

The incidence of A, is at- 


a,b(6N5NN) or (6N66N); e(65NNN); i(Y5N6N’). 


This element occurs only once in the last ring, as the third polygon, being 
next to a hexagon, is not an A,‘), It can then fall but once elsewhere, 
namely in the first ring. Thus the maximum amounts to 3, seeing that none 
` of the first three rings contain more than 3 hexagons. 

The incidence of À,(%). is at 4 

a,b(6NN6N); i(75NNN’). 

The N between N and N’ not being an A,‘*), we infer that this element 
can only fall twice in the last ring, and not then in the first. Hence, as the 
first ring contains but two hexagons, the maximum here is 4. 

Lastly, and 4,(%) is only to be found once, at a, b(6NNNN), while A,‘ 
does not occur at all. So altogether the number of pentagons compensating 
Ag", Ag, and A, is not in excess of the first sum on the right of (4). 

The number of occurrences of An‘) at d(66 : - -65) cannot exceed n—r, 
i.e. the number of hexagons touching d, which is at most equal to the coeffi- 
cient of an") in (4), unless n = 7, r = 2 or 8. Moreover, in the last two cases 
the largest number of hexagons coming third in the sequence 6566 (or second 
in 6656) is found by inspection to be 4 or 2 respectively, i.e. not more than 
the coefficient of a“. This concludes the demonstration of (4), and so 
of (5). 

We now reduce ® the cases of D not contained in A or C, namely a hexagon 
touching 56565 and 56665. : 


* The scheme of reduction is that explained in Loc. cit. “. To accommodate a trans 
formation in one line a comma is sometimes used to mean ‘or,’ when it could not mean 
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56565. See Fig. 1. 





(1) d, f, h=2; f=3 or h = 2, unless dfh = 332, 342: u=1 












































(2) dfh = 243 
12 e to gori f=3, g =3,4 u=] e= 
32 k toa ore i= 4, d= 1,4 u= h = 
34 b to f o= us] b—4 
13 g tot h= u=l 
13 g tod i= ud g = 
32 e to a b=1, d= u—4 e=3; 0,g = 2,3 
u = 3 
(3) dfh = 244 
43 f tob c—=2 wood f= 3, h=3,4 
(1) 
(4) dfh — 223 
13 g to e or o f=4, d=2,4 (2), (1) g=3 
34 g to b a=l,h=2 uss 
34 g toi ghi = 424 
(23 a to f or h i=], g= 4,1 u=] 
23 dtoforh a =3 u=2 abcd == 3243 * 
24 b to à a=1 ua 4 = 4 
12 etocorh d=4 or g=3 u=? e=? 
23 ctoaorh b=], i= 4,1 u= 2 o=3 
u = 4) g=4 
us] 
(5) djh — 224 
31g tod h=2 | u=] 
3lgtoeoro f=4, d=2,4 (2), (1) I= 
34 g to b a = uo g=4 
41 g tot h=2 us | 
41 g toe ore f=3, d=2,3 w=] g=1 
(4) 
(6) dfh = 323 
13 g toe f= (1) g=3 
34 i tog h=2 u=3 i= 4; b, d =3,4 
u=] 





‘and? Thus a,b == 2,3 means a= 2 or 3 and b—2 or 3. 





Also, if a chain ‘affects 


adjacent polygons, it suffices to note the change in one of them. It should be added 
that, unless otherwise pointed out, an isthmus in the reduction implies a Birkhoff ring 


in the original figure, as can be at once verified. 


°? The absence of a 2 3 chain from d to b allows a = 3, as just given. 
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(7) dfh = 324 
14 g to e or o f= 3, d=2,3 u—= 1] g=4 
12 k to f or a g ox i=3 u=] | 
12 À too . g=3,d=4 u=] h=2 
23 a to d or f e= 4, e= 1,4 u= 4 
23 h to d or f g =l, e= 4,1 u= a = h = 3 
42 à to g or b hora=1 u= 1,4 i1=2 
41 g toeore f=3, d=3,2 w=] g=1 
24 b to f c=3 u=2 
24 b toi f=4 uc] be 4 
43 B to d or h : o=2 or i=l u= 2,1 b =3 
32 b to f o=e=4 u=3 b= 2; d, h =3,2 
“= 2 
(8) dfh = 332 
23 h to f g=4 u=3 h = 2; d, b =2,3 
(1) or equivalent 
(9) dfh = 342 
41 e to o or é d=2 or h=3 u=] e= 4 
12 f tok oro g or e=3 we], f=2 
32 dtoaor khk o= 4, i=], 4 umd 
32 ftoaork. g=1,+=4,1 u= 3,1 d = 2, 
42 etogora forb=1 u=1,3 e= 
4lgtotord k or e=3 u =l g= 
13 gtoioro h or e=4 u= g=3 
21 h to f g=4 u= 1 
21 A too e=1 u=] h=] 
14 a to f g=2,i=3 u=2 a=4; c,h—1,4 
| ul] 
(10) dfh = 423 
14 4 to g ore h=2, f=2,3 (1) t=4 
24 f to d'or i e or g=3 u = 3,1 f=4 
43 f to b c—e—2 u = f=3; d,h=3,4 
č t= 3 
(11) dfh = 424 
14 g toe {=3 (1) g=4 
43 g to b a= l, h=2 u= 2 
48 gtoi ght = 323, d = 4,3 u== 3 g=3; d= 
u= 7] 
N60666. See Fig. 2. 
(1) j=3 unless dfh = 243 (342) us] 





t 
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(2) j=4 unless dfh — 222, 244 (332) 


























u=1 
(3) j= 4, dfh = 222 
12 i to a i=3 u=1 o=1 
42 j tod a == 3 u = 1 
42 j tof i—g—3, d=4 u=3 j=2; k =2,4 
u=? 
(4) j= 4, dfh = 244 
l4 c toe d= u=l omd 
43 j tobor h a=] or i=? u=4 
43 j tof j=f= ; 
(23 d to b o=] u= d= 3; f =3,2 
u=3) j= 
23 d to b o= u=l d=3 
42 o toa or h b=], j=3,1 u=l 
42 o to f o=f=2 
(21 o toaore b or d= =] 
21 otoi d= 4, k = w=] o=1 
u=1) o=2 
34 b tod o= u=] 
34 b to f d= 4 u=1 b=4; h,j=3,4 
u=1 or (8) 
G. NN650556. See Fig. 3. 
(1) d=3 or h=2 or f—4 #—= 1 
(2) dfh = 444 | 
v A 
43 d to b o= 2 u=4 d= 3; f, h= 4,3 
u=l1 
(3) dfh = 244 
13 $ to g, ec h = 2, etc. (1) +=3 
23 d to b -0 = 4 u= d= 
3l 4 to b or g a=4 or h=2 wel i=l 
(1) 
(4) dfh = 243 
| etocori d=380rh—2 (1) e=4 
34 b toh ore i or c—2 u= 2,1 b—4 
14 0 to e or à d or a—=3 u= 3,2 o= 
12 6 to d'or f o==3; e= 4,3 u=3 b= 
24 à to g or b h=1 or o=3 ` u=4 i=4 
$ u=4 | 





A pair of pentagons in triad ‘with a heptagon and bounded 
minor polygons. See Fig. 4. 


elsewhere by 
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The polygons g,t and f,j must be hexagons on account of A and B 
respectively, The case where h is also a hexagon has lately been reduced by 
Franklin: The remaining configuration, where À is a pentagon, can be 
colored immediately in the present reduced figure: 


(1) o— b= 2 ume 2, v=] 
(2) a= 2, b= 3 u= 2, v == 1, unless d= 3, c= 4; then u—1, v—? 
(8) a=b=3 u—3,v—1, unless d= 2, c= 4; then u= 1, v—2 


(4) a= 3, b=? u=? or 4, v= 1 
or 4, 


A pair of hexagons touching 55655 or 556655. See Figs. 5 and 6. 

We may suppose that a hexagon of the chain forms a triad with the given 
pair, as otherwise we get the reduction C. 

We can color both figures immediately by marking u, v, b with 1, 2, 3. 
Then either 3 can be used for c, or else we get a choice for 8 (similarly for 
d and f). . 

There is an obvious extension when u, v have 2k, 21 sides and touch 
k — 2, 1—-2 pairs of pentagons. 

The reduction of the modified Errera ring R together with its pentagonal 
caps is made by suppressing these except for alignments +° connecting the 
remaining free vertices of caps, pairs of pentagons, hexagons and heptagons 
(see Fig. 7). 

As to the coloration of Æ, we note that, just as in Errera’s case a hexagon 
or pair of pentagons can always be colored when one neighboring polygon of R 
is already marked. Moreover, if the polygon a following cb (67) with the 
cap d is already marked, we can always color bdc. For b is adjacent to two 
other colors, namely that next to the part of R including c and the other color 
bounding the unreduced alignment L of b. The marking of b leaves 3 colors 
next to d. Then likewise 3 next to c. | 

We now have two possibilities according as the unreduced polygon e 
abutting d bears the latter color bounding L or not. If so, we mark c with 
this color and fill in E going away from b, with a final choice for bd. But, 
failing this combination at any pair 57, all such pairs can be colored in the 
reverse direction when the polygon of R previous to the pentagon is already 
marked. Consequently we can then fill in R, starting from b with the color of 
6, passing through a and finishing with a choice for cd. 


1° The alignment crossing R at a pair of pentagons passes along their common 
side. Those crossing R at a hexagon or heptagon are kept apart by tracing them round 
the perimeter in the same sense from a given side of R. 
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We observe that in the reduced figure alignments crossing R may divide 
it into a number of parts. So an isthmus is formed at an alignment if it is 
the only one to cross it. The other possibilities of an isthmus are 

(1) if a polygon u makes two separate contacts with the same part of R, 
one contact only being reduced. 

(2) if two adjacent polygons v, w make separate reduced contacts with 
the same part of R. 

(3) if a polygon ¢ makes separate reduced contacts with consecutive 
parts of R. 

When R encloses a single polygon x, no alignment crosses it. In case (1) 
we get a 4-ring formed by u, two polygons of R and x, or a 5-ring about more 
than one polygon including a cap. In case (2) a similar 5-ring is formed 
by v, w, two polygons of R and x. 

When À encloses a pair or a triad, it is crossed by 2 or 3 alignments 
respectively, or else two free vertices belonging to the same polygon or pair 
of pentagons on this side of R give rise to a polygon of 4 sides or less. Cases 
(1) and (2) yield the same result as above for the part of À considered. 
Lastly, in case (3) we have a 5-ring about more than one polygon formed 
by ż, two polygons in the two parts of À and two of the enclosed polygons. 
Thus the reducibility is unrestricted for the configurations in question. 


(5) 7665. See Fig. 8. 





(1) unless d—emm? u= J 
(2) d == e =? 
24 d to a C m 3 u= 1 d = 4 
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INFINITE PRODUCT MEASURES AND INFINITE 
CONVOLUTIONS.* 


By E. R. van KAMPEN. 


Introduction. The purpose of this paper is a systematic study of certain 
measurable functions on an infinite product space carrying a Lebesgue measure 
of the product type, especially the convergence theory of sequences of such 
functions and their distribution theory. Such a study is necessitated by the 
fact that these topics were considered during the last two đecades from many 
different points of view by many authors and the development of the theory 
was quite slow. This warrants a uniform treatment of the central phases of 
the subject. An attempt will be made to approach each point by the method 
through which it is most easily accessible. There will result in this manner 
not only a systematical presentation of the general theory but quite naturally 
also several results which are not to be found in the literature. 

Although some references are given, no attempt has been made at a serious 
historical study of the subject. Numbers in square brackets refer to the list 
of references at the end of the paper. References to statement numbers in 
parentheses are preceded by the roman nümeral of the Part in which the 
statement occurs, except if the reference occurs in the same Part. 

"Part I concerns the theory of a product measure in a product space. The 
idea of such a measure developed from the theory of probability, cf. [1]. Later 
it took the form of a measure in certain special product ‘spaces defined by 
means of a measure preserving mapping, cf. [27], pp. 496-497, [2 bis], [28], 
[29], [25], [3], [9]. Finally it took the form of a product of measures in a 
product space, defined directly as the product of given abstract measures in the 
factor spaces, cf. [20], [22], [8]. In Part I only so much is stated as is neces- 
sary for the understanding of what follows. A proof is given of the 0 — 1- 
theorem stated as I (6). The development of this theorem may be followed 
through a wide range of papers, for instance, [1], [28], [16], [21], [20], 
[9], [31]. | 

A measurable function on one factor of the product space may be con-. 
sidered as a measurable function on the product space which is independent 
of all but one of the codrdinates of each point of the product space. Part II 
concerns formal series of such functions, each series containing one term for 
each factor of the product space. The convergence theory of such series is 


* Received June 19, 1939; Revised January 18, 1940. 
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easily accessible on the basis of the product measure introduced in Part i 
` The central theorem is the Three Series Theorem (Theorem I), which contains 

necessary and sufficient conditions for the convergence of the type of series in 

question. . This theorem is due to Kolmogoroff, [17] and [18], who was led 

to this problem in connection with a particular series of. independent functions 

introduced by Rademacher, [26]. 

In Part III it is shown how a simple mapping may transform a sequence 
of independent functions in the sense of Kolmogoroff, [20], or equivalently 
in the sense of Steinhaus, [11] , into a sequence of functions of the type con- 
sidered in Part II. Thus one can write, corresponding to every theorem of 
Part II, a corresponding theorem on series of independent functions. The 
mapping of the space of Part III on the product space of Part IT is not a 
correspondence between points of these spaces, but a measure preserving corre- 
spondence between sufficiently extensive classes of measurable sets in these 
spaces. For considerations of the type used here such a correspondence is 
gufficient (cf. [82], 88). The last paragraph of Part ITI contains the negative 
answer to a question of Kac and Steinhaus, (cf. [80], § 6). 

Part IV concerns the convergence theory of infinite convolutions. The 
results of Part II are transferred to the theory of infinite convolutions by 
means of Theorem V, cf. [10], Theorem 32. A ‘first proof of this theorem is 
based on Theorem IV in § 10 and IV (6) in 817. Theorem IV, which is due 
to Jessen and Wintner ([10], pp. 84 and 85) is proved here by a method of. 
Marcinkiewicz and Zygmund ([24], p. 119). Tha other result, IV (6), is 
usually proved by means of the theory of Fourier transforms ([10], Theorem 
1). It is shown here that completely elementary methods are sufficient. A 
second proof of Theorem V is based on II (17) and is independent of IV (6). 
On the basis of Theorem V a list. of theorems is stated without proof in § 20 
and 821. The relation of Parts IT and IV is much more complicated than 
the relation of Part II and Part III. For instance, it can hardly be said 
that Theorems I and VI are equivalent, even though their analogy is imme- 
diately obvious; Theorem VI is due to Jessen and Wintner, [10], Theorem 34. 
Similarly, Theorem 3 of [15] corresponds to (and is used to prove) that part 
of the last statement of $ 11 which has so far been proved. It would be 
desirable to invert this process. In other words, a simple proof of the last 
statement of $ 11 would lead to a shorter proof and a better understanding 
of Theorem 3 of [15]. 

-The pure theorem (Theorem VIII of § 22) is a a generalization of (17) 
which is due to Jessen and Wintner ([10], Theorem 85). The remark that 
(17) may be extended to cover the case of any Hausdorff measure was com- 
municated to me by Wintner. It may be of ‘interest to investigate now far 


t 


INFINITE PRODUCT MEASURES AND INFINITE CONVOLUTIONS. 419 


one may allow more general given pure functions o, in Theorem VIII. It is, 
for instance, obvious that if Yren» is convergent and o, is absolutely continuous 
for at least one value of n, then tron is absolutely continuous. 

It may be considered undesirable to prove a statement on convolutions 
like Theorem VI by means of series on infinite product spaces. However, at 
present it does not seem possible to prove Theorem VI without leaving the 
domain proper of distribution functions and their convolutions. Thus, for 
instance, the proof of Theorem VI which is sketched in 8 20 includes a short 
excursion to the domain of Part I and an essential use of the theory of Fourier- 
Stieltjes transforms of distribution functions. An account of the latter may 
be found in [5] and many applications in [6], [10], [33], [35], [3%]. How- 
ever, in view of the criterion in IV (1) for the convergence of a sequence 
of distribution functions, it may be considered probable that eventually a 
reasonably simple proof of Theorem VL within the domain proper of that 
Theorem will be constructed. A presentation of the theory of distribution 
functions as a whole may be found in a course of lectures by Wintner at the 
Institute for Advanced Study, 1937-1938; a previous presentation is con- 
tained in [10]. 

The functions of Part IT are real valued and the distributions of Part IV 
are 1-dimensional. This restriction is quite unessential. The extension to 
vector valued functions and more dimensional distributions involves only 
' formal complications, but no essential difficulties. For such extensions in 
different situations compare [5], [6], [7], [10]. | 

The convergence theory of series of independent random variables is not 
discussed in this paper. This theory, which from the historical point of view 
precedes the others, represents from the methodical point of view, an attempt 
to combine the advantages of the other theories. Apparently this combination 
succeeds only at the cost of some clarity, so that it seems preferable to consider 
the points of view of functions of independent variables and of distribution 
functions separately. A comprehensive treatment of this side of the question 
may be found in the well known treatise of P. Lévy: Théorie de Vaddition 
des variables aléatoires, Paris (1937). 


PART I. Product Measures. 


1. Let X,,n=—1,2,8,: -- be an infinite sequence of sets and X = IX, 
the product set of the Xp, i. e., the pet of elements 


(1) T= {In} == (Ti, Ta, Gas"), 


where the n-th codrdinate +, of x is an arbitrary element of Xa. This product 
satisfies the commutative and associative laws with regard to any form of 
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permutation and bracketing of the sequence of integers n = 1, 2,8, + 
If this sequence is divided into two parts I, II, and correspondingly one writes 


X «a Xr X Xn, and if Or is any subset of Xr, then' Cr X Xi will be denoted 


by (Cx) z. As an example for this convenient notation one has (Ai X As)x: 
A,X AX YX EX: "if die pasate The symbols Xa, Xn, 
will be used to denote the products X, X Xa X °°: X. Xa, Xn X Xma X: 0, 
80 that X =X „a X Xn., subsets of Xn, Xn. will be pene with similar 
subscripts. 


2. Let every space Xn carry an absolutely additive-non-negative measure 
unBn, defined for the sets Bu belonging to the field Ba of z,-measurable sets, 
and suppose that paXn==1. By the definition 


(2) pB — I Br, where B— (iI B,)x and Be C Bu, 
+ ý - A1 kl 


and by subsequent uniquely determined extension (based, for instance, on the 
method of the exterior measure), an absolutely additive, non-negative measure 
pB may be defined for all sets B in a field 8 of p-measurable subsets of x. 
This product measure » == Iyun has the property that 


(3) The set IAs, where An © Xn, is w-measurable if and one af either each - 


Anis pa measurable, in which case pA, = Upady = Up(An)x, or IL An = 0. 


Thus the use of u both in (2) and as a notation for the product measure . 
` Im is justified. In particular (3) implies that „X —1. The proof of the 
` ‘above statements may be based, for instance, on the-following intuitive lemma, 

used for this purpose by v. Neumann in a course of lectures at the Institute 
for Advanced Study, 1934-35. 
(4) If dn CB, À — DA, and AC ZB", where each B is a set of the . 
- type occurring in (3), then 
i TipnAn = SmpB" 
holds for the function p defined in (2). 

8. The measure: p — II, satisfies the commutative and associativé laws 
with regard to any permutation and bracketing of the sequence of integers 
n= i,2,3,- : -. In particular, if again X == Xr X Xn, then p= prm, 
where ur, pr are the product measures of the x, belonging to Xr, Xn. Thus 
the theorem of Fubini may be applied to any factorization of X into two 
factors. This proves, for instance, the following § statement, if one considers that 
the product B.» X Cn. is equal to the common part of (B:n)x and (Cx.)x. 

> (5) If Bw X On. are measurable sets in Xn, Xn. respectively, then 


pB.n X Cn, = p(B.n)x (On.)x = #(Bn)x #(Cn) x: 
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“The following well known theorem is important for the development of 
the theory of the product measure p = D pn : 


(6) If a measurable set CCX is of the form (On.)x for every n, then 
either p(Q) == 1 or p(C) = 0. 


The condition concerning C is to the effect that a point (1) in C remains 
in C if any one of its codrdinates is modified arbitrarily. The proof of (6) 
proceeds as follows: 

A totally additive, non-negative measure function vB may be defined on 
B by the equation vB == BC, where BC denotes the common part of B and 
C. If Bis a set of the type B = (B..)x then vB is, by (5), of the form 


vB = uBC = p(Bin)x (C n)x == p(B.n)x H(Cn.)x == Bu. 
Since the measure v is uniquely determined on B, by its values on all sets of 
the form B = (B.n)x, this implies vB == »BC = uBuC for every measurable 
set. On placing B == C one obtains the statement (6). 
4. The following convention will be used concerning Xn, pn, X, p fn: 


(7) X» is a space which carries a measure yn such that pmXn = 1. The 
product space X = UX, carries the product measure p = yum, 80 that pX =1. 
The symbol fn represents a given pa-measurable function fn = f(x) on X, 
and the same symbol represents the p-measurable function fn—=fn(w) on X, 
which is defined by: fu(X) = fn(&m) tf the n-th codrdinate of x is rn, cf. (1). 


Thus, for instance, in the symbol 3f,, the fa are thought of as functions 
on À, since otherwise addition would not have a meaning, and one e finds for 
the k-th moment Mx(fx) of fa the two expressions 


Maelfa) =f falza) * dX, = f ioa, 


if at least one of the integrals exists. The flexibility in the manipulation of 
integrals on X which one attains by means of Fubini’s theorem is illustrated 
by the following example, which is typical of many situations in Part Il: 


(8) IfnSl<m, fifa) and fn(tm) are integrable, and C is a set of the 
type O = (C_1)x, then 
Ji, HOAX — ACL ACTES 
c c x 
A special case of (7) is represented by (9). The use of the 7 on Z is 
equivalent with the use of the well-known Rademacher functions, cf. [26]. 


(9) Za is a space which consists of two points, Z'n, Zn, each of which has 
the va-measure $, so that vun = 1. The product space Z == OZ, carries the 


1 
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measure v= Im, so that vZ =1. The function t is defined on Zu by 
m (Z'n) = 1, (Zn) esl) and Ta is defined on Z according to the last 
part of (7). 

Two functions f on Y and g on Ÿ are said to be equimeasurable if 
ulf (z) <w] —Alg(y) <w].for every real w. Here [f (r) >], for instance, 
represents the æ-set defined by the inequality f(x) >w. Let the functions 
gn on Y» satisfy a convention similar to (7) and let fa and gn be equimeasurable ‘ 
for every n. If any limiting process (reducible to convergence in measure) 
is applied both to the f, and the ga, then the resulting functions are defined 
on sets of the same measure and equimeasurable on those sets. | 

A function f on X is said to be symmetrically distributed if [f (z) <w] 
= [f (z) >—o] for every real w. If the spaces X, and Y, are in 1-1 
measure preserving correspondence, so that the same holds for X and Y, and 
if fn(Tn) == ga (Yn) if the points vn and yn correspond, then it is easy to see 
that the function fu (En, Ya) = fn (tn) —gn(yn) is symmetrically distributed 
on Xn X FA, hence also that f*» (x, y) is symmetrically distributed on X X Y 
"== II(X» X Yn). Moreover, Mf, (f*n)==0 and Ma (f*n)— Mo (f*n)== 2M (fa). 
Here M2(f) denotes the value of the second moment of f — M, (f), i. e., the 
minimum value of the second moments of the functions f—const.; thus 
Ma(f) = Malf) ~ (M (Ye 

It is also easy to see that if fn is symmetrically distributed on X,, then the 
functions fan on XY, and sf, on Xna X ZA are equimeasurable, cf. (9). Thus, 
if fn is symmetrically distributed for every n, then any limiting process applied 
to the sequences {fa} on X and {rafa} on X X Z—=T1(Xn X Zn) leads to 
equimeasurable results. 


PART II. Series of Functions of Independent Variables. 


In Part II, a number of more or less known criteria are given for the 
conyergence of a series Sfn on Y == IX, where each fa = fa (£) is obtained 
from a function fu == fa (2n) On 2 according to the convention at the beginning 
of §4. The main criterion is stated as the “three series theorem” (‘Theorem I). 
Tt was obtained first by Kolmogoroff, who used the language of the theory of 
random variables, and restricted these to take on an at most enumerable set 

_of values. His original proof in [17] turned out to be most convenient for a - 
systematic presentation of the subject (§5 and §6). Some simplification 
could be obtained. For instance, IT (8) is replaced by II (2), and the proof 
of II (6) is separated into two parts, the first of which proves the separate 
lemma, II (5). The presentation of the proof of Theorem I may be varied 
in numerous ways. 


` 
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In $7 and §8 conditions are given for the unconditional and absolute 
convergence of Xf, on X. They follow easily from Theorem I. Theorem HI 
is essentially simpler in character than Theorem I, from which it is here 
obtained, and may be proved directly by means of a lemma analogous to (5), 
concerning absolute convergence. Conditions as used in (12) of § 9 occur 
for q = 2, p= 4 in [11], Theorem. 4, and for q=1, p=? in [23], 88. 
Theorem IV of § 10 is essential for thé coördination of Part II and Part IV. 


5. This $ 5 contains some lemmas needed in the proof of the three series 
theorem. The convention I (7) is of course essential. 


(1) The series Yf, ts etthen almost everywhere convergent or almost every- 
where divergent. This is an immediate consequence of I (6). 


(2) If for an arbitrary K > 0 and every n, the function f'n ts defined by 
Fn = fn or fin = K, according as | fn | SK or-| fa | > K, and also tf f'n is 
defined by fn = fa or Fa =— K, according as | fn | = K or | fu| > K, then 
Xf» and f'n are simultaneously almost everywhere convergent on X and almost 
everywhere divergent on X. This is clear, since, for a ‘fixed x, the passage 
from either of the series fn, fn to the other involves only terms which are 
of absolute value not less than I. 


(3) If SM:(fx) < + © and My (fn) = 0 for every n, then Sf, is convergent 
almost everywhere on X. 


Suppose that M:(fn) exists and M, (fn) = 0 for every n, and that Sf, is 
divergent on a subset of positive measure of X. It will be shown that SM/2(fn) 
is divergent. 

If sn denotes the n-th partial sum of Xf,, then the sequence {sn} is 
divergent on a subset of positive measure of XY. Thus at almost every x on X, 
the oscillation of the sequence s41, Sms, * `° is not less than a, positive number 
a==a(x) which depends on x but not on m. Thus it is evident that, for 
sufficiently small b > 0, the set D defined by the inequality a == a(s) >b 
is not a zero set. This means that there exists a number b > 0 such that D 
is not a 0-set, where D is defined by the condition that | s,—s» | >b holds 
for every m and at least one n—#(x) > m. If m is arbitrarily fixed, and 
k >m, let DE be the set defined by | s(t) —sn(r)| Sb for m<n<k 

-and | sk — su | >b. The sets D” are disjoint, 3D* contains D and D* is a 
set of the type (A.x)x. Let n be fixed in such a way that p 3 D” > 44D. 


r m< kón 
Since M,(fx) = 0, one sees from I (8) that 


f (sa — sa) (Se n) aX = f (sas): f (sS: — Su) AX = 0. 
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Taking in account iio ‘definition of D*, one finds for m < k = <n, 


fom) Em FS) + (ea) ar 
ee nk > WAD 


so that 
3 Malf) = f (1 — sn) aX 
k=m+1 x 
= 3 | (ne) aX > 3 bu D* > $b°pD. 
k=m+1 k=m+1 : 


Ve Since m was arbitrarily chosen, it follows that SM.(f,) is divergent, go that 
the proof of (3) is complete. 


(4) If SM (fx) < + 0, then Sf, is convergent almost everywhere on X if 

and only tf SM, (fa) is convergent; in which case Zf» converges in the mean 
(L?) on X to f == Bf, and 

| M: (f) = ZM; (fa), Half) — 2H, (fr). 

In fact, if ga is defined by gn—=fr—Mi(fx), then M1 (gn) 0 did 

OS Ma(gn) = Mo(fs), 80 that SMo(gn) < + 0. Thus according to (3), 

the series Zg» is convergent almost everywhere on X. The first part of (4) 

is now evident from the definition of ga. In order to prove the remaining 

` statements of (4), put g = 3g» = f — 3M, (fn) and let t denote the n-th 

partial sum of Sg». Since the functions gn are orthogonal on X, one has 


Mi(g) Z Hits) = à Ma(ge)- From the equality one obtains by means of the 


theorem of Fatou the. odii M9) £ ZM:(g»), 80 that M:(g)— SM (qu). 

Since the last identity may be read as the Parseval identity of the a 
g = Ëg», the remaining statements of (4) are now evident. 

(5) If | fa(2)| <E for every v and every n, if Mi(fn) == 0 for every n and 
tf Ifa ts almost everywhere convergent, then 2M2(fn) is convergent. 

Let S — s,(x) denote the n-th partial sum of the series 3fa(z). Then 
the last assumption of (5) implies, in view of Egoroff’s theorem, that 
| sa (£)| < N for some N > 0 and for every x in a set C of positive measure. 
Let the set C* be defined by the inequalities | ss(£)| < N for 0< k&n, 
then C* is a set of the form O” == (A.n)x. Moreover, CD Cr DC. On 


placing Ty = fe 8,°*4X and D" == (#3 C", one has 


Tr — Tra = J fax +2 f Re de S dX. 


ce 
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Application of I (8) gives the value M2(fy)uC** for the first integral: and 
0 for the second integral, since M,(fn)=—=0 for every n. Thus, since 
BETZ pC and | % |S K+ N on Dé, 

Ta — Ta = Ma(fe)uC — (K + N)a Dr. 
Summation of this relation for 1< k&n gives, since pD* == OK — pO* 
and pC" S 1, 


E? Z KyO” Z Ta ET, TES M, (fh)u0 — (E + N)3; 
Æ=1 


so that the convergence of 3M(fa) is evident. 


(6) If | fu(v)| <E for every x and every n, and if 3fn is almost everywhere 
convergent on X, then both series 3M,(f;) and SM2(f1) are convergent; 
so that Sfn belongs to the class (L?) on X, (cf. (4)). 

Let f*n (Tn, Ya) be the function defined in § 4, so that | fu (an, Ya) | < 2K, 
AL (fn) = 0, Me(f*n) = 2M2(fn) and 3f*, is convergent almost everywhere 
on X X F = O(a X Ya). From (5), which is thus shown to be applicable, 
one obtains the convergence of 2/2(fn). Since X(f,— M, (fa) ) is convergent 
almost everywhere by (4), it is clear that 3M, (fa) is convergent. Finally, - 
that Sf belongs to the class (Z7) on X, is evident from (4). 

6. From the statements (2) and (6) it is easy to obtain the three series 
theorem : 

TEEOREM I (Three series theorem). If An denotes the subset of Xn 
defined by the inequality | fn | S K, then the series Sf, is almost everywhere 
convergent on X tf and only if all three series 


(*) m (Zn— An); (3 f fades (9) 2 eaS 4x)*] 


are convergent for a ficed K © 0, in which case they are convergent for every 
K>0. 


In fact, if f» is any one of the functions defined in (2) for n == 1, 2,3,---, 
then Xf, is almost everywhere convergent on X, if and only if the same holds 
for 3f’,. Application of (6) to 3f, now shows that this is the case if and 
only if 3M, (Fn) and 3M2(f’n) are convergent. Since clearly 


M, (Fa) = f, fnåXn + Kyn (Xn — An) and 
EP = f fédXn— (f, fad%n)* 
A Rina AS f, A GOD C1006 CT D PO OL 
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the.convergence of 3M, (F ») and 3M, PEE for both functions f'a is equivalent 
to the convergence of the three series occurring in Theorem I. | 

The following simple sufficient condition for convergence is an immediate 
consequence of Theorem I and may sometimes be easier of access; cf. [23]; i 


. Theorem, 5. 


(7) The series Xf, (x) is convergent almost. everywhere on X if so are ‘both 
series XMi (fa) and 3My(| fa |) for a fixed ppl <pS2. 


In fact, if the second series is convergent, then so are the series 


afi frdXn, 3pa(Xn—An) and 3 T fidXss where the An are the sets 
Xuân 
defined in Theorem L Thus (7) is an immediate consequence of Theorem Ï. ` 
- On applying Theorem I to the functions fn defined by fa(£t) = 1 or 
fn(x) = 0 according as œ does or does not belong to a given measurable set 
B, on Xn, one obtains the following statement (which occurs as a lemma in. 
the usual proof of Theorem 1; cf. [17], [18]). 


(8). If Bn is a measurable subset of X, for every n, then the measure of the ~ 
set of points x of X which have infinitely many coördinates X, in the i Ba 


- 48 0 or 1 according as 2pm (Ba) ts convergent or divergent. . 


As remarked by M. Kac, application of Theorem I and of the other con- 
vergence criteria of Part II, to the series 3f,/n leads to most of the well 
known sufficient conditions for the strong law of great numbers; cf. [11] p. 54. 
In fact, if 3fh/n is convergent, then (f: +°::+f,)/n—>0 as n— œ. For 
instance, on applying (3) to the series SE, one obtains the ee 
sufficient criterion for the law of great numbers. 


(9) If Me(fa) exists for every n, and tf 3M: (fn)/n and 3M;(fn)/n° are 
convergent, then 


(fit: +++ fa)/n—>0, as no, 
almost everywhere on X. 


7. Since in (4), the series ¥M1(fn) has non-negative terms, and since 
the product space of X° and its product measure y satisfy the commutative law 


"under any permutation of the factors X» and px, one may read (4) as follows : 


(10) If Ma(fe) exists for every n and if %M2(fa) is convergent, then the 
sum Xf, ts convergent almost everywhere, no matter in what fixed order the 
terms are taken, if and only if 3M; (fa) is absolutely convergent. 

If Ÿ and ©” denote the sums of two dont ie of fn, then Y= t” 
almost everywhere on X. 
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_ im order to. prove this last statement, let g» be defined again as 
fu — Mi (fn), let ts, tx” be the partial sums of the two rearrangements of 


SG; and let Tx’, Tn” be the partial sums of the corresponding rearrangements 
. of 3M: (gn). Thon 


f (tx — É ta”) *4X = = T,! Sa a 
so that the integral tends to 0, as n—> œ. Hence, by Fatou’s theorem, 


Í (# =t") 4X = 0, 


80 jé Y = t” almost everywhere on, Z. 
Using (10) instead of (4) in the proof, one obtains a as in $ 6 the following 
theorem, where notations of Theorem I are used: 


THEOREM II. The series Xfn(x) is convergent idimost everywhere on X, 
no matter in what ficed order the terms are taken, if and only if the series . 


(*) and 3f fa’dXn are convergent and the series (**) is absolutely con- 


| vergent. Moreover, if Y and t” denote the sum of two rearrangements of Zfa, 
then Y — t” almost everywhere on X. 


_ As an immediate consequence of Theorem II, one sees that if Xf» is con- 
vergent almost everywhere on X, then constants a, may be determined in such 
a way that the type of convergence of Theorem IT holds for X(f, — a,). One. 
may select for instance as dq the terms of the series (**) of Theorem I. 


8. The type of convergence of Xf, which was discussed in § 7 should be 
distinguished from the unconditional convergence of Sf, almost everywhere 
‘on X. The latter is equivalent to the convergence of X | fa | almost every- 
where in X, and obviously implies the type of convergence discussed in 87. 
Simple examples show that the converse implication does not hold, For 
instance, the series Sr/n (cf. I (9)), satisfies the requirements of Theorem II, 
but 3|r, |/n is divergent everywhere on Z. 

A condition for the convergence of X | fa | ‘almost everywhere on X may : 
be obtained very easily from Theorem I. In fact, the set À, of Theorem I 
is the same for f, and | fn |. Thus the statement of Theorem I for the series | 
S | f» |, is that this series is almost everywhere convergent on X if and only 


if the three series GS parc JS. lfs | dx)*] and af. | fa | aX 


are convergent. Since Si FX E SK X | fa | dX», the result thus ‘proved 


reduces to: 


. F 


T œ 
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THEOREM IL. The series sf. (x) is absolutely convergent almost every- 
where on X if and only if the two series 


Sun (Za — án), 3 ft @% 


are convergent for a fixed value of K > 0, in which case they are convergent 
for every K > 0. | 

The simple sufficient conditions for absolute convergence in (11) are not 
equivalent for any two distinct values of p, cf. [34]. 


(11). If for any fired p 0 <p = 1, the series 3M,(| fn |) is convergent, 
then Sf, ts absolutely convergent almost everywhere on X. 


In fact, if 34,(] fa |) is convergent, 0 < p:& 1, then so are the two 
series of Theorem III, so that Xf» is absolutely convergent almost everywhere 
on X. 


9. The condition | f, |< K-in (6) insures that the convergence of Sfx 
almost everywhere on X implies the convergence of S#:(f,). In this section 
a different condition will be discussed which leads to the convergence of a cer- 
tain moment series, if a series Xfa of symmetrically distributed functions fn. 
In the special case q == 1, p==2 of (12), the restriction to symmetrically 
distributed functions is not needed, as shown by Marcinkiewicz and Zygmund, 
[23], $3. The resulting theorem is given here for completeness as (13). 
The moment condition of (12) has the form of an inverted Hilder inequality ; ‘ 
thus cS 1 in (12) by Hélder’s inequality. 

(12) Suppose that 0<q<pandc>0. Let the functions fa on Xn be, 
symmetrically distributed and let cMy(| fa |)? S Mg(|fn|). Then the con. 
vergence almost everywhere on X of Ifa implies that 2M,(|f,|)"/? te 
convergent, where r = Max (q, 2). 


First it will be shown that m, —> 0, where mn = Mo(| fx Lys. If Ba, On 
are the gn-sets Ba — [| fn | > amn], Cx = [| fa | > mA/a], where a > 0, then 


My? = Í. | fn |PdX., = myPa Pins 
hence pnCn = aP, And, from Hölders inequality 
(S | fa dEn)? 5 (ann)? 4 f. | fa IP) eS (armee. 
Cn Ca 


Since obviously f | fn (dE, S amns, it follows that 
Xx-Bn . ik A 
(Bn — Cn) Z atm! f | fa | d£n = atm, 4 (cm — amni — a? Imat). 
Ba-Cn f 


oe ta 
~ 
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Since p (Bm) 2 a (Ban— Cn), one sees that (Bn) > const. > 0 if a is selected 

sufficiently small. Now if lim sup m, > 0 and if in Theorem I, one takes 

K < a lim sup ms, then the series (*) of Theorem I is not convergent, since | 

infinitely many of its terms are > const. > 0. Since this is contradiction 

with the assumption that Zf, is convergent almost everywhere on X, it follows 
that my — 0. 

| Let 4, be the æ, set An = [| f | £1]. Then 


pli Aa) S S | fa PNW S ms, 
| Xs-An 
so that Holder’s inequality implies 
(f° | fa OT (Xn An) PA, | fr Pan) eS mr 
as i E 
Hence the assumption Ma(| fa N= = cma? implies that 


Si fn |% dXa = cma? — m, 


where the right side is positive for sufficiently large n, since Mn — 0 and q < p. 
For such n one now obtains from Hélder’s inequality of r == 22 q ane from 
the definition of An ifr=q 22: 


| S. fax, = fy | fa |0 dXa)" 2 = (cmt — my?) 1/4 == my (c — Mnt) rra, 


This implies the statement of (12) that Ema" is convergent. For ma— 0, 
c>0,p>q> 0, and the series (,*,) of Theorem I reduces to the form 


x Í. fn°dXn, since f, is symmetrically distributed. 


In case q == 1, p= 2 the condition in (12) that f» is nb dis 
tributed may be replaced by the much weaker condition that Af,(fn) = 0. 
In fact, if M, (fa) == 0 and cM2(fn)3 S M (| fn |) for some c > 0, then the 
symmetrically distributed functions f*, of § 4 satisfy dMs(f*,)'S M: (| ff |) 
for a d > 0 which depends only on c, as shown in [23], p. 71. Taking in 
account that the condition cM2(fa)#=< Mi(|fn|) is homogeneous in fa, 80 
that it is possible to normalize f» by placing M2(fa) = 1, one obtains one half 
of the following theorem of Marcinkiewicz and Zxgmund, the other half of 
which is clear from (4). 


(13) If fn satisfies the conditions M:(f») —0, Ms(fn) = 1 and M;(fn), 
c>0 for every n, then the series Xonfx with constant coeficients cn is 
convergent almost everywhere on X or divergent almost everywhere on X, 
according as Xc,* is convergent or divergent, cf. [23], & 3. 
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10. In this § 10 applications are given of a method of Zygmund ([381, 
82). They will be useful to establish the relations of Part II and Part IV. 


(14) Let the n-th partial sum of the series Xcntn on Z—=UZ, be denoted by. 
İn, where the cn are constants and the Ta are defined in I (9)..1f a subsequence 
of the sequence ty is convergent almost everywhere on Z with reference to the 
measure v = Im, then Zes? is convergent. Thus Senta ts almost everywhere 
convergent on Z. 0 


In fact, if a subsequence of {#,} is convergent almost everywhere on Z, 
` then there exists a constant M, and a subset O of Z of positive measure, such 
that | ¢m(z).| <M holds on C for arbitrarily larga values of'n. Thus, 


MC > Í bn? AZ = 3 0x0 + 23/en0- f rridZ, 
Cc n=l CG. 


if X denotes summation over the range 1=n<1=<m. Now the functions 
Tar n < lare orthogonal to each other and orthogonal to a constant function 
on Z.‘ Thus, the Parseval inequality, when applied to the characteristic 
function of C, shows that 


w [nn < 0 — 00) Sh, 
so that, by Schwarz’ inequality, 


(BX ocr: fl raridZ) S aee f ride)? S $( ont), 
and finally 
| mozio (0—5). 
er va 
Since M and C may be selected in such a way that vC is arbitrarily near to 1, 


this completes the proof that Zen»? is convergent. And now (4) implies that 
XCaTn is almost everywhere convergent on Z. 


(15) Let Sn be the n-th partial sum of fn on X —UXy. If the subsequence ' 
{Sm} Of {Sn} is convergent almost everywhere on X, then there exist constants 
an such that %(fn— an) converges almost everywhere on X. 


Consider first the case where the distribution of fa on X» is symmetric, 
i.e., where the sets defined by fa > w and fa < — o have equal measures for 
every real w» Then the function tif, on Xn X Za and fr on À, are equi- 
measurable, i. e., the sets defined by fa > on Xa and fate > © on Xn X Zn 
have equal measure for every real w. Thus if t, denotes the n-th partial sum 
of the series Xfntm on X X Z = (Xn X Z,), then the sequence {tn,} is con- 
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vergent almost everywhere on X X Z. Thus, by Fubini’s theorem, {t,,} is 
convergent almost everywhere on Z at almost every fixed point of X. Hence, 
by (14), Sfarn is convergent almost everywhere on Z at almost every fixed 
point-of X. ` In view of Fubini’s theorem, this implies that Xf, is convergent 
‘almost everywhere on X at almost every fixed point z of Z. Since the fonc- 
tions fa on Xn and fags on Xx X Zn are equimeasurable for every fixed zn in : 
Zn, this implies finally that Zf» is almost everywhere convergent on X, so that 
one may choose ‘a; == 0. in the case under consideration. 

Now let f, be arbitrary and let f*, be the function introduced in § 4. 
Clearly, 3f*x(z,y) has the same convergence property on X X Y as Yfn has 
on X. Moreover f*.(@x, yn) has a symmetric distribution on Xn X Yn. Thus, 
by the case of (15) which has already been proved, 3f*,(x,y) is almost, 
everywhere convergent on X X FY. This implies that Xf*,(x, y) is almost 
everywhere convergent on À at at least one fixed point yo of Y. On placing 
fn(¥o) = Gn, one obtains the statement of (15). This pros was obtained from 
a proof in [24] by a slight simplification. 


Tarore IV. If Sf, is convergent in measure on X —=ILXy, then Sfr 
ts almost everywhere convergent.on X. i 


In fact, if 3fw is convergent in measure on X, then a classical theorem 
states that a subsequence of the sequence of partial sums of Xf, is convergent 
almost everywheré on X. Thus, by (15), there exist constants a, for which 
X(f» — an) is convergent almost everywhere on X. ‘Since Xf, is convergent 
in measure on X, one may choose the constants a, equal to 0. Thus the proof 
of Theorem IV is complete. ; . 


11. The method of Zygmund which led in §10 to (14) and (15) may 
be used to prove certain additional theorems, examples of which are given here. 
Let yon be real numbers such that ymn—> 1 as m— œ for every n and 


let Sa denote the sum Sm = À trmfns if it is convergent. 


(16) If a sequence of constants bn exists such that the sequence (In— bn} 
- is bounded on a set C of positive measure, then there exist constants a, such 
ihat Z(fan— an) is convergent almost everywhere on X, cf. [38], p. 97 and 
p. 100. 


First, one may suppose that ymn = 0 for every fixed m and n > N= Na. 
This is obvious, since the series defining Sm are, by Egoroff’s theorem, uni- 
formly convergent on a subset.of C which has positive measure. 

Next, using the argument in the second part of the proof-of (15), one 
sees that it is sufficient to consider the case in which the distribution of each 
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fn on Xn is symmetric, choosing bn — 0 for every m and aa = 0 for every n. 
And the first part of the proof of (15) may be used to reduce the proof of 
(16) to the case where each fa is of the form cr», cf. (14) and I (9). 

Furthermore, on omitting a finite number of terms of the series enn, 
one may suppose that the measure of the set on which the sequence {8,} 
corresponding to ScA1» is bounded, is larger than 24. Finally, application of 
the method used in the proof of (14) establishes an upper bound for the 
sequence of numbers 


Nw : 
>> Ymn°Cr?. 
#=1 : 


Since ymm —> 1 as n—> œ for every m, this implies the convergence of %c,?, 
hence the convergence almost everywhere of Xc,r,. This completes the proof 
of (16). 

The generality of the summation method in | (16) allows a large number 
of applications. However the next statement may not be obtained from (16), 
since the set C” is allowed to depend on m. The notation Sm introduced at 
the beginning of this $ 11 is used again. 


(17) For every e > 0, let there exist a constant M = M. and sets O™ on X, 
such that pO" > 1—e and that | Sn | < H holds on C" for every m. Then 
there exists a sequence of constants an, such that S(fn — a) is convergent 
almost everywhere on X. | 


The proof of (17) is not essentially distinct from the proof of (16). The 
measure of the set on which the method of (14) is applied may be selected 
larger than 2-4 by choosing e sufficiently small. The case where Sm is selected 
to be the partial sum s of Sf, will be used in § 19. 

A comparison of .(16) and (17) naturally suggests the truth of the 
following statement, of which no proof is known: 


If0<a<1,M>0 and tf there exists a sequence of constants bm and 
a sequence of sets O” in X such that | Sw—bm| < M on C™ and pO” >a . 
for every m, then there exists a sequence of constants a, such that $ (fn — an) 
is convergent almost everywhere on X. 


In the particular case where each Sm is a partial sum of the series 2fn, 
a proof may be obtained by comparing Theorem V of Part II and Theorem 3 
of [15]. An analysis of the prota of this last theorem might lead to a proof of 
the above conjecture. 


NES 
oe 
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‘PART III. Series of Independent Functions. 


' Convergence criteria of series of independent. functions on [0,1] in the 
sense of Kolmogoroff or, equivalently, in the sense of Steinhaus (cf. [20] and 
` [11]), may be derived immediately from the convergence criteria of Part II 
for Xf, on X — ILX,, where each fa is a function on X,. It is immaterial 
whether the independent functions are defined on the interval CO, 1] or on any 
set Z carrying a measure y such that vZ = 1. 


12. Let {gn(z)}, {g’n(z)} be two sequences of real valued measurable 
functions defined on spaces Z, Z’ carrying abstract Lebesgue measures y, v 
respectively. These sequences are said to be equimeasurable if for any finite 
number of Borel sets h,’ -, 9 of real numbers, one has vO = vC’, where - 
C, C are the sets defined by i 


| C: ga(2) COn, (n=l, k); Ot gr) COs, (h=, 5k). 
The functions g,(z) on Z are said to be independent on Z if for 


x 

any finite number of Borel sets Qi,---,Q, one has vO =. II Cy. Here On 
n=l 

and C are defined by the inequalities: 


Cx: -gn(2) Co; oO: gn(Z) Ca, for n=1,- x nk. 


The proofs of the following statements are evident, cf. e. g., [24]. 

If {gn} and {g'n} are equimeasurable sequences, then the functions obtained 
from {g,} and {g'n} by any limiting process (reducible to convergence in 
measure) are equimeasurable functions. 

If {fn} is a sequence of functions obtained on X —=IX, by the con- 
vention I (7), then {fn} is a sequence of independent functions on X. 

If the sequences {fn}, {gn} are independent on X,Z respectively, then 
these sequences are equimeasurable, if and, only if the functions fn and gn are 
- equimeasurable for every n. 


13. Now let {gn(z)} be a sequence of independent functions on a space Z, 

which carries a measure y for which vZ = 1. For every n, let X, be the space 
of real numbers. Let pa be defined on X» by the definition pa(Q.) —v(Cna), 
where Q, is any Borel set in X, and Cy is the z-set On — [gn(z) C On]. Let 
a function fa(£n) be defined on Xn by placing fu(zæ) —2, for every (real 
number) a in Xa. From these definitions it is clear that gx and fn are 
equimeasurable functions. Now let the measure x = Ips be introduced on 
the product, space X —ILX,. Then the statements of $ 12 imply that the 
sequences {fa} on X-and {gn} on Z are equimeasurable sequences of functions. 
Thus one obtains the following theorem: 
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_ (1) If {gn} is a given sequence of v-measurable independent functions on a 
` space Z for which vZ = 1, then there exists a sequence {fn} of u-measurable 
functions, defined on a product space X == ILX, by means of the convention 
I (7), such that {gn} and {f,} are equimeasurable sequences. 


Thus, according to §12, the criteria of Part II for different types of 
convergence of series of the type of Zf», retain their validity if Xf» is replaced 
by Zg», where the gn form a sequence of le functions on a ue Z 
of total y-measure 1. | 

. As another application, note that as a consequence of (1), Theorem I, ' 
82 of [23] is a very special case of (16.1) on p. 280 in [9]. The more special 
character of the former, has as one of its consequences, that the former does 
while the latter does not allow a direct generalization to the case p = 1, cf. [24]. 


14. This § 14 contains the negative answer to a questiòn of Kac and 
Steinhaus concerning the relation between completeness and independence, 
cf. [80], §6. 

Tf the functions g» on Z are bounded, then clearly the sequence of powers 
fn°, fn, fa’, * * + is complete on Xx, so that the set of all monomials in the fa 
is complete on X = ILX,. It cannot be concluded that the set of all monomials 
in the g» is complete on Z, or even that.if this set is not complete, then the 
set of independent functions g, on Z can be enlarged by a non-constant func- 
tion. In fact, if g(x) is defined on [0,1] by g(a) = 2424 or 1 — 24(1 — x)à, 
according as 0S 2S} or $< 231, then the sequence g°, g*, 9’, g°,: °° 
is not complete on [0,1], and if f is a function on [0,1] such that f and g 
are independent, then f is a constant almost everywhere. If f is not constant 
and f and g are independent, let A be a set defined by 4: f(x) <a, where. 
w is selected in such a way that the measure of A is neither 0 nor 1. A con- 
tradiction is now easily obtained by applying the fundamental theorem of the 
calculus to the characteristic function of A. 

Closely related is the remark that a sequence of independent functions gn 
(or even of their powers (ga)") on [0,1] is never complete. In fact, if fa is 
the function on X, corresponding to gw, and k 1, then frf: cannot be approxi-: 
mated by linear combinations of the (f,)" unless either fx or f: is constant. . 


. PART IV. 


In this part certain convergence criteria for infinite convolutions will be 
derived from the corresponding criteria in» Part II. As soon as the connection 
between the theory of infinite convolutions and the theory of the series in 
Part II has been established, one may transcribe these theorems without 
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further proof (although theorems of Part II and Part TV are never equiva- 
lent). This explains the list of theorems without proofs in § 20 and § 21. 


15. A distribution function s == e(t), — œ < t< + 0, is defined to 
be a monotone function such that o(— œ) = 0, o(+ ©) = 1. Two distri- 
bution functions will be considered as identical if they are equal at their 
common continuity points. Thus two distribution functions are identical if 
they are equal at a dense set of values of t. A sequence {ox} of distribution 
functions is said to converge if there exists æ distribution function o, such 
that os(t) —o(f) holds at every continuity point of o. Note that {sẹ} is 
not considered to be convergent”if ou(#) > &(t) holds for every % but a(t) 
is not a distribution function. If o,,0 are distribution functions, then on, >0 
obviously holds if o1(#) —>o(t) for a dense set of values of t. The following 
criterion for the orog of a sequence {on} of distribution functions is 
not quite obvious. - 


(1) The sequence {on} is not convergent if and only tf there exists an a > 0 
and for every n, an m >n such that | om(t’) —on(t”)| > a ‘holds m every 
land ¢” in at least one interval of length a. ; 


That the existence of such an a is not compatible with the convergence 
of {on} is obvious. Thus, it remains to prove that if such an a does not exist, 
then {cn} is convergent. 

By a theorem of Helly, a subsequence of {on} may be selected which tends 
everywhere to a monotone function @(t). If to is a continuity point of «, and 
«> 0 is arbitrary, let 8>0 be chosen such that | a(t) + 28) —a(to)| < €. | 
By assumption, there exists an n = ne such that for every m > n = ne, the 
inequality | on (t) —on(t”)| Se holds for at least one set of values t, t” on 
every interval of length 6. 

Let the element om, m > 1143, of the subsequence which determined a 
be such that | a(t) + 28) — om (to + 28)| < ¢, 80 that | om(to + 28) —a(to)| | 
< 2e. Hence, the definition of n = ns implies first | on (to + 8) — a (to) | < 8e, 
and then | op(to) — @(to)| < 4e for every p > n= ng. Thus, on(to) —> a (to) 
at every continuity point of a(t). Finally, « must be a distribution function. 
In fact, since a(+ œ) and a(— œ) exist, the condition which was used 
above to determine & as a function of e and to, may be satisfied by the same 
8>0 for a fixed «>0 and every tọ which ‘is sufficiently large. Since 
op(+ ©) = 1 and op(— œ) = 0 for every p, this implies that a(-+ co) == 1 
and a(— 0) =0; so that a is a distribution function, and the proof of (1) / 
is complete. . a 

A distribution function o= o(t) determines uniquely a Lebesgue- 
Stieltjes measure on — œ < t< + œ. This measure, which will also be 


` 
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denoted by o, may be obtained by a well-known extension, from the definition 
cA =o(b) —o(a), where A is the t-set a St <b and a,b are continuity 
points of ø. Thus ot denotes the o-measure of the point t, i.e., the jump of 
a(t) at é. The integral of an integrable function g(t) with respect to this 
measure o is the Lebesgue-Stieltjes integral 


+00 


f oO. 


The spectrum of o is defined to be the set of those values of ¢ for which 
| o(¢ +e) —o(t—e)| >0 holds for every e>0. The point spectrum of © 
is the set of those ¢ for which ot > 0, i.e., the set of discontinuity points of o. 


16. If cı and cz are two distribution functions, then the convolution 
1*o2 of 01 and cz is defined by 


sat) f (té —s)doz(8) -ff do, (r) doz(8) 
react 
where the double Lebesgue-Stieltjes integral on the right represents the 
measure of the (r,s)-set [r + s < t], if in the (r,s)-plane the product of 
the two measures o; and v2 is used. From this second expression for o; * o 
the following statement is clear. 


(2) The function o, * 02 is a distribution function and the point spectrum 

(spectrum) of o1 * o, may be obtained by adding arbitrary elements of the 

point spectra (spectra) of cı and o, (and forming the closure). Moreover, 
+00 


0, *02(¢t + 0) =f cı(t—s +0)doa(s) and o, * oot = X of os. 
e r+s=t 
. Using Fubinÿs theorem, one sees that the convolution of any finite number 
of distribution functions satisfies the commutative and associative laws. In 


what follows o, T, on, Tn always denote distribution functions. 
(3) Lf ono and ta > 7 as n— ©, then on * 12D * r. 


It is sufficient to show that on *ta(to) — o * ta (to) —> 0, as n— œ, at 
` every continuity point fo of o*7. In fact, one obtains the statement 
o * Tafto) —o*r(to) 0 on replacing o, on, Tmn by T, Ta, o, and the two 
together imply (3). 

 Itfe>0 is given, let 1,- - ~, Tp be a finite number of discontinuity points 
of a(t) which are such that the sum of the jumps at all remaining discon- 
tinuity points of a(t) is less than e Since, by (2), the p numbers to — 1, 
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(k = 1,: - yp) are continuity points of r({), one may determine the non- 
-overlapping intervals Ty: de < t< br, in such a way that Zs[r (b+) — r(x) ] 
<e, where a, < to — 7x < br, and the ay, by are continuity points of r(t). 
Next, one.can determine M in such a way that 34[r, (br) — ra(ax) ] < 2e 
for-n > M. On the other hand, one may determine N > M in such a way 
that | on(t) —o(t)| < 2e holds for every n > N and for every t which is not 
in any of the intervals to — bx < t < to— ar. Thus, on separating the con- 
tributions of the intervals I; and of the rest of the integration domain, one has 
| : +00 | 
‘| on® ra (fo) — o * ru (to) | = | f [on (to —8)-—o(to—8) ]drn(a) | S 2e + 2e, 
J | 
if n> N (> M); 80 that the proof of (3) is complete. 
Let w denote the distribution function for which w(t) — 0 or 1 according 
ast?<0ori>0O.° 


{ 
(4) If on, tx are such that on —:0, on * rx >a as n— œ, then ta — w. 


If «>0 is given and ft, —t¢ are continuity points of o for which 
a(t) —o(—t) > 1—e, then N may be chosen such that on(t) — on(— t) 
>1—2e and on * ta(t) —on* ra(—t) >1—2% for n>N. Thus, for 
n>N, C | 

' -2t +00 - at 

1— 2< f + f+ f lats) —on(—t—8) ]dra(s) 
-0 2t -2t | 
SS 2 + rn(2t) — ta (— Èt), | 

where the variation of r, in the first two integrals and the integrand of the 
last integral have been majorized by 1. Since ra (2t) —ta(— 2t) > 1— 4e 
for any «e and a suitable t == te one can find a subsequence of the sequence 
{ra} which tends to a distribution function r. From (3) and the first 
assumption of (4) one sees that the corresponding subsequence of on * ta tends 
to o*r; so that o * r == o. Now if t —> oœ did not hold, one could select r to 
on any interval consisting of negative t-values. Thus r == w, as stated in (5). 


(5) If o*r=o then r—0. 


Let to > 0 be given; the maximum m of e(t + to + 0) —o(¢— 0) for. 
— œ << + œ exists and. is positive, and is attained at every point of a : 
bounded closed t-set, Ao. Let ti, t2 be the least and largest values of ¢ in Ao. 
Then (2) implies that ° 


a(t +t + 0)—o(t,—0)— f [o (ty + ts —8 + 0)—o(t; 8 —0) ]dr(s) =m. 


k 
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© Since the total variation of r is 1, and since 0 So(¢+ to + 0) —o(t —0) 
£ m, this is possible only if r(s) has the variation 0 on any interval on which 
o(t, + to —58+ 0) —o(t; —s— 0) <m holds everywhere. In particular, . 
the total variation of r must be 0 on any interval ‘consisting only of positive - 
t-values, Using t, instead of ¢,, one sees that r also has the total variation 0 . 
on any interval consisting of negative t-values. Thus r == «w, as stated in (5). i 


17. If {cy} is a given sequence of distribution functions, let on be the 
convolution 


d.n Kop — où * on * . Fon, 


and let the hé convolution of the on be defined formally as 
K on = 0102 * 03%: ; 
The infinite convolution Yo: is said to be convergent if there exists a dis- 


tribution function o such that ©.» —> o, in which case one writes o = Xon. On 
denoting by on.m, where n < m, the distribution functions 


On, m = Onyi * Onan Ÿ° Om 


and supposing that o == o» is convergent, one sees that not only c,» —>o a8 
n — œ but also o.n * on.m = 0m, n0 matter how m > n depends on n. Thus, 
by (5), on.m—>o as n->» œ, for arbitrary m—m,. This proves one-half of 
the following Gauchy criterion for convergence of Won: 


(6) The infinite convolution À an is convergent if and only tf on.m— vw as 
n— œ no matter how m > n depends on n; cf. [10], Theorem 1. 


In order to prove the second half of (6), let it be assumed that the 
sequence co.» is not convergent, so that, by (2), there exists an a > 0, such that, 
for every n, there is an m >n for which | o.(t’) —o.n(t””) | > a holds for every 
v,¢” in at least one t-interval of length 3a. By the assumption that ox m — o, 
one may select this m and n in such a way that | on.m(+ a) —on.m(—à)| 
>1—a Now, if # —a, Y, Ÿ +a are continuity points of on and om in 
the above mentioned interval of length 3a, one has 


o.m(t) = f T f F f o.n(t’ —s)don.m(8), 


os f+ JEE and mamta f set +a); 


where 


80 that (1—a)v. (U!—a) S Som) =e a) +a. A contradiction ‘is 
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now obtained, since there must exist a t” between ¢’ — a and ¥ He a for which 
| omt) —o.n(t”)| Sa, while | o m(t) —o.n(t”)| > a for any such t. 
If o= Yon is convergent, it follows from (6) that, on placing 


© Or, = Gry One ** + +, 80 that c= S.a + on, One has oa, > w as n— co. 


(Gbis) If Won is convergent, then the (topological) limit of the spectrum 
of o.n exists and is the spectrum of À ox; cf. [10], Theorem 3. 

This means that if for every ‘e > 0, the interval I, : to—e< t< tote 
contains points of the spectrum of e,» for infinitely many n, then, for every «, 


` the interval J, contains points of the spectrum of o,» for all but a finite number 


of'n and t is in the spectrum of o. It is sufficient to prove that to is in the 
spectrum of c, since the other part of the statement follows then from e,n — 0. 
Thus it. is sufficient to prove that for every e > 0, one has o(to + 2e) 
—o(to— 2) >0. 

Let n be chosen such that Ie contains a point of the Soraa of oy 
i.e, such that on(to + €) —o.n(to—e) > 0, and let n be so large that 
on.(€) — on. (—e) > 0. The proof is now evident, since one obtains easily 
from o = 0,1 * on, that 


a (ty + Re) — o (to — Re) 
= [o.n(to + €) Seg hile (+) — or. (—9]> 0. 
Thus, by means of (2) and (6 bis); the spectrum of o == ox may be 
determined from the spectra of. the on. ` 


18. Let f(z) be a -measurable function on a space X, which carries a 
measure y such that pX ==1. Then the distribution function a(t) of f(z) 
is defined by f(t) — At, where A; is the z-set [f (£) < t]. Clearly the. dis- 
continuity points of o(t) are the values of t at which aB: 34 0, where B; is 
defined by f(x) == t. ‘Moreover, o (t—0) = pAr and o(t +0) = pA: + aB. 

On the other hand, if a distribution function o(t)-is given, then a func- 
tion f(x) may be defined on a suitable space X such that e(t) is the 
distribution function of f(x). In fact, let X be the open interval 0 < z <1 
and let » be the Lebesgue measure on this interval. Then the “ inverse 
function ” of e(t), which may be defined on 0 < zx < 1 has o as its HOUR 
function. 

If f is u-measurable on X, and o(t) is its baton function, one can 
express certain y-integrals on X in terms of corresponding o-integrals and 
conversely, as follows: 


(7) One has 


È gG@)ax—  g(t)ao(t), 
J Je 
if at least one of the integrals exists. HF 
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19. Suppose that fa(Tn) is pn-measurable on the space Xx, where 
Knn = 1, and that the functions f, are considered, according to I (7), as 
p-measurable functions f.(z) on the product space X = ILX,, where u = Tpn. 
Let on(t) be the distribution function of f,(t). 

The distribution function of fı +f. on X is o, *o2(t). In fact, if À: 
is the X-set [fi(z) + f(x) < t], one has by I (8) and by the definitions of 
o (t), oa(t) as distribution functions of fa, fs: 


PERNES az= f f did) =o, *oa(t—0). 
film) +f) t . r+8 <t 
By an easy induction argument one obtains the proof of the following 
statement (if use is made of notations introduced in § 17): 


(8) The distribution function of f, + fs +: - :+f, on X is on and the 
distribution function of fna + fma +: + fm on X is on m 


The connection between Part II and the theory of infinite convolutions 
may be formulated as follows: 


Tueorem V. The sequence Xf, is almost everywhere convergent on 
X = UX, if and only if the infinite convolution Yo, of the distribution 
functions o» of the fa is convergent, in which case o = Yt on is the distribution 
function of Sf,; cf. [10], Theorem 32. 


In fact, by (6), Xon is convergent if and only if on.m—>w as n>, 
for arbitrary m = mn >n. Now it is clear from (8) and from the definition 
of the distribution function of a function, that the condition oc, m—>v is 
satisfied if and only if the sequence of functions ga (£) = fn(t) +:°°:+fm(x) 
tends in measure to 0 on J, as n— œ, for arbitrary m = m,a >n. This is 
the case if and only if the series Sf, is convergent in measure on X. Finally, : 
by Theorem IV, 3f, is convergent in measure on & if and only if Sf, is con- 
vergent almost everywhere on X. The last statement of Theorem V is evident, 
since the definitions clearly imply that if s, tends on X in measure to s, then 
Ta —> r, Where tn, r are the distribution functions of S», s respectively. 

Another proof of Theorem V, which now follows, does not make use of 
the considerations of §16 and §17. In fact, it may be used to give a second proof 
of the statement (6) of $ 17. One half of Theorem V is obvious. In fact, if Sf, 
is convergent almost everywhere on X, then’ Xf, is convergent in measure on 
X, so that {on} is a convergent sequence of distribution functions and 
o = lim ox = Wo, is the distribution function of Zf». Now, suppose that % on 
is convergent, so that {o.a} is a convergent sequence of distribution functions. 
If. «> 0 is given, let M — M. be so large that on(M) —ou(— M) >1—e 
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for every n. In the space X = ILY,, let C* be the set C"— [| sa(z)| S M].- 
Since, by (8), o is the distribution function of sa =f, +- > -+ fa, one 
has pC* = o.n(M) — o.n(— M) > 1—e On selecting Ya in II (17) to be 
the partial sum s* of Xf”, one sees: that X(fa— a) is convergent almost 
everywhere on X for a suitable choice of the aq: ‘Thus, by the half of Theorem V 
which has already been proved, + o„(t— an) is convergent. Since also os 
is convergent, the sequence Xa, is convergent. And finally, 3f, is convergent 
almost everywhere on X. This completes the second proof of Theorem V. 


20. It is clear from (7) that if the k-th moment Non) of on is 
defined by 


Ns(en) = f don (t), 


then Mz (fa) = Ni (on); if at least one of Tese moments exists. Moreover, if 
+00 


Nalon) = T {t — N: (on) }'don(t) — Naon) ) — Mla ; 


then Me(fn) = Ñ: (0a). Similarly, if An denotes the aes [| fa(an)| S K], 
then f au 

bn (An — An) = o(—K—0) +1—0(K +0); 
and 


Jie g (fn) dXn = Í: a(t) don(t) 


if at least one of the integrals exists. f ! 
As a consequence of Theorem V and the above remarks, one can write the 
convergence criteria of Part II for fa as convergence criteria for + on. 
Accordingly, in the following list of theorems, the proofs are represented by 
references to statements in Part II or to preceding theorems in the same list. 


THEOREM VI (Three series theorem). The infinite convolution 3 on is 
convergent tf and only tf, for a fixed K > 0, the following three series are 
convergent: 


3 (1+ on(—K)—on(K)) ; sf" tdon (t); af Piatt) S tdon (t))*}, 
| in which case the same holds for every K > 0. [Cf. Theorem I.] 


` The shortest proof of Theorém VI may apparently be given by noting 
first that the convergence of #04 is equivalent with the convergence in meas- 
ure of Sf, (using (8) and (6)), then applying II (2), (which obviously 


14 
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retains its validity in the case of convergence in measure), thus reducing the 
proof of Theorem VI to the proof of (10), which may be obtained. from the 
theory of Fourier-Stieltjes transforms; cf. [10], $ 4 and Theorem 34. 

Proofs of (9)-(11) may be obtained either from Theorem VI or from 
the corresponding statements in Part II and also from the theory of Fourier- 
Stieltjes transforms of distribution functions. 

(9) If No(on) exists for every n and if XN2:(on) is convergent, then o = Ft on 
exists if and only tf ZN, (ox) is convergent, in which case Ni (0) SADRE 
and N(o) = ZÑ: (on). [Cf. II (4)]. 

(10) If on(—K) == 0, on (K) — 1 for every n, then o = K on exists if and 
only if both series ZN: (on) and ZÑ:(on) are convergent. [Of. II (6) and 
H (4).] 


(11) The infinite convolution X on is convergent if so are the series 


x ff ante and à fi t |? don (t) 


for a fized p 1<p&2. [Cf II (7)] 
The necessary condition for the convergence of Sf» in §9 appears now 
in the following form 
(12) Ifqg<p, ¢>0 and if for every n, on(t) = 1 — on(— t) and 
+00 +00 
f ildo) So ELU) 
-00 -œ 


then the convergence of X on implies that 
a( f Lip don(2) yr (r — Max (g, 2)) 
-00 i 


ts convergent. [Cf. II (12).] 


21. The infinite convolution Y o, is said to be absolutely convergent if 
it is convergent and remains convergent on arbitrary permutation of the on. 
In view of Theorem V, this is the case if and only if 3f, is almost everywhere 
convergent no matter in what fixed order the terms are taken. Thus one 
obtains Theorem VII (which may also be obtained as an immediate consequence 
of Theorem II), and (13). > 


THEOREM VII. The infinite convolution Yo, is absolutely convergent 
af and only if, for a fixed K > 0, the three series 
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3(1+on(—K) — (4); 31 f “tden(#)|: x f edot) 


are convergent, in which case the same holds for every K > 0, and the infinite 
convolution. o == Won 18 independent of the order i) the terms. [Cf. 
Theorem Il] 


(13). The infinite convolution % on is absolutely convergent if the two series 


+00 


| S| S tant ‘and af | t [edon (t) 


are convergent for a fixed p, 1 < = 2. [Cf Theorem VII or II (7) and 
Theorem II.] 


From the remark ETE Phedsen TI, one obtains the follow nig 
statement: | 
(14) If *on ts connenpent, then there exists a sequence {an} of constants 
such that X ou (t — an) is absolutely convergent. 

. In the theory of infinite convolutions it is hard to distinguish between 
the two types of convergence discussed in §7 and 88. Thus the criteria 
corresponding to those of § 8 are listed here as criteria for absolute convergence 
of Won. For (16), cf. [34]. | 
(15) The infinite convolution % o, ts absolutely convergent tf for a fixed 
K > 0 the two series 


f . K . 
S(1+on(—K)—or(K)) and 3 a |t| don (t) 
are convergent, in which case the same di for avery K> 0. [ Cf. Theorem 


I] 


(16) The infinite convolution x on is absolutely convergent if the series 
| | x +00 os 
3 [LEP de(t) 
-0 : 


ts convergent for a fixed p, 0 <p = 1. [Of. II (11).] | 


22. Let Y. represent a class of Borel sets on the infinite t-axis which 
class is invariant under translations of the t-axis, and which includes, along 
with any sequence of sets Ar, the set SAn. 

Such classes are, for instance, ‘the class W of all enumerable sets; the 
class W” of all Borel sets which are 0-sets in the wae sense; the se of 
all O-sets BREE to any Hausdorff measure. 


= 
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A distribution function o(t) will be called pure if it has, with reference ` 


to every class Y, the following property: If the o-measure of one set in an Y, ` 


is not 0, then the o-measure of some set in this À is 1. If, for instance, o is 
pure and has a discontinuity point, then the o-measure of a set in W is not 0, 
so that, according to the definition the c-measure of some enumerable set is 1. 
Such a distribution function will be called purely discontinuous. It is easy 
to prove that o is pure if the o-measure of every set in W” is 0, i.e., if o is 
absolutely continuous. On the other hand, although a continuous, but not 
absolutely continuous, pure distribution function o is always singular, a 
singular distribution function need not be pure. These notions allow the 
formulation of the following theorem. 


THEOREM VIII (Pure Theorem). If the on are purely ECO aoe and 
o == Won converges, then o is a pure distribution function. 


Let fn on Xy be, for every n, a function which has o, as distribution func- 
tion. Since ron is convergent, the series Xf, is convergent almost everywhere 
on X = HY.. And since each o, is purely discontinuous, the function fa may 
be assumed to attain an at most enumerable set of distinct values on Xa. Let 
M denote the (enumerable) modul of values of t generated by the values taken 
by all fa, and if A is any t-set, let M(4-)A denote the set formed by the sums — 
of any element in 4f and any element in A. It is clear that if À belongs to a 
class M, then so does A (+) A. 

Now suppose that the set A of the class Y is such that the o-measure oA 
of A is positive, i.e. that f — Xf, takes values in A on a set D ,of positive 
measure in X —I1X,. If C denotes the subset of X, where f takes values 
in A’ = M(+)4, then clearly C satisfies the requirements of I (6), so that 
the measure of C is either 0 or 1. Since C D D and the measure of D is 
positive, this implies that the measure of C is 1. Finally, the measure of C 
in X is equal to the o-measure of the t-set A’ == M(-+-)A, so that oA’ = 1. 
This completes the proof that o == vo, is a pure distribution function. 

In view of the remarks at the beginning of this Section, Theorem VIII 

-immediately implies the following statement: 


(17) If o— Xo, is a convergent infinite convolution of purely continuous 
distribution functions on, then o ts either purely discontinuous or singular or 
absolutely continuous; cf. [10], Theorem 35. 


23. It seems to be very difficult to decide in general which of the cases 
of Theorem VIII take place for a given convergent infinite convolution + on 
of purely discontinuous distribution functions ø». In other words, no general 


‘ rule is known, to decide for a given class Y whether or not the o-measure of 
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each {-set in Y is 0 or not. In this section a proof will be given of a theorem 
of P. Lévy, which gives the decision in case of the class W of § 22. Note that, 
according to § 15, the expression ot denotes, for a distribution function o and 
a real number #, the value of the o-measure of t, i. e. the jump of o at t. 7 

The following lemma is related to the proof of Theorem VIII in [21] 
and to [15], Lemma 3. 


(18) If0<dZÆ1, 0<6e<d, 1>0 and if the distribution functions 
À, p, v have the properties 
Awe pty, ACa+ 21) —A(a— 21) <d+e, v(l) —v(—l) > 1—e 


while Aa = d holds for some value a of t, then there exist real numbers b and 
c==a—b, such that 


(i). d—e< pb < Ste z (ii) Je] <l; (iii) w>, 


If p, q, are the discontinuities of » and v, then one obtains from the 
definition of a convolution, since À = y * y: 


(*) RM a o) (vq). 


In (*) the terms for which |a—p| <1 (or |q| <1) satisfy 3vqg <1, and 
the remaining terms satisfy Zvq < e, Zup S 1. Hence 
d<e+ aa Re 

If this maximum is reached at b, one obtains (ii) and the left half of (i). 
Next, one has 

d- e> Ala + 2) —A(a—2l) = f [ea —t+ 2) —p(a—t—21)]d(t) 

2 [e(a +) —u(a—1](i—e), 
which implies the other half of (i) and in addition, using the left half of (i), 
E E E E He 


Separating now in (*) the terms where | a— p | > l and the term p = b, 
q == c, from the other terms, one finds 


d= 





which clearly implies the remaining inequality (ili) of (18). 
In the proof of Theorem IX, use will be made of the following statement 
(19) which is an immediate consequence of (18): 
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(19) If À — pn * vn, for every n and ua —> À, so that v, —>w by (4), and if 
for some a, one has Aa = d > 0, then there exist real numbers bn, Cn such that 
ba + Cn == a, bn — 4, abs > d and Cn — 0, vata > 1. 


8 


The characterization of the purely discontinuous case of Theorem VIII 
or (17) may be formulated as follows (cf. [21], Theorem XIII, in the proof - 
. below use has been made of a letter from B. Jessen; cf. [3 bis], footnote +5). 


THEOREM IX. If o= Won is the convergent infinite convolution of the 
purely discontinuous distribution functions on, then o ts purely discontinuous 
if and only if 
(**) l ` Hd, 0, where da == Max ont. 


The sufficiency of condition (**) for the existence of at least one dis- 
continuity of o is obvious, so that the corresponding statement of Theorem IX 
follows from Theorem VIII. It remains to prove that if o has at least one 
discontinuity point, then (**) is satisfied. 

. Let o.n, on. denote the same convolutions as in $17. On applying (19) 
` to the equation o = on * on., instead of to À == un * An, one obtains a sequence 
of numbers cs, such that ca —> 0, os.cn—>1, as n—> œ. Thus, replacing 
on (t) by on(t— Cu-1 + Cu), one can suppose that Ca = 0 for every n, i. e., that 
on.0—>1, which clearly implies 0,0 —>1. On omitting a finite number of 
terms, one may suppose that o.,0 > $ for every n, and o0 > 4. Then: 


f o.n0 Z d= 00 > $; o n0 1; ondol; on0—>1, 
and also - 
Xoaps 1—d; 3 P E 1 — ob. 


Thus, from (2), 


bab — I 040 HS0 S ogapoq £ o0 +3 I o0: (1— d) (1 o0 
A2 I-k+1 . Owép=-q k=1 k=2 1=-k+1 
80 that 
nee Hat des) (ls Mat) Ta S E A 
k=1 ke=2 &=1 AR 
© On letting n tend to infinity, one obtains Ile,0 = 2d— 1 > 0, so that the 
proof of Theorem IX is complete. 
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K-CYCLIC ELEMENTS.* 
By J. W. T. Younas.’ 


The introduction, by G. T. Whyburn,? of the concept of cyclic element 
into the study of continuous curves proved so fruitful that it is only natural 
to hope for a further decomposition of a Peano space. In this connection 
' Whyburn himself has defined cyclic elements of higher order where the space 
is a closed and bounded subset of Euclidean n-space.* The approach is com- 
binatorial. Recently an attack has been made on the problem from a purely 
point set theoretic standpoint by D. W. Hall“ The approach taken in this 
note is similar to that of Hall in that the space is a cyclicly connected Peano 
space and the decomposition is point set theoretic. 

To enlarge on this comment we shall survey, for a moment, the two 
equivalent notions of cyclic element from which the generalizations spring. 
In a Peano space a proper cyclic element Mọ consists of the totality of points 
which are conjugate to a point p which is not an end point or a cut point. 
A proper cyclic element M (a, b) may also be defined as the totality of points 
which are conjugate to each of two distinct points a and b which are conjugate 
to each other.’ The generalization of Hall may be said to be from the first 
definition while that here presented is from the second. 

A large number of desirable properties are common to both concepts but 
it is also true that the generalizations do not seem to be all one might hope for. 
Sadly missing, for example, in any analogue to the theorem that the product 
of a cyclic element and a connected set is connected. On the other hand it is 


* Received August 18, 1939; Revised January 8, 1940. 

3 The work on this paper was done while the author was in residence at Charlottes- 
ville. He wishes to take this opportunity to express his appreciation to Professor 
G. T. Whyburn and the Department of Mathematics at Virginia for their helpful 
cooperation. | 

2G. T. Whyburn, “Cyclicly connected continuous curves,” Proceedings of the 
National Academy of Science, vol. 13 (1927), pp. 31-38; W. L. Ayres, “ On the structure 
of a plane continuous curve,” Proceedings of the National Academy of Science, vol. 13 
(1927), pp. 650-657; C. Kuratowski and G. T. Whyburn, Sur les éléments oycliques 
et leurs applications, Fundamenta Mathematicae, vol. 16 (1930), pp. 305-331. The 
reader is asked to consult this last paper for the terminology of the subject and for an 
extensive bibliography. . 

8G. T. Whyburn, “ Cyclie elements of higher orders,” American Journal of Mathe- 
mattos, vol. 56 (1934), pp. 133-146. 

t The author has had the privilege of reading Halls contribution in manuscript. 

ë For a proof see Kuratowski and Whyburn, loc. cit., p. 311. 
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_ hoped that this note may serve as an indication of a general direction of 
approach which seems to be adequate for certain purposes. 

_ It will be noticed that the discussion is confined entirely to non-degenerate 
elements and the theorems proved are analagous to those true for non- 
degenerate cyclic elements in a Peano space. It follows that nothing is said 
about a “hyper-space,” for the non-degenerate elements will not cover the 
space. These remarks, together with others in this note, give rise to problems 
which might be of some interest to the reader. - 


- Bi-conjugacy. 


1.1. The space (which we shall denote by the symbol 1) is a cyclicly con- 
nected Peano ‘space, or we may consider it as a non-degenerate cyclic. element 
-'of a‘ Peano space. In general, small letters will denote points of the space, 
while capitals will be reserved for sets of points. 


1.2. A point a tg said to be bi-conjugate to a point b (notation: a2 ~ b) 

if for every pair of distinct points x, and z, different from a and b it is 

true that in 1— (x + T2), Sa = 80° In general, a point a is said to be 

k-conjugate to a point b (notation: ak—b) if for every set of k distinct: 
points 2, °°, different from a and b it is true that in 1— (z, +: + zr), 

Sa = So. ‘ 


1.8. We shall confine the discussion almost exclusively to bi-conjugacy ; never- : 
theless, a large number of the statements will be equally true for k-conjugacy. 
Whenever a statement (with obvious modifications) is true for the more general 
concept we shall indicate this by prefacing the remark with an asterisk. In - 
every instance the proofs for the general statements are obvious modifications 
of those offered for bi-conjugacy. 


1.4. The first three of the usual axioms for an equivalence are obviously 
satisfied for bi-conjugacy. That is: (1) a? —boranon?—b, (2) ar a, 
(3) a2—~b implies b2— a The transitivity property is absent but we do 
have a modification of it. 


*THEOREM. If a2 ~a2,2%~-:-2~a,2—~} then any pair of points 
separating a from b must contain a point of the set Ti, °°, tm." 


Proof. Take any pair (p,q) distinct from a,2,:°::,,b. Now 


‘If. we are ire: 1— (a, +... .+ a), then by 8, we shall mean the.com- 


Fe ponent of 1— (®, +...+#,) which contains a. 


T The author is indebted to W. L. Ayres for some valuable suggestions in connection 
with this theorem. 


K-0YCLIG ELEMENTS. "451 


a2~2, hence in 1— (p +q) we have Sa — Se, For the same reason, 
Sao = Say "Sa, Sv. Hence Sa = Sp and the pair (p,q) does not 
separate a from b. 


(For k-conjugacy the theorem will read: Ifak~2,k~-- ka kb, 
then any set of k distinct points separating a from b must contain a point of 
the set T1, : * ,m). 


#COROLLARY, If a2~2,%~+--%~ ay %A~b, amy Rao: 2 Yn 
2—~b, and A2~ nr... Ra ~b, where the rs, ys, and zs constitute 
a set of (L+ m + n) distinct points; then a2 ~b8 


Bi-cyclic elements. 


2.1. The totality of points bi-conjugate to each of three distinct points which 
are bi-conjugate to each other constitutes a bi-cyclic element. That is, if a, b, ¢ 
are distinct points, and a 2—~ b, b 2—~c, c 2 ~a, then they generate a bi-cyclic 
element M (a, b,c) and xe Af{(a,b,c) if and only if 22—a,bande. (The 
totality of points k-conjugate to each of (k +1) distinct points which are 
k-conjugate to each other constitutes a k-cyclic element). 


2.2. A bi-cyclic element need not be connected; in fact, it may consist ‘of 

exactly three points. Let the space be the points on the circumference of a 

circle together with those on three cords which form an inscribed triangle. 

If the vertices of the triangle are a, b and c, then M (a, b, c) =a +b + ce 
However, we do have the following 


“2.3. THEOREM. Any point of a bi-cyclic element is bi-conjugate to any 
other point of it. 


Proof. If z,yeM (a,b,c) then x and y are both bi-conjugate to a, b and ¢ 
by 2.1, and so by 1.4 x2— y. 


This gives a degree of homogeneity indicated below. 


*2.4, THEOREM. If the distinct points z, y, z are elements of M (a,b,c), 
then M(x, y,2) = M (a, b,c). 


Proof. The notation M (z, y,z)} is justified i 2.3. If peM(x,y,2), 
p is bi-conjugate to x, y, z, each of which is bi-conjugate to a. Therefore 
p2~a, by 1.4. Similarly p2—~b,c. Therefore peM (a,b,c). Hence 
M (x,y,z) C M(a,b,c). Similarly M (a, b,c) C M (z, y,2). 

* Throughout the paper a weaker form of this corollary will be used, the form in 
which 1I =m =n =]. In a paper read at the April meeting of the American Mathe- 


matical Society in Chicago (1939), T. Radó called attention to the corresponding “ weak 
‘transitivity ” for conjugacy, thus the key to the situation here follows his remarks. 
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#COROLLARY. If M(a, b,c) M(x,y,z) contains three distinct points 
(u,v, w) then M(a, b,c) = M (z, y, z). 


Both are identical to f (u, v, w). 


*2. 5. THEOREM. If M(a,b,c)-M(zx,y,z) — p + q, then the pair (p,q) 
cuts the space. 


Proof. Suppose the theorem false. Since a2 ~ p2 ~z, a? ~ qR ~r, 

and the pair (p,q) does not cut the space, a2~ s by 1.4. Similarly, 

a2—~—~y,z. Therefore, ae M(x,y,z). By the same argument b ce PACE Y,2) 
and by 2. 4M(a, b,c) = M (x,y, 2). 


Sequences of bi-cyclic elements. | t 


3.1. In connection with the point sab theoretic properties of M(a, b,c) we 
have seen that it may consist of exactly three points and so need not be con- 
nected. Another property is obtained from the following 


*THEOREM. If aa, bn—>b and a 2 —~ bn, (n—1,2,8,: +), then 
aè— b. . 


Proof. Consider any (p,q) distinct from a and b. Take two regions? 
U (a), U(b) without common points, both excluding p and g. There is an 
integer n such'that an eU (a), bneU(b). Now in 1— (p +q), dae So 
bn e Sp and Sa, == Sh, Therefore, Sa = h and a? ~b. 


*3.2. THEOREM. M(a,b,c) ts closed. 


Proof. -Suppose p, — p, pre M (a,b,c), (n—=1,2,3,---). We assert 
that p2— a. The sequence a, a, a, : : - converges to a, Pa —> p and Pn 2 ~a, 
(n=1,2,3,- °). By 8.1, p2~a. Similarly p2~bandc. Therefore - 
peM (a,b,c). Hence M (a, b,c) is closed. 


*3.3. Lemma. If (1) U, V and W are three regions disjoint by pairs, 
(à) there are two sets of points (a,b,c) and (x,y,z) such that a,zeU; 
b,ye V3; cze W, (3) in each set, the points are bi-conjugate to each other; 
then it follows that any pair of the six points are bi-conjugate. 


Proof. It will suffice to show that z2— D. Take any (p,q) different 
from x and b. Some region U, V or W is entirely in 1— (p + q). There 
is a point from each of the sets. (a,b,c) and (x,y,z) in this region, hence 
Ss and Sq both intersect this region and so are identical. 


° A region is understood to be a connected open set. The notation U(p) means a 
region containing Pp. 
39 If there is no such pair of regions, then a = b and so a2 ~ b. 
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*3.4. THEOREM. If {M(dn, bx, Cn)} is a sequence of distinct bi-cyclic ele- 
ments, then lim M (dn, ba, Cn) cannot consist of more than two distinct points. 

Proof. If the theorem is false we may suppose that dn — a, ba — b, 
En > c, where a, b and c are distinct points. Take disjoint regions U (a), 
U(b) and U(c). There is an integer no such that for n > e, ane U (a), 
bne U(b), cmeU(c). Now a2~b, b2~c, c?—a by 3.1, and by 3.3 
a,2~a,b,c. Therefore an e M (a, b,c). Similarly. bx, Cn e M (a,b,c). There- 
fore M (ax, bn, Cn) = M (a,b, 6); that is, the M (an, bay Cn) are not all distinct. 


Using this theorem we may assert something stronger. 


*3.5. THEOREM. There are at most: a denumerable number of bi-cyclic 
‘ elements. 


Proof. In each bi-cyclic element arbitrarily pick three distinct points. 
Tf for a certain bi-cyclic element the points are (a,b,c) then the element is 
M (a,b,c). Now with each element M (a,b,c) associate a positive real number 
y[M (a, b, c)] = min [p(a, b), p(b, c), p(c, a) ].* Take any §>0. If there 
are an infinite number of bi-cyclic elements with associated y greater than 8, 
then we may assume they are M(dn, bn, cn), y[M (an, bn, crn) ] > 8 > 0 and 
En — 0, ba —> b, Cx —> cc. Clearly a, b and c are distinct. But it is impossible 
for lim M (an, bn, ĉn) to consist of three points. The theorem is now obvious. 


*COROLLARY. The points of M (a,b,c) belonging to any other Lio: Le 
element are denumerable. 


No other bi-cyclic element can have more than two points in common with 
M(a,b,c) by 2.4, and as there are only a denumerable number of bi-cyclic 
elements, the result follows. 


Components of 1 — M(a, b, c). 


4.1. Since each bi-cyclic element is a closed set the complement is open and 
so the components of the complement are open as the space is locally connected. 


Lemma. If M(a,b,c) is a bi-cyclic element, and U (a), U(b), U(c).are 
disjoint regions, then no component S of 1— M (a,b,c) can have points in 
common with each of the three regions. 

Proof. If the statement is false then S + U(a) + U(b) is a region 
containing a and b, but not c. Hence there is an are from a to b in it and 
the arc omits c.1? Some point x on this arc is in 9. In the region S + U (c) 


1»(a,b) is the distance from a to b. 
12 This is a well known fact first demonstrated by R. L. Moore, “ Concerning con- 
tinuous-curves in the plane,” Mathematische Zeitschrift, vol. 15 (1922), pp. 254-260. 
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consider an arc from x to c. There is a last point y on this are which is also 
on the arc from a to b. Now yeS and there are three arcs connecting y to. 
a, b and c, each pair of the arcs having only y in common. We assert that 
‘y2—~a. Take any pair (p,q) different from a and y. In 1— (p +4), 
Sy contains at least one of the points a, b, c, and Sa contains the same one 
since a, b and c are bi-conjugate to each other. Hence Sy = Sa. 

Similarly 2 —b,c. Therefore ye M(a,6,c) which is Fes to 
the fact that y «5S. | 

It is clear that this method of proof will not generalize even to tri-cyclic 
elements. As a matter of fact, for a cyclicly connected Peano space the. 
. corresponding result is not true for tri-cyclic elements. 


4.2. THEOREM. The frontier points of S are in M (a, b,c) and there cannot. 
be more than two of them. 


The proof is immediate from the lemma. 


4, 3. It is a well known! fact that every region truly in a cyclicly connected 
Peano space has at least two frontier points, thus it is easy to see that . 


THEOREM. The frontier of a component S of 1— M (a,b,c) consists of 
precisely two points, and these two points (as a patr) cut the space. 


Remark. If the space 1 has the property that no set of (k —1) pone cuts 
it, then a component 8 of the complement of a k-cyclic element M (a, , ax) 
must have at least Æ frontier points. It might be conjectured that it will also 
have, at most k frontier points, but no information seems available on this 
point. 


4.4. THEOREM. If M is a bi-cyclic element and S ts à component of 1 — M 
having p and q for frontiers, then S contains at most one bi-cyclic element 
containing p and q. 


Proof. If M, and Ma are two bi-cyclic elements in § containing p and q 
then since § is a region we have an arc in 9 from m, eM, to mae A. It follows 
that m, 2? ~ ms and so M, = Ms. 


4,5. THEOREM. If S is a component of 1—M then any bi-cyclic element N 
is such that NCS or N-S =0. 


Proof. By 2.3 any point of a bi-cyclic element is bi-conjugate to any 
other point of it. Thus by 4.3 if N-S340,N CS. 


4.6. THEOREM. If {S,} is any sequence of components of the complement 
of a bi-cyclic element M (a,b,c), then d(Sn) — 0. 
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Proof. If the theorem is false, then there is an infinite sequence of com- 
. ponents with diameters greater than some positive 8. It is no restriction to 
‘assume that they are Sn, and (since they are connected) that there exist triples 
of points an, Yn, 2n € Sn, (n = 1,2,3,:--), such that En —> 2, ja — y, Znz, 
‘where x, y and z are distinct. It is clear that z, y, z e M (a; b, cJ. Take three 
disjoint regions U (e), U(y), U(z). There exists an integer n such that 
t, U(x), Yne U(y), 2n¢U(z). That is, a component S, of 1— M (a, b, c) 
has points in common with each of the three regions. This is contrary to 
Lemma 4. 1. 


COROLLARY. If pq there are only a finite number of bi-cyclic elements 
contaimng p and q. 


The proof follows from the above together with 4. 4. 


Continua of convergence. 


*5.1. THEOREM. Every non-degenerate continuum of convergence is con- 
tained in some bi-cyclic element M (a, b,c). 


Proof. If C is a continuum of convergence then no finite set of points 
in C separates C in 1. Certainly no two points outside C separate C. Hence 
there exist three points a, b, ce C which are bi-conjugate to each other. Every 
point of C is bi-conjugate to a, b and c, therefore C C M (a, b,c). 

5.2. THEOREM. Suppose {Kn} is a sequence of continua with the following 
i ` co 

properties: (1) L=lim Kn contains at least three points, (2) L'S En = 0; 
Fra 1 
then there exists a bt-cyclic element M such that L — lim [K, : M]. 

_ Proof. Since we are dealing with a-compact metric space any sequence 
of sets contains a convergent subsequence and from this fact it follows that 
there is a bi-cyclic element 4f which contains lim Ka. As a matter of fact, 
the element in question is that of 5. 1., The proof from this point on follows 
the pattern of the proof for a similar theorem concerning cyclic elements and 
so is omitted. 

The well known theorem alluded to above is used to show that the property 
of containing a continuum of convergence is cyclicly reducible. We cannot 
say that the property is bi-cyclicly reducible since in general the sets [Ka Af] 
will not be connected. 


5.3. Since a bi-cyclic element may not be connected (in fact, a bi-cyclic element 


38 See Kuratowski and Whyburn, loo. cit., pp. 314-315. 
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can have a non-denumerable number of components each of the game diameter) 
we cannot hope for & generalization of the cyclic connectivity theorem within 
a bi-cyclic element, but a generalization is possible within the whole space. 
In fact, we have at our disposal in the Literature the very tool adequate for 
this purpose. According to a result of Nébeling,* if two closed sets in a Peano 
‘space cannot be separated-by the omission of any set of k points then there are 
(k +1) arcs connecting the sets, and the arcs are independent, except in that 
they might have the same end points when considered in pairs. 

If we take x and y two points of a k-cyclic element, this means that 
no set of k points separates x from y in the space and so there are (k + 1) 
independent arcs connecting æ and y in the space. 


Conclusion. 


The reader will have noticed several unanswered questions by this time. 
” We wish, in conclusion, to propose a further problem. One might arbitrarily 
call each single point of the space which is not in any bi-cyclic element a 
degenerate bi-cyclic element, or if a single point is in two or more bi-cyclic 
elements it too might be called a degenerate bi-cyclic element. In this fashion 
a space is covered by its bi-cyclic elements (degenerate and otherwise) so we 
can consider the hyper-space of bi-cyclic elements. One of the most beautiful 
results of the cyclic element theory is that the hyper-space is, in a sense, a 
dendrite. An analogous theorem for bi-cyclic elements would be extremely 
desirable. The fact that the frontier points of a component of the complement 
of a non-degenerate bi-cyclic element are two in number should play an 
important rôle in the attack. 
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1 G. Nôbeling, Eine Verschärfung des iBeinsatzes,” Fundamenta Mathematioae, 
“vol. 18 (1932), pp. 23-38; N. E. Rutt, “ Concerning the cut points of a continuous curve 
when the are curve, AB, contains exactly N independent arcs,” American Journal of 
Mathematics, vol. 51 (1929), pp. 217-246. 
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§§ 1-4 show how simple and natural is Schoute’s representation of the 27 lines by 
the vertices of Gosset’s six-dimensional polytope, and how easily various plane pro- 
jections of the polytope can be drawn. In 88 5 and 6 we find new coürdinates for this 
polytope, and for Elte’s related polytope l». In §7 we derive five quaternary collinea- 
tions (Table IXI) which generate the simple group of order 25920 in a particularly 
elegant manner. § 8 connects these ideas with the lines on a special cubic surface in 
the finite geometry PG (3,4). Finally, in § 9, we show that the 240 vertices of Gosset’s 
eight-dimensional polytope 44 lie by sixes in forty planes which correspond to the 
“non-isotropic ” planes of PG (3, 4). 


1. Summary of properties of the 27 lines. To construct the configura- 
tion of the lines on the general cubic surface, it is usual to begin with a 
double-six 

Ay (la Ag Qs Qs Ay 
by be bs ba bs bo, 


where dy, Aa, Ua, Qs, Qg are five skew lines having a common transversal b, 
and so on.? The remaining fifteen lines are c12, C18, ` `, Cse, Where Cız is the 
line of intersection of the planes a,b,, a,b,. It is then found that ci. intersects 
Cs; but is skew to ces, and that the lines form thirty-five other double-sixes, 

* Received September 15, 1939. 

1 Presented to the Society in two parts: April 16, 1938 and September 7, 1939. 

3 Schlafli, 22. 
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each of which could have been used just as well to build up the configuration. 
Every line intersects ten others, which form five intersecting pairs. . The. 
cubic surface therefore has 45 tritangent planes, each RE three of 
the lines. 

The incidences of the lines aré unchanged by the transposition of suffix 
numbers 1 and 2, which can be thought of as a re-naming of certain lines, 
namely the interchange of rows of the double-six 


Gy by Can Cas Cos C36 
as bs Cis C14 Cis Cre. 


' ‘The transpositions (12), (23), (34), (45), (66), which we shall denote 
by Pa, P, O, N, Ni, generate the symmetric group on the six suffix numbers. - 
But this i is not the whole group of automorphisms of the configuration. The 
operator,’ say Q, which interchanges rows. of the double-six 


C23 Cis Cie aa üs ag 
-bı Ds Ds Cse Cie Cas 


increases the order from 6! to 72-61 — 51840 ; in fact, it enables us to replace 

a,‘ ‘ ‘, Qs by either row of any one of the 36 double-sixes. 
These six operators generate the group in a particularly simple form.‘ 
.. Their product in any order is of period twelve; * e. g., in the order NiNOPP,Q 

it is . ce À 
= (123456)-Q | 
= (a Gz As Csa C16 Ds ba Ds Do Ces Csa as) 
i (Cas Qa Cas Cis C26 ba Cia b1 Cia Cas Css as) (Cr Cas Cae) 


We note that R* permutes the 27 lines in nine cycles of three (each cycle 
belonging to a tritangent plane °), namely 


Rt = (a Cig Dg ) (a2 bs Ces) (as b4 Cs4) (Cse bs dg) ots 
: (Ces C38 Cis) (as bs Coa) (Cae Cia Css) (Cis by as) (cs C25 Cso). 


On the other hand, the operator S = (3 4)Æ* (34)R permutes them in three 
cycles of nine. (We note that S? = #*.) Another operator of the same kind is 


So? wo (a Qa Cne Cie ba bs ba Ces as ) A 
+ (ba Cia Cag Cas Cas Aa Qs Cas Da) 
‘ (Ca bi Cis Ces Os. Cas Cao Cas C28), 


3 This is practically the “ substitution T ” of Burnside, 7, p. 301. 
t Coxeter, 10, pp. 163-166. ~ 
` # Coxeter, 11, p. 608. | 
¢ The fact that the 27 lines are the complete intersection of the cubic surface with . 
nine planes seems to have been first remarked by Baker, 1, p. 16. ` 
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in which the first cycle is the same as the first cycle of R, omitting the 
tritangent plane ds b4 Cay. 


2. An alternative derivation of the cycles. A more elementary (though 
somewhat artificial) procedure is based on the observation that any six lines 
which are the lines of two tritangent planes can be arranged as a cycle, each 
skew to its two neighbors but intersecting the remaining three. For instance, 
the planes @ bs Cas, Cse bs de give the cycle 


(a2 Csa Ds Ds Cos as), 


which is permuted by 2%. Into this cycle, the three lines of any one of twelve 


i Si moe 
a 
2 I l i a 
i} 
i Y a 
SA I 
~A i 
D 1 i Qs 
~ i 





` 


Fig. 1. The construction of a skew hexagon or double-three. 


other tritangent planes can be inserted so that, in the consequent cycle of nine, 
each line is skew to its four neighbors (two before and two after) but intersects 
the remaining four (its four “ opposites”). Inserting thus a; Cıs be, we 
obtain the cycle , i 

(a, l2 Cse C16 Da Dy bg Cos as), 


which is permuted by S-*. 

_ Into this last cycle we can insert the three lines of any one of three 
tritangent planes (from among the eleven just discarded) so that, in the 
. consequent cycle of twelve, each line is skew to its six neighbors (three before 


and three after) but intersects its five opposites. Inserting thus as b4 cu, we 


obtain the cycle: 
(a1 az Q3 Cso Cre ba ba Ds Do Cos Csa Ma), 


which is permuted by R, and whose square contains the original cycle of 
six lines. 
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3. The representation by points in the affine plane. Given two inter- 
secting lines (from among the 27) and a third line skew to both, there is a 





Fig. 2. The corresponding points. 


Cas 





Fig. 3. Three enneagons. 


uniquely determined fourth line, intersecting the third and again skew to the 
frst two. For instance; the three lines &:, be, &a determine b,, so as to form 
the intersecting pairs a, bz, a,b, (Fig. 1). This suggests the possibility of 
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representing the 27 lines by 27 points, so that two such intersecting pairs are 
represented by the pairs of opposite vertices of a parallelogram, skew lines 
` being represented by joined points. The transitivity of parallelism (Fig. 2) 
requires that any two other lines which intersect b», a2 respectively, but not 
vice versa, should also intersect b, a, respectively, so as to form a skew hexagon 
or “ double-three ” (Fig. 1). This is in fact the case; for instance, the two 


Cu C 





Cy Cy 
Fig. 4. Two dodecagons. 


other lines may be as, bs, completing the skew hexagon a, be a, bi a: ba. Thus 
the representation is consistent. 

In order to make a diagram of pleasing appearance in the Euclidean 
plane, we begin by drawing a regular enneagon or dodecagon, to represent ` 
the lines of a cycle of nine or twelve as described in $ 2. The remaining 
representative points can then be derived by completing parallelograms. In 
Fig. 3, the three vertices a1, cos, bs of the outermost enneagon lead to Cso, 
which is thus seen to be the point of intersection of the joins a: Cie, Da Cso; 
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the rest of the smaller ennéagon ce bs ba Cis Cos Csa Css 84 85 Can be. obtained 
similarly, or can be marked at once by using the second cycle of the operator 
8*in §1. Again, the three vertices as, Css, Ds of this second enneagon lead 
to cis, which is thus seen to be the point of intersection of the joins a4 ba, bi a33 
. the third enneagon Cas Cog Cra Di Cas Cas Bs Cas Cae Corresponds to the last cycle 
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Fig. 5. The enneagonal projection of 2s. 


of 8%, The complete diagram (Fig. 5, drawn by J. M. Andreas) has one 
unfortunate feature: 27. of the parallelograms (such as Css Ces C12 Cis, 81 8s Ds bi, 
az 83 bs be) degenerate into lines containing four points,’ thus we have to think 
of Cas as being joined to c:2 but not to Cis, and 80 on. | | 
In the dodecagon (Fig. 4), the three vertices a;, cos, bs lead to css, which 
is thus seen to be the point of intersection of the joins a, C16, bs 83; the smaller 
` dodecagon Co Cis Ceo Da Cre Da Cis Cou Cas As Cas a, Can then be completed in ac- 


"This coincidence has been ‘utilized by Rouse Ball, 2, p. 127. 
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cordance with the second cycle of R. Considerations of symmetry suffice to 
show that the remaining three points (corresponding to the last cycle of R) 
must coincide at the centre. In the complete diagram (Fig. 6), 24 of the 
parallelograms (such as a, 8 bs bi, 1 & Cie Cas) degenerate as before, while 





















sÁ NN 
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Fig. 6. The dodecagonal projection of 2. 

















another 24 (such ag a; Ces by Cso, Cos Cis Cis Cas) have two opposite vertices 
coincident (at the centre). : 

Figs. 7 and 8 both show the points that represent the ten lines meeting b,,8 
and those that represent the six skew lines b; (i. e., one row of a double-six). 
Figs. 9 and 10 show the points that represent the nine lines 


C23 Cis Cag 
la Dy Crs 
bs Gs Css 
s 


3 The particular line b, is chosen because (a, a, a, Cs Cu Cu Ou Ge) is one cycle of the 
operator (34) R. We could obtain a third diagram by drawing this cycle as a regular 
octagon; then the three points bi, Gs Ow, Would coincide at the centre. 
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of a trihedral pair.” Fig. 9 makes it evident that three such sets of nine can 
exhaust the 27 lines, forming one of the forty triads of trihedral pairs. 


4, The representation by points in real Euclidean six-space. Let us 
now make the representation more symmetrical by insisting that the parallelo- 
grams shall be squares. We can no longer remain in the plane; Fig. 2 has to 





Fig. 7. Bs and as. 


be regarded as a square and a triangular prism. The introduction of a, and b, 
necessitates a fourth dimension; of a, and bs, a fifth; of as and be, a sixth. 

Let a, denote the regular simplex (with p -+ 1 vertices) in p dimensions. 
Then the square and the triangular prism can be written as “rectangular 
products” 10 æ, X a1, @ X æ The “ double-n ? 


tı le’ ` ° da 
bı be: . - by 


? Steiner, 27. . 
19 Compare Coxeter, 11, pp. 591-592, where the rectangular product a, X a, is 
written lap agl- Sommerville, (26, p. 114), calls this a “ simplotope of type (p, q4)” 
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(n=6) is then represented by the rectangular product an-ı X @, in n 
dimensions, i. e., by a right prism whose base is the regular simplex on-1. 

We are now representing the lines by points in six dimensions in such 
a way that the distance between two of the points is 1 or 2% according as the 
corresponding lines are skew or intersect. This applies to the c’s as well as 
to the œs and 0’s; for, we may take the representative points to have the 
following Cartesian coordinates in eight dimensions: 1? 








Fig 8 Bs and Gs $ 
a (2k, 0, 0, 0, 0, 0, 2k, 0), bı (2k, 0, 0, 0, 0, 0, 0, 2k), 
a (0, 2k, 0,0, 0, 0, 2k, 0), be (0, 2k, 0, 0, 0, 0, 0, 2), 
ao (0, 0, 0, 0, 0, 2k, 2k, 0), bs (0,0,0, 0, 0, 2k, 0, 2k), 
Ge (—k,—k, kkk, k, kk), -~- Cso (kk, k, k, —k, —k, k, k). 


The number of dimensions is reduced from eight to six by the relations 


Ti + eet: LT Dr + Ts = 2k. 


Since the distance a, a, is 2%/2k, we have k = 2-5/2, 


4 To put it rather pedantically, the distance is (r + 1) # when the two lines have 
r intersections. See Du Val, 15, p. 28. 
13 Coxeter, 8, pp. 3, 6. 
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This representation was discovered by P. H. Schoute and explained by 
J. A. Todd It is perfect, in the sense that every automorphism of the 27 
lines corresponds to a symmetry of the 27 points, and conversely. 

We observe that all the above coordinates satisfy the condition ts 2k, 
equality holding for the five-dimensional simplex bı be bs bs bs be (corre- 
sponding to one row of a double-six) ; and that they all satisfy the condition 





zı + Ta S 2k, equality holding for the five-dimensional cross-polytope (or 
octahedron-analogue) whose pairs of opposite vertices are 


8, be, &2 bi, Caa Coe, Css Cac, Cae Can 


(corresponding to the ten lines which intersect C12). Continuing thus, it can 
be shown that the 27 points in six dimensions are the vertices of a semi-regular 
polytope whose five-dimensional faces are regular polytopes of two kinds: 72 
simplexes a, (belonging to the 36 inscribed a, X a8) and 27 cross-polytopes 
Bs (one opposite to each vertex). Thé numbers of edges a, triangles a, 


13 Schoute, 24, pp. 376-383; Todd, 28: 
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tetrahedra @, and “ 


pentatopes” a4, are respectively 216, 720, 1080, and 
432 + 216.14 | | 







None mé es 


This six-dimensional polytope,!° now known as 





44 Compare Henderson, 20, p. 25. 
** Coxeter, 18, p. 331. Cf. Fig. 11, below. 
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or 22, was discovered by Thorold Gosset.*® Figs. 5 and 6 can be regarded as 
plane projections of its vertices and edges. Squares such as a,a,5 bsb, are 
foreshortened into lines. Figs. 7 and 8 show two five-dimensional faces: a 
_ sandan as. The reader will have no difficulty in picking out from Figs. 6 : 

- and 6 (with the aid of Figs. 3 and 4, respectively) the prismatic figure as X a; 

` whose vertices are all the a’s and b’s. Figs. 9 and 10 show an inscribed four- . 
dimensional polytope, the rectangular product of two triangles, a, X az whose 
* solid faces consist of six triangular prisms a, X a, (or a, X a), all plainly 
discernible. It is interesting to compare -the different views in the two 
projections. ` 


5. The representation by points in complex Euclidean three-space. 
Witting * has shown that the simple group of order 25920 can be represented 
as a collineation group in four variables. Burkhardt ** used the transforma- 
tions B, C, D, Sa of Table I (at the end of this paper) to generate the 
corresponding linear group of order 51840, and remarked that the simple 
group itself is generated by linear transformations of the six Plücker coördi- . 
nates, as in the middle section of the table. (The actual transformations are ` 
easily deduced from the first section by regarding pau as a symbolic product 
Zau = — quan) 

The final section of the table cabine re corresponding transformations of 


Ty =m Doi — Pes; La = Poz — Pas Ts = Pos — Paz. 


These are not strictly “linear transformations,” since D involves the con- 
jugate imaginaries Z, etc. ; however they are “ unitary,” in the sense of leaving 
XE, + Cola + TaZ invariant. Consequently, if we write 2 == yv + Yvset where 
the ys are real, the corresponding transformations of the: six variables 
Yo Y» Yes Vas Ys) Yo are orthogonal. Using the terminology of the six- 


46 Gosset gave this and many other new results in a long and brilliant essay which 
was refused publication in 1897. (For an abstract, see 19.) Since then, 2,, has been 
rediscovered at lenat three times. i 

17 20. 

18 6, pp. 318, 320. 

1 Dr, Frame has drawn my attention to the fact that this representation of degree — 
six is an irreducible component of the Kronecker square of the representation by quater- 
nary collineations, the other component being a representation of degree ten which’ is : 
the corresponding symmetrized Kronecker. square. It is also an irreducible component 
of the representation by permutations of the 27 lines on the cubic surface. He gives 
x character of the ee ts by quaternary collineations as a 0,0, +2, 


(30 -+ 1), + (3w +1), + (w*—1), te(w—1), toe, +a, +1, + (u— w), 
+ i 0,0,0,0, +1, +, +w, taking the classes in the same order as in his table, 
17, p. 483. ~ | 


3° Cf. Burkhardt, 6, p. 326. 
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dimensional Euclidean space. defined by the ys, we may say that the vector 
(41, X2, Xa) is perpendicular to the hyperplane 

u= Pix, + Les + Fm + Xiti + Xefe + Xafs = (. 


Moreover, if X,X, 42,2; 4 XX; = 1, the reflection in the hyper Pi 
u—0 is the transformation 


| - T'i == t, — Xyu, 
(5. 1) ` Ug mem La — Xot, 
Ta = Ts — Xu. 


For, letting m, n take the values 1, 2, 3, 4, 5, 6, and Waging Aya Fy + Fyt 
(v =— 1,2,3), weshave 
43 Vanya = 2(Xy B Z) (zv +) — X (Xr 2) Ss) 
=m 2I (Xie, + Xyz,) ome DU; : | 
and the reflection in the hyperplane 3 Fayn — 0 (where 2 Y„? = 1) is 
Y'm = thy — DY nd Yn¥n; 


which leads at once to the above transformation (5.1). 
Corresponding to the 27 lineg on- ene cubic surface, Burkhardt ™% found 
2% linear complexes 


D pos — Tpos pas wei + Pi = 0, 
(5.2) pos — OF 91 — 2 Pia + o* fos = 0, 
D por — Po — WPa + OF Pyy me 0. 
(A, p= 0, 1,2; w = 6274/8), 
In terms of the 2’s, these are the hyperplanes 
. Da, — va, + wt, — oF, = 0, 
wr, — Gt + wr, — of Z, — 0, 
: wz, — Dar, + oF, — os = 0, 
which are perpendicular to the vectors `` ‘ 
(5.3) , (Qepse), Cotte) (oñ, — ot, 0). 


By keeping this selection of signs, we now have 27 points of the complex 
Euclidean (or “ unitary ”) three-space which are permuted among themselyes 
by the transformations B, C, D, Sz of the rs (see Table I). 

In the real Euclidean six-space defined by the ys we thus find 27 points, 
which correspond to the lines on the cubic surface, and which are permuted 


6 b. 323 (14). 
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among themselves by a rotation group of order 25920. We may naturally 
expect these to be the vertices of the polytope 2a. Such is in fact the cage. 
For, by considering various pairs of the points, we find that the distance *? 
between (Ti, Ta, T3) and (V1, Z'a, T) is 6% if a+ sv 0 for just one of 
the three values of v, and 3% otherwise. This agrees with the known corre- 
spondence between the 27 lines and the vertices of 22: (edge 3%) if we make 
the points (5.3) represent the lines 


tap, USER, SA 


respectively, in the notation of Philip Hall. The three lines of a tritangent 
plane (such as fous, Uoso, Soto, OF Soto, Siti, Selz) are represented by the vertices 
of an equilateral triangle whose centre is the origin, i.e.?# by three vectors 
whose sum ‘is (0, 0,0). 


_ 6. The 36 double-sixes and the polytope 1... As we remarked in $ 1, 
the whole group of automorphisms of the 27 lines, of order 51840, can be 
generated by certain operators each of which interchanges the two rows of a 
double-six. Since the double-six is represented by the prismatic figure a; X a 
($4), it is geometrically evident that the corresponding orthogonal trans- 
formations in Euclidean six-space are reflections which interchange pairs of 
opposite &s’8 of 2. These may equally well be described as reflections which 
interchange pairs of opposite vertices of the “ semi-reciprocal ” polytope 122. 

This six-dimensional polytope, whose 72 vertices are the centres of the 
@s’8 Of 22, was discovered by Elte. The numbers of edges, triangles and 
tetrahedra are respectively 720, 2160 and 2160. Four-dimensional elements 
of two kinds are involved, namely 432 as and 270 8ps; but the five- 
dimensional faces-are all alike, being 54 “ half-measure-polytopes 77 1,4. From 


ae (| O,— o, |? + | D — v, |? + | 0 — 2: \2) 4. 

33 See Coxeter, 8, p. 396. (It is obviously immaterial whether we take the suffix 
numbers to be 0, 1, 2 or 1, 2, 3). In this notation an operator of period nine (such 
as the 8 or 84 of §1) loses its artificiality, being expressible as (8 to tt 8; ti th, 9: tatty) 5 
thus the cycle of nine lines required in the construction of Fig. 6 is simply 

| (Sets tots o8z its titia Uido Salo tythy 4381) - 

™ Cf, Frame, 18, p. 660. We shall have further comments to make on that paper 
in § 8. 

#8 This must not be confused with the above mentioned linear group of order 61840, 
which has the simple group as a factor group but not as a subgroup; nor with Witting’s 
collineation group of order 51840, which is a ‘direct product, as was pointed out by 
Maschke, 21, p. 321. 

J #16, pp. 104-108. 
37 The vertices of the (n + 3)-dimensional polytope 1, are alternate vertices of the 
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each of ihe 72 vertices emanate twenty edges which bale in pairs to ten 
regular hexagons lying: in planes through the centre; hence there are altogether 
72° 10/6 == 120 such diagonal hexagons. The 27 pairs of opposite 1,8 
correspond to the lines on the cubic surface, the 36 pairs of opposite vertices 
correspond to the double-sixes, and the 120 diagonal hexagons correspond to 
the trihedral pairs.” Moreover, the planes of these hexagons fall into 40 sets 
of three absolutely perpendicular planes, corresponding to the triads of tri- 
hedral pairs. 

New codrdinates for the vertices of l, can of course be.derived from 
those which we found for 22: in § 5, but it is perhaps more interesting to 
obtdin them from the collineation group. Corresponding to the 36 double- 
sixes, Burkhardt 2° found 36 linear complexes 

BMD Por + opes) = 0, 
34 (@poz + psi) = 0, 
(6,1) 344 (Pos + op) = 0, 


vpo + © D Pos + Gos — ow" Dag — Pa ag oF Pia = 0. 
: (x, A, p= 0, 1,2). 
In terms of the 2’s, these are the hyperplanes 


f ; . - 3% (oxy eae w Ey) em 0, 
i Ds -E Dra + Dies + oE, + oa art m0, , 


which are perpendicular to the vectors. 
(6. 31) (+ 34ia, 0,0), (0, +3#ta, 0), (0,0, = 4a), 
(6.32) . (ct of, Ha, + o”) (all + or all —). 


(6. 2) 


In this case there is no systematic way of picking out one from each pair of 
oppositely directed vectors, but the 72 points having these codrdinates are 
easily recognized as the vertices of lez: (edge 3%). In particular, the point: 
(1,1, 1) corresponds to the simplex (0,1, —w*t), (—o*t, 0,1), (1, —**,0), 
of 2a. l 
Clearly, the six points (+ 3#iwh, 0,0) aré the vertices of a diagonal 
i hexagon, and we have a simple verification of Todd’s remark * that twelve 
of the 120 diagonal hexagons can be selected so as to include all the vertices 
of 1z, just once. ; 


measure-polytope or hyper-cube, Y„,, Thus 1,, is the tetrahedron (having alternate 
vertices of the cube, y,), and 1,, is the cross-polytope 8,; but 1,, is not regular 
forn > 1. . 

38 Todd, 28, pp. 204-205. 

#6, p. 325. 

2098, p. 205. 
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To normalize the hyperplanes (6.2) we multiply by 3%. 
in i(t, — wrZ,) = 0 is 


f = 0, — (— tw’)t (oe, — ot.) = DE, 


(6. 4) T'a = Lo, 


Ta == Tg. 
The reflection in 34 (æ, -+ tz + Ts + & + Ë: + %) — 0 is 
di = Ti — S, 
(6.5) La = Ti — S, 
Ta = T; — 5, 


where s == (2, + Te + de + i + Ëe + &s)/8. 


The reflection 


The group of order 51840 which such reflections generate is conveniently 


3,3 
denoted by a | ox (for brevity) [377+], since it has the abstract 
3 


definition êt 


0? = N? a N,? = P? = Pi Q? | 
= (ONY = (NN,)5= (OP) = (PP)? == (0Q)° 
— (0N,)? = (OP,)? = (PQ)? = (QN)? = (NP)? 
— (P.Q)? — (QN)? = (N:P)? (NP) = (N:P)? = 1. 


N, | P 


(6.6) 





Q 
Fig. 11. The group [877]. 


In order to generate the group in this elegant manner, we have to select six 
of the 36 reflecting hyperplanes in such a way that the angle between two of 
them is r/2 or 7/3 according as the period of the product of the reflections 
is 2 or 3. In other words, we have toeselect six vertices of 1.. which are 


31 Coxeter, 10, p. 164. 
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connected by five edges as in Fig: 11. In this diagram ** it is to be under: 
stood that pairs of vertices not. joined by edges are distant 2% edge-lengths 
(= 6%). N, denotes one: of the two opposite vertices which are ire à 
by the. reflection N 13 similarly for the other letters.. ‘ 
z Since the transformation (6.4) .is simpler than (6.5), it is Pre to. 

take as many as possible of the vertices from the set (6.31) and as few as 
possible from (6.32). The former set of vertices belong to three hexagons 
lying in absolutely perpendicular planes; so we cannot use them exclusively 
(since Fig. 11 is connected). We take NN, to be a side of one of these 
hexagons, PP; to be.a side of another, and Q to be a vertex of the third. 
The remaining vertex, O, has to be chosen from the other set, and we naturally 
take it to be ae 1,1). We thus obtain 


(— 3i, 0,0), | P: (0, — 3%, 0), 
x (3#iw?, 0,0), . ° a o :P (0, Bhia’, 0), 
| 0 (11,1), 
Q - (0, 0, 3kiu?). 


The corresponding he are given in the first section of Table II. 


7. Collineations which generate [8***]’ according to a known abstract 
definition. Writing Por — Pas, Pos — Par, Pos — Dia for Ts, Te, Ts, the trans- 
formation zı = T, — 8 of O (Table II) becomes ` 


Por — Das = Por — Pos — (t -+ E), where 
t = (Por + Poa + Pos — Pea — Psi — iz) /3. 


This is consistent with either . 


“Po Por — i, . P= Patt ; S 
OF , % | 
D'où = — Pas — À, P'a3 = — Por + E 

Thus the group [3>>1] is generated by collineations ** in the six variables pau, 
or equally well by anticollineations. (For details, see the second and third 
sections of Table IT). In the case of the collineations it is impossible to 
regard the p’s as Plücker codrdinates in a projective space defined by Zo, 41, 22, Zs. 
But in the case of the anticollineations this can be done, as in the-final section 
of the table. An ambiguity of sign appears at this stage, since reversing the 
signs of thë z’s has no effect on the p’s. Hence although the group generated 


83 The reader may be. interested to draw plane projections of ln aa to 
the projection of 2, shown in Figs. 5 and 6, and to ai out a set of five edges related 
as in Fig. 11. : 

58 When working with the p’s tie. is no. need to distinguish between collineations 
and linear transformations. (Burkhardt, 6, p. 320). 
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by anticollineations in the 2z’s is still [3771], the transformations themselves 
generate a group of order 103680 which we shall call the binary [3+]. 
The signs have actually been chosen so as to make the transformations 
satisfy (6.6) with the “—1” omitted. These relations provide an abstract 
definition for the binary [3>>1]. For, we could have continued them by 
writing “ = Z, Z? = 1”; but it is unnecessary to do so, since the relations 
P? mm Q? — (PQ)? = Z imply Z° == 1 (and are fulfilled by the quaternion 
group). l D 
' The most important subgroups are {N:, N, O, P, Pı}, of index 72; 
{N, 0, P, Pı, Q}, of index 27; and {N:N,N,0, N-P, NiP1, NiQ}, of index 2. 
The last of these, having the abstract definition 


D = V? = W? = X? = Y? 
(7.1) { — (UY)? — (VW)! = (WX) = (VF) 
= (UW)? = (UX)? — (UY)? (VX) = (WY)? = (XY), 
is naturally called the “binary [3%*+]’,” since by adding the extra relation 
“—=1” we obtain [3877+]’, which is the simple group itself. Table III 
(immediately deducible from Table II) gives linear transformations of the 
ps which generate [87*1]’, and linear transformations of the zs which 
generate the binary [3**:1}. These transformations, while possibly no more 
elegant than Burkhardt’s, have the advantage of generating the groups ac- 
cording to a known abstract definition. 
The operator of period twelve considered in § 1 is 


(7.2) R=UVWXY 

—w 0 0 0 ko k—k)°(00 0 —w? 

0— 0 0 Ok k k 00 — 0 
0 kk—k 0 Ow 0 0 

0 0 “Oa? —k k 0 —k v0 0 0 


00 0—t1 00 aœ 0 
00—1 0 0 0 0 —we 
X 01 0 90 —w 0 0 0 
10 0 0 0 w 0 0 


0 —wk—k —ok 
"t By analogy with the binary icosahedra] group Pë = Q? = (PQ)*. (See Seifert 
and Threlfall, 25, p. 218.) Since the binary icosahedral group arises as a linear group 
in two variables, and the binary [3% *1] as a linear group in four variables, it would 
perhaps have been better to name the latter the quaternary [333,1]. 
35 Coxeter, 10, p. 160. 


s k — (w — w?) /3 == 3°44. 
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The central is generated by 


ts 0 0 0 
0—1 0 0 
as o0 0—1 0 


0 0 0—i 


We have seen elsewhere *’ that the abstract group [3%**] is generated by 
the two operators R and O. The results on which this statement is based apply 
to the binary [31] without change. (For instance, QOQ = 07Q707Z 
= 000, QPQ == P1Z — P.) It can be shown similarly that the binary 
[8>] is generated by R and X. In fact, writing for brevity | 


Ta = RXR", we have 

To == X = N,P:, 

Ts = R°N,R - BPR am PO, 

T: = OPO - 0Q0 == OPZQO, 

Te =— R°P,R- ROR = PN, 

T; = R* PR: RNR = OQON,, 
whence 

ToT, == N,20, 
TTT, = 0QZ, 
TTT T5 = OP, 

and finally, since . 
Z=% =X, 
U = N N = NO OP-PN = 27 7 LTL oT Ta = XRX R (XR*)PXRAXK® 
F = N0 = ZTT s = XI RXR", 
W = NP mu N ZO - OP = ToT LTT Ts == XROX A (XR) XR, 
Y = N,Q = N, 0 : OQZ = ZT oT TT oT, = XO RXR? (XR) XR". 


In other words, the group of linear transformations, of order 51840, is 
generated by | 


—wk —ok o'k 0 0 0 0 1 


mak aki fy sake a joo 0 
ok 0 k =) © 01 0 0 
0 —wk —k —wok 10 0 0 


By the same kind of argument, the second generator can be U instead of X.** 
Since the congruence 


c+ 2¢+4+1=0 (mod?) 


31 Coxeter, 11, p. 615. 
. *Brahana (4, p. 533) proved that the simple group [3%,°,1]’ is generated by two 
operators which may be identified with our UVW and XY (or W,NOP and P,Q). 


` 
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has the roots 2 and 4, one of the simplest modular representations *® of the 
binary [3°*1]’ is derived by putting 2 for œ, 4 for k,* and regarding all the 
coefficients as residues modulo 7. Thus the group is generated by the 
transformations 





5000 4048 0003 
0500 . 0444 0050]. 
CS Neorg ote = 4430 |? Ro core 
0003) 3403 UN AUS NS) 
0006) 0040 fo 
0060 n 0005 
*= 5100! *—|5000 
1000) 0400 


in the field GF [7], or alternatively by XY (or U) and 


6620 
6103 
2046 |” 
0536 


The reader will probably agree that these matrices are easier to manipulate 
than those of (7.2). 


8. The representation by lines in PG (3,4). Instead of GF [7], we 
may use the field GF [2°] defined by the irreducible congruence i 


a? + x+1=0 (mod 2). 


Tables I, II, IIT all remain valid if we interpret w as a root of this congruence 
(a primitive root in the field) and define the conjugate & of any mark u to 
be its square. Since — 1 == 1, we can replace every minus sign by plus. Also 
k = (v — o?) /3 =o + o° = 1. Since Z = 1, there is no longer any distinc- 
- tion between collineations and linear transformations. We easily verify that 
the abstract definitions (6.6) and (7.1) (with “=—1”) are satisfied. The 
transformations 


w00 0> ` FÉES 00 0 o? 


0 w 0 Ü OAL À 00 w 0 
= 7 == y= 7 
U 00 w? 0 4 L -[1110 |? W Ow 00? 

000 «° 1101 w0 00 


3 Brauer and Nesbitt, 5, p. 6. 
49 k = (w—w°)/3 = (2—4)/3 = 4 (mod 7). 
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0001) . [0 0 w 0 
2k 6 GA 000% 
poa | | a 

rrol” T a0 00]? 

1000 0 w 0 0 


being unitary, generate [3%°1]’ qua HO(4, 2?).* 

Following Frame,*? let us: regard these as collineations in the finite 
geometry PG(3,4) with homogeneous codrdinates (Zo, 21, 22,23). We observe 
that the line whose Plücker coërdinates (Por, Poo, Pos» P23; Par, Pre) are 
(0, o, ©”, 0,w,w) is invariant under the transformations N,, N, O, P, Q of 
Table II, and consequently also under the transformations U, V, W, Y of 
Table III (with the coefficients modified to fit the Galois field, as described 
above). This line is transformed into (0, œ, w°, 0, o°, w) by P, or by X, and 
into the 27 lines | 

(0, wr, wt, 0, 03, oH), 
(8.1) -. (of, 0, wò, 5#, 0, D>), 
(wò, of, 0, a, a, 0), 


by other operators of either [3221] or [8%21)’, 
-In other words, since the coefficients of the 27 linear complexes (5. 2) 


satisfy 


Qorlzs F ossi + lose — ra 2=0, 


the corresponding linear complexes in PG (3,4) are special, each consisting 
of all lines which meet one of the lines (8.1). On the other hand, the 36 
linear complexes (6.1) remain general, since for them the invariant is 
—3=1. 

Another consequence of reducing the coefficients to marks of GF [2°] is 
that the cubic form zo? + 21? + 2,5 + Z% is now invariant under the group 
[321] generated by U, V, W, X, Y; it is transformed into its conjugate 
by the anticollineations which generate [3°71]. Thus the “ cubic surface ” 


(8. 2) to? + 2,8 + ae + 25° == 0 

in the finite geometry PG(3,4) is invariant under a group of collineations 
and anticollineations which is simply isomorphic with the group of auto- 
morphisms of the lines on the general cubic surface in ordinary projective 
space. This result suggests the possibility that all the automorphisms of the 
lines on the special cubic surface (8.2) can be realized as collineations and 


41 For alternative generators of HO(4, 27), see Frame, 17, p. 482. For other repre- 
sentations of [3%,*,1]’, see Dickson, 14, p. 298, and Brahana, loc. oit. (4, p. 533). 
“18. 
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t 
anticollineations. - This is in fact the case, since the lines in question are 
` precisely (8.1). The nine lines (0, oò, w#,0,&,w) belong to the trihedral 
pair # which is put in evidence by writing (8.2) in the form CK 
(Zo + 21) (Zo + %21) (20 + w*21) + (22 + Za) (22 + 022) (22 + w23) = 0. 
These “trihedra” are degenerate, each consisting of three planes through a 
line. Each of the planes is therefore met by the opposite “trihedron” in 
three concurrent lines (in contrast to a tritangent plane of the general cubic 
surface, in which the three lines form a triangle). | 
The same thing happens in the. case of the special cubic surface 
| . | Boo + #8 4 2g? + a 0 
in ordinary projective space. The 18 planes 
| | au + wrey == 0  (e<») 
each contain three concurrent lines of the surface. But the 27 planes: 
| Zo + wz, + wits + wes = 0 ; 
are proper tritangent planes, each containing a triangle. E.g. the plane 
Zo + Zı + 23 + 28 == 0 contains the triangle cut out on it by the planes 
Zo + 41 = 0, Zo + 2a == 0, Zo + 23 == 0. 
‘This dichotomy of the 45 tritangent planes indicates that the group of auto- 
morphisms of the lines on this special cubic surface is a proper subgroup of 
[3**1], But the full symmetry is restored when we pass to the finite 
geometry by regarding the #8 as. marks of GF[2?]. For then all the 45 
planes degenerate the same way; e. g. the planes | 
| Z+atatae=0, ~+2=0, ty + #2 = 0, Zo +4=0 © 


concur at the point (1,1,1,1). 
The configuration symbols for the whole space * PG(3,4) and for the 
cubic surface (8.2) are easily seen to be: 








By this we mean that the “surface ” contains 45 of the 85 points of the 
4 Cf, Frame, 18, p. 661. ts a Ae a ers 
44 Schoute, 23, p. 5 (m = 4). 
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space, and 27 of the 357 lines; and that the 45 isotropic “ planes of the space, 
each containing three of the 27 lines, may be regarded as “tangent” planes 
to the surface. The configuration thus determined is self-dual. There is a 
one-one correspondence between the 45 points and the 45 planes, each point 
being the point of concurrence # of the three lines in one of the planes. Each 
plane contains the corresponding point and twelve other points of the set. 
Hach line contains five points and lies in the five corresponding planes. 

The plane corresponding to (Zo, 21, Z2, Z3) 18 (to, th, Ur, Us) Where tly == žy. 
Hence the Plücker codrdinates of the lines (after multiplication by w or & 
if necessary) satisfy 

Por = Pas; Poz = Psis Pos = Pis, 


as in (8.1). Since fioipes == Joi? = Por, the first three of the Plücker coördi- 
nates, so normalized, are the same as Frame’s “ non-homogeneous coördinates 
(4),” while the remaining three are their respective conjugates. Frame’s 
rules for the incidences (Theorem 2) follow immediately. In particular, the 
three sides of a triangle on the general cubic surface correspond to two inter- 
secting lines and a third line which is linearly dependent on them (ie. 
concurrent and coplanar with them). | 


9.. The 120 trihedral pairs and the polytope 4,,. Witting has shown * 
that the groups we have been considering have subgroups of index 40 of two 
distinct types. In the complex projective space with coôrdinates (Zo, 21, Ze, Zs), 
the plane Zọ== 0 is transformed into 40 planes, and the set of four planes 
20212223 = 0 is transformed into 40 tetrahedra. The vertices of the tetrahedra 
are 40 points whose coôrdinates are the same as the tangential codrdinates of 
the planes. The 40 planes are 43 


zy == 0 (x= 0, 1,2, 8), 

21 + odla + ozs == 0 (A, p = 0,1, 2), 
_ — 2o — wht, + wiz; = 0, 
— wey + okz, — 2; = 0, 
— wt) — WMZ, + Za = 0, 


or, when multiplied together, Fo == 0 in Maschke’s notation.*® 
When we interpret the coefficients as marks of the field GF[2?] (and 


t Frame, 18, p. 659. 

4 Thus the phrase “which form a triangle” should be deleted from the middle 
of p. 659 of Frame’s paper. 

‘729, pp. 41-43. See also Burkhardf, 6, p. 319. 

48 Blichfeldt, 8, p. 161. : 

4° 21, p. 333. 
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therefore ignore the negative signs), these are precisely the 40 non-isotropic 
planes of PG(3,4). Hence, by Frames Theorem 8,5 the 40 tetrahedra 
correspond to the 40 triads of trihedral pairs of the cubice surface. In 
particular, the tetrahedron 2921223 = 0 corresponds 5t to the tried 


loto Euz Eg tly |UoSo Ursa U28i | Solo Siba Sols 
lila Lolo Lola |Ui81 Uoso Uos: | Sit, Sato Sota 
trus Tour tty | UoSe UoSı UiSo | Sats Soli Silo 
in Hall’s notation. | 
When we interpret (Zo, 21, 22, 23) as non- opel coordinates (in 
complex Euclidean four-space), we find that the point (3%4, 0, 0,0) is trans- 
formed into the 240 points 


(= 3%io%, 0, 0, 0), (0, + 34iwx, 0,0), (0,0, = 3%iw, 0), (0,0, 0, + 3m 
(0, + af, + oò, + oF) (with signs agreeing), 
(9.1) 4 (%0, + at, +), ` 
| Se 
(FA, F ws w* » 0). 
These correspond in sets of six to the 40 points considered above in the pro- 
jective three-space, the six points of each set being derivable from any one of 
them by multiplying all four codrdinates by the same power of — e. Thus 
.. (+ 34tw, 0, 0,0) is one such set of six, and (0, + oò, + o, + wò) is another. 
When interpreted as points of real Euclidean eight-space, each set of six 
forms a regular hexagon. We thus have 40 hexagons lying in different planes 
through the origin. We shall see that the 240 points, so interpreted, have a 
far greater degree of symmetry than the 40 planes in which they lie; in other 
words, the 240 points can be distributed among 40 hexagons in many different 
ways. We shall prove, in fact, that the 240 points are the vertices of another 
of Gosset’s semi-regular polytopes, namely 





se Frame, 18, p. 660. 

51 In detail, (1,wA,0,0) and (0,0,1,a#) give the line (0, 1, wz, 0, adta, wr), To 
normalize this, we an by witz, obtaining (0, witH, wh-u, 0, w-À-u, w-Ate) or 
(0, wta, wr-a) or t,t To arrange nine such lines as a trihedral pair, we fix 
À for the rows and x for if e columns. 


THE POLYTOPE 21. 481 


or 4, (of edge 3%). This eight-dimensional polytope © has seven-dimensional 
faces of two kinds: 17280 simplexes ær, and 2160 cross-polytopes 8. The 
numbers of edges, triangles, tetrahedra, a.s, @;’8, and ags are. respectively 
6720, 60480, 241920, 483840, 483840, and 138240 + 69120. Its symmetry 
group [3], of order 696729600, is generated by reflections ** in hyperplanes ‘ 
which perpendicularly bisect the joins of pairs of opposite vertices. 

Hight of these 120 reflections suffice, the abstract definition being * 


O? ma N? — Ne= Ne — N,?— Pt P= Q? 
(ON) =(NN,)* = (NN) = (Nas)? (OP) = (PP) =(09)* 
—(ONy)? = (OP:)?=(PQ)? =(QN)? = (NP) mn (NN) (NN 3) = (NN 
(PQ) — (ON)? = (NP) = (NP) = (NP) 1 (v=1,2,3), 


Nz A E P, 





Q - 
Fig. 12. The group [3%*+]. 


analogous to (6.6). There is a central of order two, whose quotient group is 
derived by inserting the extra relation 


(N: N: N, N OPP, Q)” =l. 
Seven rotations, such as 
NNa, N,N, NN, N:0, N:P, NP: N,Q, 


generate a subgroup of index two. This subgroup of the central quotient 
group 5% of [3%*7] is the simple group FH (8, 2), of order 174182400. 


53 Gosset, 19, p. 48. 
32 Coxeter, 8, p. 388. 
si Coxeter, 10, p- 171. The subgroup [353,1] generated by O, N, Ny, P, Pa, Q has 
no direct connection with the binary [3°,2,1] generated by linear transformations of 
the z's. (The O, N, ete, of Table I aræof period four). en 
` $s Coxeter, 10, p. 174. There, and on p. 179, the word sub-group has several times 
heen used by mistake for factor group. 


48 H. 8. M. COXETER. 


i 

In order to prove that the points (9.1) are the vertices of 4, we select 
eight of them whose mutual distances are indicated in Fig. 12. By com- 
parison with Fig. 11, we easily find one such set of eight points to be 


(3%, 0, 0, 0), 
(— w*, w”, 0, — w*), 
f N, (0, — 3%, 0, 0), Pi’ (0, 0, — 34, 0),. 
N (0, Biu? 0,0), i P (0,0, 8#w?, 0), 


O (0,1,1,1),, 
Q (0,0, 0, 34°). 


The corresponding reflections are given in the first section of Table IV. It 
only remains to be observed that the given set of 240 points is invariant under 
. these transformations. 

‘When we pass to the real Euclidean eight-space, a more natural form of 
coordinates for Fig. 12 (on a different scale) is 5 


a (2,—2,0,0,0,0,0,0), . 
a (0,=~ 2, 3,0, 0, 0, 0, 0), ` 
(0, 0, 2, — 2, 0, 0, 0,0), Py (0, 0, 0, 0, 0, 0,—2,2), 
N (0,0, 0, — 2, 2, 0, 0,0), : P (0,0, 0, 0, 0, 2,—2; 0), 
O, (0,0, 0, 0, 2,2, 0,0), 
Q (11,1,1,1,1,1,1). 


The group [8**+] is then generated by reflections in the hyperplanes 


Yom Yi, 
Yı = Y3, ; i 
Ya = Ys, i Ye = Yr, 
YoYo. Ys = Yo, 
Ya + Ys = 0, 
zy = 0. 


' (See the second section of Table IV). 
Since the symmetry group of 4s: is [3622], of order 696729600, while 
that of the 40 hexagons is the binary [3***], of order 108680, it follows that 
such a set of 40 hexagons, which together use up all the 240 vertices of Ant, 
can be selected in 6720 ways. Having made one such selection, we find that 
the planes of the 40 hexagons (which correspond to the non-isotropic planes 
of the finite geometry 57) fall into 40 sets of four absolutely aad 


. For the ne coërdinates of all the vertices of án (edge 2°/*), see Coxeter, | 
8, p. 2. ({(PA), is an alternative symbol for 4,,). : 
“7 Frame, 18, p. 660 ($ 4). 
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planes. Hence, finiity these forty sets of four planes eorn pona to the forty 
triads of trihedral pairs.of the cubic surface. 

Another subgroup of index 6720 in [3#%1] can ‘be iid by specializing ` 
` a single one of the 1120 *8 diagonal hexagons of 4. (instead of forty of them). 
For, in (9.1), the plane of the hexagon (+ 3%io, 0, 0, 0) is absolutely per- ` 
pendicular to the six-space z — 0, which contains the 72 points 


(0, + shia, 0, 0), (0, 0, £ Blw, 0), y (0, 0, 0, E BHi), 
' (0, of, oñ, o), (0, — of, — wr, — wt). 
By (6.3), these are the vertices of the six-dimensional polytope *° 1,2, whose 
symmetry group, [3**1] X Gs, is derived from that of 2: by adjoining 
the central inversion. In the notation of Table de this subgroup is 
(Ns, N, N, O, P, PQ}. 

The 40 planes and 40 absolutely perpendicular six- spaces give, oe inter- 
section with a six-space of general position, a configuration of 40, points and 
40 four-spaces, which ‘is the real counterpart of Witting’s configuration of 
40 points and 40 planes in complex projective three-space. Witting described 
his configuration as the three-dimensional analogue of the Hessian configura- 
tion of 9 + 12 points and 9 + 12 lines in the complex projective plane. The 
real counterpart of the latter comes from the planes of nine diagonal triangles 
of 2», and of twelve diagonal hexagons of the semi-reciprocal 122, together with 
the absolutely perpendicular four-spaces, by intersection with a four-space of 
general position. | 
TABLE I : 

The simple group [333,1], of order 25920, as a collineation group (after Burkhardt) 
































B K o D 8, 
ey Bo. ` _ & : —k, PA 
. Bye — k(x, +2, + 2,)* fy " —#% -s 
g= — ke; + wa, + wa) CA — a : wey 
= — kiss t an H or)  - & CA we, 
P'a = — k (Pa + Pa + Pa) Pa . 77 Pa wPo 
Pa ©  —k(Pa + oPa + pe) Pos Pu - WPa 
Da = — k (Pa + oe + Pe) Pos Pa Des 
n= . (Pa + Pa + Pu) .. Pa TT Pa “Pn 
Pa = k (Pa + Pua + wPu) Pa Pa : Ds 
P'a = k (Pu + Pa + WPa) Pu ; Pe Pi 
Oye OT — k{o + o, + o) ee Dy | _— way 
D, = — k (ay + wey + ota) ` Oy — &: (oy 
s=. — k (m, + oa, + ww) o, — 8, ZA 








*k=(w— w) /3 = 31/24, wmo Ti 


s$ Coxeter, 10, p. 181. 
* Coxeter, 10, p. 178; 12, p. 477. 
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TABLE II 


The group [3*,%,1], of order 51840, generated by anticollineations - 




































































z M N: 0, P P, 9 
„r = a, ob, - o—s* LA LA T, 
D, = A LA D — 8 we KA . La 
a, D D 2—8 2, A EA 
P'un — Pn TT Ops Pa —tt Pu Pa Pa 
P'a = Pa Pa Pa—t Pa — Pan Poa 
Pla = Pes Pe Pa—t Pos Po — — Spo 
“P'a = — Pa — WPu Patt Pz ` De ' Pa 
P'u = Pa Pa Putt = OF Deg —Poa Pu 
mes P'a = Pa Pa Putt Pu Psa — Dog 
pr 
P'a = Pa OPa — În — t — Brn — Dr — În 
. P'a = — Pn — Pun — Du — t Der Dos — Bu 
Pa = — Pa — Da — Dan — z — Ps — fu wPo 
P'u = Bu Ca — fa + ? — Ba — Pa — Pa 
P'a = — Pan — Pa — Pa +t CSA Bn —Ba 
-P'a = — Pa — Pa — Pa +t — Po — Ba Dia 
So = FA CEA k(S: + 2+ 8) way Ža "wk, 
“= — Žo — wit, k(— 8 —& +8) — wa, — 8 CEA 
g= Ž, . wa, k(— & + — 2) — wh — B, — of, 
ey — à — wR, k (— 4) — 2, +&) we, FA — we, 
*s—(m+ateo+a+e + 2,)/3. | 
E= (Pu + Pa + Pa — Pn — Pn — Pir) /3. 
TABLE III , 
New collineations for generating [3*,°,1]” 
U=NN y = N,0 W=NP X=NP, Y=NQ 
v, = wg, D —s* LA FA CA 
La = O 1 — 8 or LA D 
g = LA a, — 8 @, Dy wa, 
P'a = -Pa — Pat} — Pn — Pn — Pu 
P'a = Po “Pa —t Pa — Pan Pa 
P'a = Po Pa t Pa Pa 7 WP rs 
Pa = WDay ` — Pa +t — Pa — Pa — Pua 
P'n = Pa Putt — Po — Pa Pa 
Pin = Pis Patt Pu Pra — 2D 
ey = — WB, k (2, + 2, — 2) — we, — 8 CA 
g, = - — oZ, k(e® + & + 2) — 2; —% Es 
a = — ote, k (z + %— %) ss, z m7 
ds = — ws, k {z + #,— 2) OB, FA - wy, 








* 8 = (m + z, + 2, + 2, +i + B) /3. 
tt = (Pa + Pa + Pa — Pn — Pn — Pu) /3. - 
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TABLE IV 
Two ways of generating the group [3t 3.1]; of order 696729600 :‘ ~ 


; z 
T 




















Ny. M X, N. o C PONR Q 

r= Ža Z—p* Zo Bo i Za CA Zo Zo 

= Žž, ntr FA we, %—oft gı z By 

= A Z, Zs | Bp La — 0 wz, 2, 2s 

“y= Z3 Es —p Zs By Z— 0 FA Zs wes 
Vo V= V2 = Val Y= Vo Ye Vo Vy =Y, — 2/4 
Va Y= Va Ya Ys Vs se Ve Ys Yi Vs (p= 0,1,...,7) 





2 p = (za — 2, + za + WF, — wh, + w) /3. 
tons + 2 + Z — z — B —&#) /3. 
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LE MOUVEMENT BROWNIEN PLAN.* 
© Par M. PAUL Lévy. 


Introduction. Nous nous proposons d’étudier les fonctions aléatoires 
A(t), Y(t) qui, dans le mouvement brownien, et en projection sur un plan, 
définissent les coordonnées du centre d’une molécule considérée indépendam- 
ment des autres. Il s’agira seulement du mouvement brownien mathématique, 
dont nous donnerons plus loin la définition précise. Indiquons tout de suite 
qu’il se distingue du mouvement brownien réel parce que le libre parcours 
moyen est supposé réduit à zéro. On est ainsi conduit à étudier des fonctions 
continues d’allure excessivement irrégulières: elles présentent, dans tout in- 
tervalle, une infinité de maxima et minima; leur quatre nombre dérivés sont 
infinis, sauf sur un ensemble de mesure nulle où un de ces nombres est fini 
et sur un ensemble dénombrable où deux sont finis; il y en a en tout point 
au moins deux qui sont infinis. Il s’agit là bien entendu de propriétés presque 
sûres, c’est-à-dire réalisées avec une probabilité unité, mais pouvant être en 
défaut dans des cas possibles, quoique de probabilité nulle; il nous arrivera 
de ne pas rappeler la nécessité de cette restriction; il nous semble que, dans 
des questions où il est bien entendu qu’il s'agit de phénomènes aléatoires, 
il ne peut en résulter aucune ambiguïté. 

L'indépendance stochastique de X(t) et F(t) a conduit les mathé- 
maticiens à étudier d’abord le mouvement brownien linéaire (c’est-à-dire 
projeté sur une droite). Cette étude a pris naissance dans les travaux in- 
dépendants les uns des autres de Bachelier ! et de N. Wiener,? et de nombreux 
travaux lui ont été consacrés depuis quelques années. Le lecteur pourra 
trouver un exposé des principes fondamentaux de la théorie ainsi édifiée dans 


* Received October 17, 1939. 

1 Bachelier, Oaloul des probabilités (1912). A cette date, Bachelier apparaît 
comme un précurseur. Si la manière dont sont introduits les problèmes où le temps 
joue le rôle d’une variable continue laisse à désirer, il n’en reste pas moins que c’est 
daus cet ouvrage que l’on trouve pour la première fois l’idée que la loi de Gauss 
g’introduit nécessairement comme conséquence de la continuité d’un processus additif, 
et la relation entre ce processus et l'équation de la chaleur. Il faut aussi signaler 
plusieurs formules relatives à l'écart maximum, et peut-être la formule (que j'ai 
cherchée en vain danus un grand nombre d'ouvrages antérieurs) qui, dans le cas des 
lois absolument continues, définit la loi dont dépend la somme de deux variables 
aléatoires indépendentes. | 

. FN. Wiener, Differential Space (Publications of the Massachusetts Institute of 
Technology, Ser. IT, N° 60, juin 1923). 
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notre Théorie de l’addilion des variables aléatoires (1937), p. 166 à 173. 
. Des résultats nouveaux ont été indiqués dane notre travail récent sur quelques 
processus stochastiques homogènes.” Nous rappellerons au §1 du présent 
travail la définition mathématique: du mouvement brownien, c’est- à-dire la 
définition stochastique de X(t), et quelques propriétés connues de ce mouve- 
ment utiles pour la suite. Quant aux théorèmes généraux du calcul des 
probabilités que nous cousidérerons comme connus, ils se trouvent tous dans 
notre ouvrage de 1937 cité ci-dessus; nous prions le lecteur de sy reporter 
en cas de besoin. 

`` Les § 2 et 3, concernant respectivement le mouvement brownien linéaire 
et le mouvement plan, sont consacrés à l’étude de quelques propriétés locales 
des trajectoires. Les § 4 et 5 sont consacrés respectivement à l’étude d’une 
expression qu’on peut représenter symboliquement par l'intégrale ` 


B= f XOF 


ét que nous appelons Fosċillation brownienne de x (t) dans l'intervalle (0, 1), 
et A celle de interes | 


= f non 


qui, avec. des axes convenablement choisis, représente Paire comprise entre 
la courbe C trajectoire du mouvement brownien plan pendant l'intervalle 
de temps (0.1), et sa corde. | 

I s’agit là d’intégrales stochastiques d’un type essentiellement nouveau. 
On peut les définir comme limites de sommes, ce qui, pour Paire 9, revient 
à la considérer comme limite de ‘celle définie en remplaçant le courbe C par 
une ligne polygonale inscrite; mais il ne s’agit pas d’une limite ‘ordinaire; 
suivant les cas il s’agira de convergence en probabilité, ou de convergence 
en moyenne quadratique (qui, comme on sait, implique la précédente), ou de 
convergence presque sûre. Cette dernière notion ne peut d’ailleurs inter- 
venir que comme conséquence d’une hypothèse restrictive relative au mode 
de division de l’intervalle de variation de ¢ en intervalles très nombreux .et 
très petits; il nous suffira de supposer que tout point de division une fois 
choisi soit conservé dans les subdivisions ultérieures. | 

Le hasard peut d’ailleurs intervenir, d’une part dans le choix des points 
de division, d’autre part dans celui des fonctions X(t) et Y(t). Le point 


3 Compositio Mathematica, vol. 7 (1930), pp. 283-339. Ce travail sera désigné dans 
la suite par l’abréviation “ processus,” et notre livre cité dans le texte sera désigné 
par “ Var, aléatoires.” 
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de vue le plus simple consiste à fixer un mode de division de l'intervalle 
d'intégration, et à considérer Y(t) et Y(t) comme aléatoires, et démontrer 
dans ces conditions l’existence des limites B et S; c’est ce que nous ferons 
au début de chacun des § 4 et 5. Mais nous envisagerons ensuite le point 
de vue inverse; c’est de ce point de vue que nous considérons qu’il introduit 
en analyse une notion d’une nature toute nouvelle. Pour chaque détermina- 
tion, soit de A(¢), soit de la courbe C, les points de division étant choisis 
au hasard, il peut arriver que les sommes utilisées pour définir B et S con- 
vergent presque sûrement vers des limites, et cela sans qu’on puisse conclure 
à l’existence de l’intégrale au sens ordinaire, c’est-à-dire d’une limite in- 
dépendante du mode de division de Vintervalle intégration et existant dans 
tous les cas. Il s’agit 14 de propriétés non aléatoires de chaque fonction X (t) 
ou de chaque courbe C. Le résultat peut-être le plus important de ce travail 
est que, dans le schéma aléatoire du mouvement brownien mathématique, 
on obtient presque sûrement des trajectoires ayant ces propriétés. Quant 
à la nature de la limite, dans le cas de B, elle n’est pas aléatoire; on a B— 1. 
Dans le cas de S, c’est une nouvelle variable aléatoire dont nous avons cherché 
à définir la nature; c’est l’objet de la fin du $ 5 (5° à 12° de ce paragraphe). 
Il ne semble pas que la fonction de répartition de S soit susceptible d’une 
expression simple; nous donnons plusieurs résultats relatifs à cette fonction, 
dont le plus important est peut-être le suivant: la détermination de la loi à 
deux variables § et L (Z étant la longueur de la corde sous-tendant Pare C) 
dépend d’une équation aux dérivées partielles du second ordre et du type 
elliptique, vérifiée par une des dérivées de la fonction de répartition, et qui, 
compte tenu de ce qu’il s’agit d’une fonction de répartition, la détermine 
complètement. 

Observons, en ce qui concerne B, que le fait que le hasard réalise avec 
une probabilité unité des fonctions pour lesquelles cette intégrale existe, 
n'implique pas qu’il soit facile de nommer une telle fonction. , Le § 4 contient, 
sur ce sujet, quelques remarques qu’il peut être utile de compléter et préciser. 
On pourrait étudier aussi laire S au même point de vue, en cherchant à 
nommer une courbe pour laquelle Paire existe au point de vue stochastique, 
mais non au point de vue de l’analyse ordinaire. 

Au $ 6, nous montrerons que la courbe C est, avec une probabilité unité, 
un ensemble de mesure superficielle nulle. Pourtant le fait que B soit positif 
implique que la courbe fasse assez de détours infiniment petits pour pouvoir 
remplir une aire. Mais, pour qu’elle remplisse effectivement une aire, il 
faudrait une organisation de ces détours infiniment petits que le hasard n’a 
aucune chance de produire. 
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Cette remarque s’étend à d’autres schémas aléatoires que celui du mouve- 
ment brownien. Au $ 7, nous étudions quelques exemples de tels schémas. 
‘Tous vérifient cette condition que la courbe IT décrite par le point mobile . 
quand le paramètre ¢ varie de zéro à un est composée de deux arcs stochastique- 
ment semblables à la courbe complète; nous entendons par là que n’importe 
quel ensemble de courbes qui sont des courbes I’ possibles, est aussi, à une 
similitude près, un ensemble de formes possibles de chacun de ces arcs, et 
` cela avec la même probabilité que pour T. Cette propriété, dans le cas du 
mouvement brownien, est réalisée pour Parc correspondant à n'importe quel 
intervalle de variation de t, et le rapport de similitude stochastique est la 
` racine carrée de la longueur de cet intervalle. Dans le cas des schémas 
étudiés au § 7, cette propriété n’appartient qu’aux ares correspondant aux 
intervalles de variation de ¢ compris entre deux multiples consécutifs de 2, 
` de sorte que l’ensemble des valeurs de ¢ qui sont des fractions dyadiques joue, 
dans l’étude de ces courbes, un rôle tout à fait remarquable. Le rapport de 
similitude stochastique, suivant les schémas étudiés, est déterminé pour chacun 
des arcs partiels considérés, ou au contraire aléatoire; mais dans ce cas les ` 
rapports relatifs aux différents arcs ne sont pas indépendants. Toutes les 
fois que la somme des carrés de ces rapports, étendue à n’importe quelle 
division de'la courbe en arcs stochastiquement semblables à la courbe entière, 
est égalé à l’unité, on peut dire, comme dans le cas du mouvement brownien, 
que la courbe fait assez de détours infiniment petits pour pouvoir remplir 
une aire, Il peut alors s’agir de courbes non aléatoires, composées de parties ` 
. semblables au tout, et notamment de deux courbes bien connues qui seront | 
désignées par I et IT, et qui remplissent effectivement des ‘aires. Mais, 
toutes les fois que le hasard joue un rôle. suffisant dans la définition de la 
courbe, pour les mêmes raisons que dans le cas du mouvement brownien, 
il est infiniment peu probable que la courbe remplisse effectivement une aire. 

Nous étudions aussi, pour ces schémas, laire S comprise entre Parc et 
. la courbe. Dans les cas où nous venons d'indiquer que la courbe pourrait 
à première vue remplir une aire, mais n’a aucune chance de le faire effective- 
ment, cette aire se présente sous la forme d’une somme 3 + sy d’aires triangu- 
laires, la série, 3s,” étant convergente.. Si alors les signes sont choisis au 
hasard, la série qui définit S est presque sûrement convergente; il est presque 
sûr que la courbe étudiée limite une aire définie au sens indiqué à propos 

du mouvement brownien. Si au contraire tous les termes sont positifs, ou si 
` (ce qui sera le cas pour les schémas qui seront désignés par les notations 
T: et T2) ils sont groupés en groupes “tendus de termes de même signe, 
la série considérée est divergente; & apparaît alors comme infini ou in- 


ij 
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déterminé. Comme le signe des aires triangulaires + sy peut être défini par 
une loi précise même dans des cas où le hasard intervient par ailleurs dans ` 
la définition de la courbe, la question de Vexistence de Paire § n’est pas 
liée à la précédente; du moins l’existence de Paire S ne permet aucune con- 
clusion au sujet de la mesure -periei de Fensémble des points de la 
courbe. 

Le grand nombre des schémas aléatoires que l’on pourrait ainsi étudier “ 
et des problèmes qui se posent nous a obligé, non seulement à faire un choix, 
mais dans certains cas à nous contenter de démonstrations résumées ou même 
d'énoncés sans démonstrations, et aussi à énoncer sans en indiquer la solution 
des problèmes qui nous semblent mériter d’être étudiés ultérieurement: 

Le plan initial de ce travail comportait un dernier paragraphe consacré 
à des types d’intégrales et d'équations différentielles stochastiques qui géné- 
ralisent laire 9, et notamment à l’intégrale 


U— f #(X Y, 2)47 (0 +x(X, Y, Z)AY (t) +4(X, F,2)42(0) 


qui définit le travail de la particule mobile dans un champ de forces, et à 
Véquation aux différentielles totales 


aU (t) = $(, ¥, 2,0) ax (t) +x(4, ¥,2,U)d¥ (t) + y(4,¥,2,U)dZ(t), 


X, Y, Z étant les coordonnées d’une particule dans le mouvement brownien 
à trois dimensions, et ¢, x, y étant des fonctions continues à la Lipschitz. 
‘On peut définir U comme limite de sommes ou de solutions d'équations aux 
différences finies. Le point de vue auquel nous envisageons l’étude de ces 
équations est d’ailleurs différent de celui adopté par M. S. Bernstein dans son 
mémoire connu sur les équations différentielles stochastiques en ce sens que 
nous supposons les expériences qui déterminent X, Y, Z effectuées avant Pin- 
tégration. L'intégrale U(t) ayant une valeur initiale donnée sera donc une 
fonctionnelle dépendant d’une manière non aléatoire des trois fonctions. 
aléatoires X, Y, Z. En vertu de propriétés presque sûres de ces fonctions, 
les opérations qui aboutissent à la définition de U(t) ont un sens, dans les 
conditions indiquées à propos des expressions B et 9: il peut exister des modes 
de division de l'intervalle d'intégration en intervalles partiels pour lesquelles 
il n’y ait pas convergence de ces opérations ; mais ces modes de division sont 
exceptionnels. i 
On peut naturellement généraliser la théorie padite en prenant pour 
la trajectoire du point X, Y, Z un schéma aléatoire différent de celui du 
mouvement brownien. 
Les circonstances présentes (en septembre 1939) m’ont décidé à renoncer, 
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pour le moment, à la rédaction de cette dernière partie, et à demander à la 
direction de l'American Journal of Mathematics de publier ce travail dans 


- son état actuel; je Pen remercie à l’avance.f 


1. Définition de la fonction aléatoire X(t). Nous considérerons une 
variable réelle ¢, variant de zéro à l’infini, et désignerons par X(t) une 
fonction aléatoire de ¢ ayant les caractères suivants: X(0) —0; quels que 
soient t= 0 et r > 0, l'accroissement AX (t) — X (t +7) —X(t) est une 
variable gaussienne d'écart type Vr; il est de plus stochastiquement indé- 
pendant du passé [¢ est-à-dire de l’ensemble des valeurs prises par X(t) dans 
Vintervalle (0,t)]. 

Rappelons comment on peut définir une suite d'expériences aboutissant 
à la détermination d’une fonction aléatoire X(t) ayant effectivement les 
caractères précédents, Considérons à cet effet trois valeurs fo, tı, t de t 
(to < tı < te), et posons 

X’ am X(t) — Z(t), MX’ —=X(t2) —X(h), 
X=% +Z” = X (#2) — X (to). 


Les conditions imposées à X(t) ne sont évidemment compatibles que parce _ 
que la somme des deux variables gaussiennes X’ et X”, d’écarts types respectifs 
Vti — to et Vty— t, est bien une variable gaussienne d'écart type V ts — to. 
Dans ces conditions, quand on connaît X (to), on dispose de deux procédés 
stochastiquement équivalents pour déterminer X(t,) et X(t). On peut, par 
deux expériences indépendantes, déterminer les deux accroissements successifs 
X’ et X”. On peut aussi déterminer d’abord Vaccroissement total, puis 
interpoler, c’est-à-dire déterminer X (t,) d’après la loi de probabilité condi- 
tionnelle dont dépend cette variable lorsque X(t.) et X(t.) sont connus. 
Dans ces conditions, X (tr) a pour valeur probable le nombre 


(ts ~~ t) X (ty) + (CA — to) X (t2) 


Fes tz — to 


obtenu par une interpolation linéaire, et Æ (t) — m, est une variable gaus- 
siene d’écart type 
_ ee 
o> ql ZA: to | 
En particulier, si fı — fo == t: — l1 = T, ON & 0, = Vr/à, c’est-à-dire que 
la différence entre Æ(#:) et sa valeur probable est une variable gaussienne 
d'écart type V? fois plus petit que quand on connaît seulement X (to). 


t Quelques-uns des résultats établis dans ce travail ont été énoncés dans deux 
Notes présentées à l’Académie des Sciences (0. R., t. 207, p. 1152; t. 209, P. 140, -” 
et erratum, p. 387). 


LE MOUVEMENT BROWNIEN PLAN. 493 


Ce résultat est en relation évidente avec les propriétés connues de la loi 
de Gauss à deux variables, dans le cas isotrope: d’après ces propriétés, si X’ 
et X” sont deux variables gaussiennes indépendantes de même écart type Vr, 

7 + "ELA + Ff 
PP Eee et 

sont deux variables gaussiennes indépendantes d'écart type Vr/2; cette 
remarque définit parfaitement la loi dont dépend X’ lorsque X est connu. 

On peut donc, sans changer la loi de probabilité imposée pour l’ensemble 
des trois variables X (to), X(1:), et X(t2), procéder par interpolation, c’est- 
à-dire déterminer X (t,) seulement après la détermination préalable de X (to) 
et X (la). Par suite, pour déterminer V(t) dans l'intervalle (0,1), on peut 
déterminer (1), puis, par des interpolations successives, déterminer X(4), 
puis X (4) et Æ(4), et ainsi de suite. Désignons par X,({) la fonction con- 
tinue égale aux valeurs ainsi obtenues quand ¢ est multiple de 2-", et variant 
linéairement dans chacun des intervalles compris entre deux multiples con- 
sécutifs de ce nombre; Xa (t) peut être considéré comme la n*™e approximation 
de X(t), et X(t) peut être défini comme la limite presque sûre de ces 
approximations. On a en effet le théoréme suivant: 


THEOREME 1. Il y a une probabilité unité pour que, quand n augmente 
indéfiniment, X,(t) tende vers une fonction continue X(t), et cela uni- 
formément dans l'intervalle (0,1). 


La démonstration est très simple. Désignons par ô, le maximum de 
| Xna (t) — Xa(t)|, dans l’intervalle (0,1). C’est le plus grand des modules 
de 2» variables gaussiennes de même écart type q™*? (q == 1/V2). Compte 
tenu du lemme de Boole, on a donc 


Qn+1 
1 En == Prfôn = qtr} = — 
(1) a { q? Tn} N 


e2), 





Si l’on prend pour £a la valeur cV 2n log 2, avec c > 1, a, est le terme 
général d’une série convergente. D’après le lemme de M. Cantelli (ou celui 
de M. Borel, puis-que les 6, sont indépendants), il existe alors presque 
sûrement un nombre fini V tel que, pour n > N, on ait 6, < cg™* Vn log 2, 
ce qui établit la convergence uniforme presque sûre de ,(¢) vers une limite, 
évidemment continue, X(t), c. q. f. d. 

Bien entendu le nombre W est aléatoire. Si donc la convergence est 
uniforme par rapport à X(t) (de zéro à un, ou dans n'importe quel intervalle 
fini), elle n’est pas uniforme par rapport au choix de X(t); mais il suffit 
Wécarter des cas de probabilité arbitrairement petite pour qu’elle le devienne, 
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On vérifie aisément que la fonction X(t) ainsi obtenue vérifie toutes les 
conditions indiquées dans la définition théorique. On peut d’autre part 
arriver au même résultat en remplaçant l’ensemble des fractions dyadiques, 
qui joue un rôle essentiel dans les expériences que nous venons de définir, 
par n’importe quel ensemble de valeurs de ¢ dénombrable et partout dense 
dans l'intervalle où l’on veut définir X(t). Le résultat obtenu est stochastique- 
ment indépendant du choix de cet ensemble. 

Rappelons maintenant trois lemmes dont le lecteur trouvera la démon- 
stration dans nos travaux antérieurs [Processus, formules (15) et (20); 
- Var. aléatoires, p. 172]. Le premie remonte à Bachelier. 


LEMME 1. Pour x > 0, ona 
Fe Max X(u) > a} — Pr{| X(t)| >ca}—V a/mt Sr ew/2iqu. 


‘Lemme 2. Pour x supérieur à la fois à 0 et a, ona 


Pr{ Max X(u) > 2/X(t) =a} = eet, 
ut 


[Rappelons que Pr{A/B} désigne la probabilité de À dans l’hypothèse B.] 


Lamas 3. Etant donnés tı > 0 et c>1, il existe presque sûrement un 
nombre n > 0 tel que, pour t St, e 0 <r Sq, on ait. — 


PX (t-+ 7) —X(t)| < eV 2 log1/r. ‘ 
84 au contraire c < 1, la probabilité de l'existence de y est nulle. 


2. Etude locale de X(t). Nous pouvons nous contenter d’étudier les 
propriétés de X(t) au voisinage de l’origine ; les résultats obtenus s’appliqueront 
évidemment au voisinage de n’importe quel autre point, soit à droite, soit à 
gauche de ce point; mais bien entendu, si l’on trouve qu’une certaine circon- 
stance est infiniment peu probable au voisinage de n’importe quel point donné 
davance, cela n’empêche pas qu’il puisse exister avec une probabilité positive 
des points, impossibles à connaître à raven au voisinage Le elle 
goit réalisée. : 

Commençons par établir un théorème qui ramène l'étude locale de X(t). 
à l'étude asymptotique de cette fonction, pour ¢ infini. Pour l’énoncer, nous 
poserons 


(2) | t=, X(t) = Vig(u), 


_ de sorte que ¢(t) est, pour chaque valeur deu, une variable gaussienne réduite. 


À: 
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Twtortare 2. La définition stochastique de $(u) est invariante par le 
changement de u en —u (damien aussi par celui de # en u + c). | 


-© En d’autres termes, les propriétés stochastiques de X @/ V# sont in- 
variantes par le changement de ¢ en 1/1. 
Considérons d’abord la suite des valeurs t, = q” de t, q étant ici un 
nombre quelconque entre zéro et un; posons X (tn) == Xy= px Vs. Do 
l'indépendance des accroissements successifs de X(t), on déduit 


(8)  . pni = bn V + da VIT, 


¢’n étant une variable gaussienne réduite indépendante de du, nus” °° 3 
d’autre part, d’après le principe d’interpolation exposé au § 1, X (tes) pouvant 
être déterminé par interpolation entre X(0) et X(t), on a 


(4) bn = da VI Hon VI =O 

$”n étant une variable gaussienne réduite indépendante de $n, nm-re 
donc les mêmes expériences à faire pour déterminer la suite des #, de gauche 
à droite, ou bien de droite à gauche. En d’autres termes la nature stochastique 
: de cette suite est invariante par le changement de n en — n (ee en h—n,h 
étant un entier donné). i 

Cette symétrie stochastique n’est d’ailleurs pas détruite quand, après 
avoir déterminé ¢(n) (c’est-à-dire p-n, si on a pris pour q la valeur 1/e) 
pour toutes les valeurs entières de n, on effectue des interpolations pour 
déterminer les nombres (n + 4); chaque nombre ¢(n-++ 4) a en effet la 
même correlation avec ¢(n) et avec ¢(n+1). Le résultat obtenu n’est 
d’ailleurs autre que la suite des ġa pour g—e#. En effectuant de nouvelles 
interpolations, on arrivera à définir p(u) pour les valeurs de u multiples de 4, 
puis de 4, et ainsi de suite. A chacune de ces opérations, la symétrie est con- 
servée, et l’on aboutit, à la limite, à la détermination de ¢(w) par un processus 
stochastique absolument symétrique. Or. il équivaut bien à celui par lequel 
nous avons défini p(u), puisque, dans la définition de X(t), rien n’empéche : 
de choisir, pour les interpolations, les valeurs de ¢ dont les logarithmes sont 
les valeurs de u qui interviennent dans le procédé de détermination de pu) 
` que nous venons de décrire. Le théorème 2 est ainsi démontré. 

Tl entraîne bien simplement de nombreuses conséquences. Ainsi Pen- 
semble des racines de X(t) n’a presque sûrement aucune borne supérieure; 
cela résulte aisément de ce que, d’après le lemme 1 appliqué à æ—%X{u), 
on peut déterminer une fonction f{t,z) supérieure à ¢ et telle que, dans 
Vhypothése X(t) =z, X(u) ait, avec une probabilité donnée «, au moins 
une racine comprise entre t et f({,z). En prenant alors pour tre un nombre 
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arbitraire, et posant tm.i—f[7,X(7.)], on obtient une suite de nombres 
aléatoires croissants qui séparent des intervalles dont chacun a une probabilité 
æ de contenir une racine de X(t). Ces probabilités étant indépendantes, 
il résulte de la loi forte des grands nombres, sous sa forme la plus classique, 
qu'il y a presque sûrement une infinité d’intervalles (Ta, tau), ayant une 
fréquence tendant vers æ, contenant chacun au moins une racine de Y(t). 

Il résulte alors du théorème 2 que les racines positives de X(t) ne sont 
pas non plus bornées inférieurement: il est presque sûr que la racine zéro'de 
X(t) west pas isolée. | 

Bien entendu, comme toutes les fois que l’on applique une méthode de 
transformation, on peut transformer, non seulement le théorème, mais Ba 
démonstration, et obtenir ainsi une démonstration indépendante de la trans- 
formation employée. Dans le cas qui nous occupe, c’est ce qui donne à la 
fois la démonstration la plus simple du résultat obtenu, et celle qui permet 
le mieux son extension à d’autres types de fonctions aléatoires. 

L’ensemble des racines de X (1) étant fermé, et ne comprenant aucun 
intervalle, il existe bien entendu des racines isolées à gauche, ou à droite; ` 
elles forment au plus un ensemble dénombrable. Si nous définissons une 
racine # par la condition d’être la plus petite racine äu moins égale à un 
nombre donné to, sauf peut-être dans l'hypothèse infiniment peu probable 
X (to) = 0, elle est isolée à gauche; ce renseignement ne modifiant en rien 
Pallure probable de A(t) pour t > to, il est presque sûr que @ est, comme 
zéro, une racine non isolée à droite. Or il y a au plus une infinité dénombrable 
de racines isolées à gauche, done d’occasions de trouver une racine isolée à 
la fois à gauche et à droite, et chaque fois la probabilité de cette circonstance 
est nulle. La probabilité totale est donc nulle: tl my a presque sûrement 
aucune racine de X(t) isolée à la fois à gauche et à droite. 

Naturellement, ce résultat s’applique aux racines de X(t) — 7v, si x est 
donné. Pourtant X(t) a dans tout intervalle une infinité dénombrable de 
maxima et’minima; mais l’ensemble des valeurs maxima ou minima de X(t) 
n’a aucune chance de contenir une valeur x donnée d’avance. 

Pour une étude plus complète de l’ensemble des racines de X(t), le 
lecteur peut se reporter à notre mémoire antérieur (Processus, § 7). 

Indiquons une autre application du théorème 2. M. Khintchine a établi 
le théorème du logarithme itéré, d’après lequel : 


) 
5 Pr À Yim su eel tan 
(5) l HS V2 log | u | 
Il Pa démontré successivement pour u tendant vers + œ, et pour u tendant 
vers — œ {done t vers zéro). D’après le théorème 2, un de ces résultats 
entraîne l’autre. 
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Nous nous proposons maintenant, en supposant toujours, pour fixer les 
idées, g < 1, d'étudier quelques propriétés de la suite des nombres Yn — $n V tn. 
D’après la relation de récurrence (4), ils forment une chaîne de Markoff 
simple et homogène dans le temps (c’est-à-dire stationnaire; mais ce terme, 
généralement employé, nous paraît impropre). Nous pourrions renvoyer le 
lecteur à la théorie générale de ces chaînes; mais l’étude précise d’un cas 
particulier n’est peut-être pas inutile. 

Indiquons d’abord la formule 





(5) Pr f tim sup LWL bos, 

n>% VW 2 log n 
analogue à la formule (5). Il nons suffira, pour la suite, de savoir que la 
limite considérée ne saurait dépasser l’unité, c’est-à-dire que, si c > 1, il existe 
presque sûrement un nombre N tel que, pour n > N, on ait 


(6) [nl <cV2logn. 


“Cela résulte immédiatement de ce que la probabilité de l'inégalité inverse, 
égale à 


vf a equ < NE f i e2 _ udu = — t ; 
a J'oVatogn m d) ovja logn cV2logn cn®Vrlogn 


est le terme général d’une série convergente. 
Démontrons maintenant que la loi forte des grands nombres s’applique 
à la suite des | 4, |, c’est-à-dire que: 


THÉORÈME 3. La fréquence des on inférieurs à un nombre donné x tend 
presque sûrement (et cela d’une manière uniforme) vers la probabilité théorique 


Prion < t} = F(x) = f ohda. 


Nous montrerons que la différence entre cette fréquence et F(s) devient 
presque sûrement inférieure en valeur absolue à un nombre donné arbitraire- 
ment petit que nous désignerons par 3e. Il suffit évidemment d'établir le 
résultat analogue à celui énoncé, mais ne faisant intervenir que les valeurs 
des #, pour les valeurs de n de la forme no + vp (v= 0,1,2,° ><); p est un 
entier que rien n’empéche de choisir en fonction de e et aussi grand qu’il est 
nécessaire, et no est un quelconque des nombres 1,2,: : +, p. [idée directrice 
de la démonstration est que, si p est assez grand, n et onp sont presque 
indépendants, et qu’on peut appliquer la loi forte des grands nombres comme 
gil s'agissait d’une suite de variables aléatoires indépendantes. 
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D’après la formule (4), que Von peut appliquer à l’étude de la corrélation 
~ entre dx et dais à condition de remplacer q par g*, on a 


(7) pra =p VO + VI 
Y» étant une variable gaussienne réduite, indépendante de ¢,. La corrélation | 
entre D et pau est bien, d’après cela, d’autant plus faible que À est plus grarid. 
En désignant par ¢ un nombre positif donné, nous choisirons A assez grand 
pour que ` 


(8)  6VGS4 ViI-gei—e 
Si alors | ġa | = ¢, il résulte de la formule (7) que 
(9) : ha on | Se(1+ | ya |), 
. d’où Pon déduit aisément, du moins si e est assez petit 
(10) | Pr(daa < 2/1 bx |= 4} —F(2)| = 


‘Si y. désigne une variable gaussienne réduite, l’expression 


an on hp( (LÉ B =) 


où. Max [a, b] désigne le plus grand des nombres a et b, est une variable 
aléatoire à valeur probable finie. En désignant sa fonction de répartition par 
G(w), on peut done prendre pour p un entier tel que 


S E (wo) Se, 


et, en désignant par H le plus petit multiple de p au moins égal à Q, sa | 
` valeur probable est 


En) = f” His) < S 2860) + f7 (p+ 0) dG (0) phe 


Désignons alors par Wo, W1,: * -,Ws,° * « des déterminations indé- 
pendantes de y, et, par Hx, la détermination de H qui correspond à y's. 
srl n’y a qu’à utiliser cette remarque évidente que, si | log $| < e (avec 


Wy > 0), et |Y”—W |< e et si, f(æ) étant la densité de probabilité de y, on 
a|æ|f(o) <m et f(s) < m, on commet sur la fonction de répartition une erreur 
au plus égale à em + am, en remplaçant y par ¥”. Dans le cas de la loi de Gauss, 


FF 
formule (10) en résuite. 


m =m =; si e est assez petit, on peub prendre a= e et e= (Võr— l)e; la 
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D’après la loi: forte des grands nombres (qui s'applique à H, d’après un | 
. théorème de À. Kolmogorof), on & presque sûrement 
(12) tim Hah Mott He eo cp te 
20 
Considérons alors la suite des nombres 
Ny = to, Ni = No + Ho,: í Nu Net Hr: 773 


quel que soit Ho, ils comprennent la plupart des termes de la progression 
VENTE 


eee eae ie Š sno + vp,: FT 


_ la fréquence des termes qui manquent étant presque sûrement, à partir d’un 
certain moment, inférieure à «/(p + e), donc à e On peut donc ne considérer 
que les valeurs de ñ de la forme Nz, et il suffit de démontrer que, parmi ces 
valeurs, la fréquence de celles pour. lesquelles n < x diffère de F(z) d'au 
‘plus 2e. | 
Nous allons montrer à cet effet que, pr détient la détermination de 
én pour n = Nz, on peut appliquer la formule (8) pour ¢= | gx | et h = Ha, 
et par suite aussi la formule (9), dans laquelle on peut évidemment identifier 
Yn avec la variable gaussienne désignée ci dessus par yz; elle s’écrit donc 


(9) [prnpr Se(14 | Ye |). 

Tl wy a qu’à procéder par récurrence. La définition de He étant restée 
arbitraire, nous pouvons supposer ce nombre assez.grand pour que le résultat 
énoncé soit vrai pour k=-0. Nous pouvons alors le supposer vrai pour une 


certaine valeur k. Il résulte dans ces conditions de la formule (9), et de la 
définition de H (d’après pp H Z= Q9), que Pona - 


Hen = Toga] ing 3! log Max (3 Pkn’, 5) ; 
c’est-à-dire 

| dada < & 1— gra = (1 — «)?. 
La nouvelle application de la formule (8), et par suite celle de la formule 
(9’) pour la valeur k + 1, sont donc justifiées. 

La formule (10), conséquence des formules (8) et (9), s'applique donc 
aussi pour n -p h == Ny + Hk = Neu, pna = deu, | onl —o—| $x |. La 
fonction de répartition conditionnelle dont dépend #’:1, lorsque #x est connu 
diffère donc de F(z) dau plus e, 'et, d’après la loi forte des grands nombres 
relative aux variables enchaînées, la fréquence des valeurs inférieures à x 
parmi les nombres #1, d'2,° * *, dx est presque sûrement, à partir d’une, 
cértaine valeur de k, comprise entre Ee) — 2e et F(x) + 2e, ce. q. É d. 
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COROLTATRE 1. La fréquence des valeurs*de n pour lesquelles on a 
(18) o l er ee 


tend presque stirement, pour n infini, vers la probabilité théorique de -ces 
inégalités, déduites de la formule (4), c'est-à-dire vers l'expression 


un | f+ — Va 

a an Je Ce 2(1— q) ) aan 

Indiquons seulement le principe de la démonstration, dont le lecteur : 
reconstituera aisément les détails. Chaque petit intervalle de valeurs possibles 
pour dy est réalisé pour les valeurs 1,2,---,n de y avec une fréquence peu 
différente, si n est grand, de sa probabilité théorique; donc un grand nombre 
de fois. On peut donc appliquer de nouveau Ja loi forte des grands nombres 
et conclure que ces valeurs de $ entraînent les différentes valeurs possibles 
pour œv, avec des fréquences peu différentes de leur probabilité théorique. 

Ce corollaire s’étend sans difficulté au cas où Pon considère simultanément 
un nombre quelconque de termes consécutifs de la suite des on. 


COROLLAIRE 2. La fréquence des changements de signes dans la suite des 
: bn tend presque sûrement vers (1/r) Arc tg V (1—q)/q (donc vers $, si q== $). 
`- Ce corollaire est évidemment un cas particulier du corollaire 1. 


CoRoLLaIRE 3. La fréquence des valeurs de n pour lesquels l'intervalle 
(tn tn) contient au moins une racine de X(t) tend presque sûrement vers 


(2/x) ArcigV (1 —q)/4. 


Ce corollaire résulte immédiatement du Corollaire 1, et de la loi forte des 
grands nombres. Si la suite des X» est supposée connue, la fonction X (t) 
devant être déterminée ensuite par des interpolations dans chacun des inter- 
= valles (fus, x), la fréquence des intervalles contenant au moins une racine: 
est presque sûrement infiniment peu différente de la moyenne des probabilités 
conditionnelles théoriques. Rappelons que, pour chacun de ces intervalles, 
cette probabilité conditionnelle, évideminent égale à un pour Æ,X,4 = 0, 
est dans le cas contraire, d’après le lemme 2, | 


2X,2 mi \ 2hnhnir 
ex ( pres) — exp ( Vi q | 
. I résulte évidemment du corollaire 1 que cette probabilité conditionnelle 
est presque sûrement convergente en moyenne arithmétique vers sa valeur 


probable. La fréquence des. intervalles® (4,1, ,) qui contiennent au moins 
‘une racine converge donc elle-même presque sûrement vers cette valeur prob- 





LE MOUVEMENT BROWNIEN PLAN. 501 


able, c’est-à-dire vers la probabilité a priori de Pexistence d’une racine dans 
Pintervalle (tn, én) quand on ne connaît ni X,,ni Xn. Cette probabilité 
a la valeur connue (2/x) Arc tg V (1—q)/q [Processus, formules (42) et 
_ (44)°], ce qui termine la démonstration du corollaire 3. 


3. Le mouvement brownien plan. 1°. Ce mouvement est celui d’un 
point A(t) dont les coordonnées rectangulaires X(t) et Y(t) sont deux 
fonctions aléatoires du type que nous venons d'étudier, indépendantes l’une 
de l’autre. Pour chaque valeur du paramètre ¢, que nous appellerons la cote 
de A(t), ce point dépend de la loi de Gauss isotrope, d'écart type Vt; nous 
appelons ici écart type, non la valeur quadratique moyenne de la distance 
R(t) du point A(t) à l’origine 0, mais la valeur quadratique moyenne com- 
mune de X(t) et Y(t). On sait que R(t) dépend de la loi défini par 
(15) Pr{R(t) > r} = etl, 


Les propriétés du vecteur A(t)A(t-+ 7), déplacement du point mobile 
pendant l’intervalle de temps (t, t+ 7), sont naturellement indépendantes de 
t, et du passé; ce vecteur dépend de la loi de Gauss isotrope d'écart type Vr. 

De nombreuses propriétés du mouvement brownien plan sont des con- 
séquences si évidentes des propriétés correspondantes du mouvement brownien 
linéaire qu’il suffit de les énoncer. Tel est le cas du principe d’interpolation : 
si to < tı < to, et si A(t.) et A(t:) sont connus, la position probable de 
A(t,) est le point My obtenu par une interpolation linéaire, et df,A(t,) 
dépend de Ja loi de Gauss isotrope d’écart type 


tı — to) (ta — ty) 
nm JS), 
2°. Pour étudier la forme de la courbe au voisinage de l’origine 0, nous 
considérerons toujours la suite des valeurs ty = q” det (0 <q <1). Nous 
écrirons A, par abréviation de A(t,), et désignerons par M, la position pro- 
bable de ce point quand 0 et 4, sont connus; c’est le point défini par la 
formule vectorielle 


OM, = Än. 


On remarque que OM, et H,An sont deux vecteurs, indépendants l’un de 
Pautre, et dépendant respectivement des mêmes lois que OAnu et Ana Án; les. 
propriétés stochastiques des deux triangles OMndAn et OAnsAn sont done 
identiques (si q = à, on peut écrire indifféremment OMnAn ou AnMa0). 


ê Ces deux formules, que nous avons établies séparément, sont équivalentes; dans 
l’une, l'intervalle que nous désignons ici par (ta t,), est désigné par (ttu); 
dans l’autre, il est désigné par (t— «u, t). 
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L’angle 0 de chacun de ‘ces triangles, orienté et compté de —7 à + r, 
est une variable aléatoire 6 dont la loi ne dépend que de g; elle est symétrique; 
.la valeur quadratique moyenne de @ est un nombre pa o = 0(q), in- | 
férieur à 7/2. 
3°. Nous désignerons par Ra == pnVÉn et @n les coordonnées diese 
de An; nous prenons pour @, la détermination obtenue en supposant 
| @ | & et en considérant les Ay comme des positions successives d’un point ` 
mobile qui se déplace sur la ligne polygonale 464,4, : : : ; chacun des accrois- 
sements nı — @, = 0, est donc une variable aléatoire du type 4 que nous 
venons de considérer." Mais les différents 6, ne sont pas indépendants. Si 
l’on détermine successivement les points A, dans l’ordre des n croissants, la loi 
conditionnelle dont dépend 6, lorsque le passé est connu ne dépend que de Ry; | 
elle est toujours symétrique et comporte une valeur quadratique moyenne 0’, 
fonction de Ra et q; la valeur probable a priori de g'a? est naturellement o?. 
En ce qui concerne les Rp, on établit aisément le théorème guivant, 
analogue au théorème 3 relatif au mouvement brownien linéaire: 


THÉORÈME 4 La fréquence des valeurs supérieures à r dans la suite des 
pa tend presque sûrement vers la limite 


Prpn > 1} = Pris > rV ty} = 672, 


Nous ne développerons pas ig démonstration, tout à fait analogue à celle 
du théorème 3. On peut d’ailleurs aussi, en tenant compte de l'indépendance 
de X(t) et de Y(t), le considérer comme un corollaire du théorème 8. 

Indiquons aussi, en ce qui concerne les Rp, la formule 





(16) Pr À tim sup ul foe 
n> V2 lo 
analogue à la formule (5°). | 
Comme il est évident que Rx = | Xn |, done pm=|#{|, la borne 
inférieure donnée pour | na| par la formule (5’) s’applique à ps. Pour 
établir la formule (16), il reste à montrer que, pour c > 1, il existe presque 
sûrement un N tel que, pour n > N, on ait 


(17) pn < cV? logn. 


Cela résulte évidemment de ce que la probabilité de Gi inverse, n°, 
est le toms général d’une série convergente. 


TI y a indétermination dans la valeur à prendre pour ©, si Pun des |6,| 
{»=0,1,...,n— 1} a la valeur v, c'est-à-dire si la ligne A,A,: - - À, passe en ©. 
La probabilité de cette circonstance étant nulle, il n’en résulte aucune difficulté. 
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On peut aussi déduire la formule (17) de la formule (6), et de la 
remarque que langle ®), dont dépend l’orientation de la courbe autour de 
Vorigine, est choisi au hasard et indépendant de la suite des pn. Si alors on 
. pouvait trouver, avec une probabilité positive, des valeurs de n arbitrairement 
grandes pour les quelles on ait pa = cV logn, il en serait de même en 
imposant la condition supplémentaire cos @, > cos a, æ étant assez petit pour 
que ccosa = č > 1; cela est en contradiction avec la formule (6), écrite 
en remplaçant c par c. | 

Les grandes valeurs des pn ne sont donc pas plus grandes que celles des 
| $n |; dans un cas comme dans l’autre, on peut appeler grandes valeurs celles 
qui sont supérieures à (1— e) V2 log n, e étant un nombre positif donné et 
très petit. Mais bien entendu les grandes valeurs de pn sont plus fréquentes 
que celles des | n|. Eles correspondent à des vecteurs 04, ayant, à un 


angle arbitrairement petit près, toutes les orientations possibles, et il faut 


choisir ceux qui font avec une direction donnée un angle très petit ou très 
voisin de m pour trouver une grande valeur de | #, |. 


4°, Occupons-nous maintenant des angles 8, et On. Remarquons d’abord 
que, comme conséquence du fait que les différentes valeurs possibles pour pr 
sont réalisées avec des fréquences tendant vers leurs probabilités théoriques, 
et de ce que la nature stochastique du triangle 0AnAni est fonction de pn 
supposé connu, les différentes formes possibles pour ce triangle, et par suite 
les différentes valeurs de On, sont aussi presque sûrement réalisées avec des 
fréquences tendant vers leurs probabilités théoriques. Il s’agit d’une nouvelle 
application du principe utilisé pour le corollaire 1 du théorème 3. 

Notons surtout que, o’, étant une fonction (non aléatoire, bornée, et 
continue) de pn, les différentes valeurs de o’, sont réalisées avec des fréquences 
tendant presque sûrement vers leurs probabilités théoriques, et l’on a presque 
sûrement 
(18) ‘Tim test ter Elor} = o, 


n> 


et par suite aussi 


(1e) lim = (02 O8 +--+ Ou) = 28 
n> ` 


8 Cette formule peut être soit obtenue comme application directe du premier 
alinéa du présent § 3, 4°, soit déduite de la formule (18) et d’une formule que nous 
avons établie antérieurement [Var. aléateires, formule (22), p. 252] d’après laquelle 
il y a presque surement convergence en moyenne arithmétique vers zéro de la suite 
des variables 6,7 — P,’ 


+ 


+ 
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Supposons que l’on détermine d’abord les modules, puis les signes des 6y. 
Après détermination des modules, et en supposant 04, pris comme. origine 
des „angles polaires, ®, se présente sous la forme 


(19) ` - On [| +e [62] +--+ em | On|, . 
les signes ev étant choisis au hasard indépendamment les uns des autres. . 
Comme les | & | sont bornés, et que la série 30,* est divergente, il résulte 
du second théorème limite du calcul des probabilités que ©, est asymptotique- 
ment une variable gaussienne. Son écart type, d’après (18’), est un infiniment 
grand équivalent à «Van. Donc @,/5 Vn est asymptotiquement une variable 
gaussienne réduite. En termes précis: e étant un nombre positif arbitraire- 
ment petit, il existe presque sûrement un nombre N tel que, pour g quelconque 
-etn >N, on ait 


- (20) | Pr{®, < ot Vn} —F(z)|<e 


Pr désignant une probabilité conditionnelle, évaluée en. suppa, connus 
les | Oy |: 

Le nombre N est aléatoire; mais, en ee des cas de sebi 
inférieure à e on peut lui assigner une borne supérieure non aléatoire n’. 
Comme Pr est la valeur probable de Pr’, et que, dans les cas où Pinégalité 
(20) n’est pas vérifiée, son a membre est du moins au plus égal à un, 
on a, pour n>n 


(21) | Pr{@, < oz Vn} — F(2)| < 2e, 


_ c’est-à-dire que: ©, est asymptotiquement une variable gauceiennss son écart 

type est un infiniment grand équivalent à oVn. - 
L'angle ®, apparaît ainsi comme le gain d’un joueur dans une partie 

de pile ou face à enjeu aléatoire, pouvant dépendre des enjeux antérieurs, 

et déterminé pour chaque coup par une expérience préalable. La formule 

presque sûre (18) permet d’assimiler cette partie à une partie à enjeu fixe a? | 
La plupart des résultats connus relatifs à une telle partie s’appliquent 

de même ici, notamment la loi du logarithme itéré, d’après laquelle on a 

presque sûrement 

(22) Trp RER 

Ne, À ahd V 2n log: log n, 

le même résultat s’appliquant à —@,. La ligne polygonale 4:42: ` :4,:::. 

tourne donc indéfiniment (et fort irrégulièrement) autour du point 0 avant 

‘de l’atteindre, @, ayant des valeurs arbitrairement grandes des deux signes. 


°On peut aussi appliquer directement le second théorème limite du calcul des 
probabilités sous la forme, applicable à certaines suites de variables enchaînées, que 
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5°. Désignons par ©’, l'angle Ao0Ay, compté de —r à +r. Il diffère 
de ©, par un multiple de 2r qui, si p > 1, a une probabilité positive de n’être 
pas nul. Or, dans ce cas, | @’,|<|@,|. La valeur cuadratique moyenne 
de ©’, est donc inférieure à celle de @,, ce qui s’exprime par la formule 


(23) o(q?) < Vpo(q). 


Cette formule permet de comparer les angles polaires obtenus pour un 
même point Anp si l’on va de À, à ce point, d’une part en suivant le ligne 
polygonale A64:42° ` * Au, d'autre part en suivant la ligne raccourcie 
AoÂpA2p © ` Ånp Les valeurs quadratiques moyennes de ces angles sont 
respectivement Vnp o(q) et Vno(g). D’après la formule (23), la seconde 
est plus petite; cela était à prévoir: la seconde ligne évite les détours de la 
première. 

Inversement, on peut suivre de plus en plus exactement les détours de la 
courbe C en prenant pour g des valeurs de plus en plus voisines de l’unité. 
Pour que A, coïncide avec le point A(t) correspondant à une valeur de t 
donnée entre zéro et un, nous prendrons qg=—1{"/"; la valeur quadratique 
moyenne de langle polaire obtenu pour A(t) est alors Vno(q). Il y a lien 
de s'attendre à ce qu’elle croisse avec n, puisqu’en prenant pour n des valeurs 
le plus en plus grandes on suit de mieux en mieux les détours de la courbe; 
on est sûr, par ce qui précède, qu’elle croît quand on passe d’une valeur initiale 
no à une valeur n, multiple de no. Il y a intérêt, pour connaître la nature 
de langle polaire @(t) obtenu en suivant la courbe elle-même (angle bien 
défini; il s’agit d’une courbe continue ne passant en général pas par l’origine, 
comme nous le verrons plus loin), de chercher à définir l’expression 
(24) E{@?(¢)} = lim no? (#/") = : log — Fin eu. 

Nous allons montrer que cette expression est infinie. 


A cet effet, H, désignant le pied de la perpendiculaire abaissée de A, sur 
040, et p et r étant des nombres positifs, considerons l’hypothèse 


(E) 04o >p, |4H|<rVi—q, |H: | <rvV1— 4. 


Si elle est réalisée, quand q tend vers 1, AoA, est petit, et 4.04, est un 
infiniment petit équivalent (au sens de Bernoulli) à H,Ai/0Ao. On en déduit 





o?( D > a Hd?) yf 1 
25 um) int > = Pr{ BSE € ; 
(25) ge ais 
a 
j'ai indiquée antérieurement (Var. aléatoires, pp. 237-242). Jai préféré ici profiter 
de la symétrie des lois dont dépendent les @,, et de l’indépendance des e„, pour indiquer 
une démonstration plus simple. 


4 
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dans cette formule, € désigne une valeur probable calculée dans l’hypothèse 
E; en ne tenant compte que des cas où cette hypothèse est vérifiée, on à bien 
une borne inférieure de la valeur probable de 6,7/(1— q) qu’il s’agit d’évaleur. 
Si maintenant p et 1/r tendent vers zéro, les deux premiers facteurs tendent 
vers Punité ; le dernier facteur * 


e/a f > e*/"du/u 
Pp, 


augmente indéfiniment; il en est donc de même de o?(g)/(1—q), c. q. f. d. 
_ On gexplique aisément ce résultat en observant que, si la courbe passe 
très près de l’origine, l’angle polaire varie rapidement. "Or, sans avoir une 
‘probabilité positive de passer exactement à l’origine, Parc A(¢)A(1) a une 
probabilité positive d’en passer arbitrairement près; on gexplique aisément 
que cette probabilité soit suffisante pour que la valeur eo moyenne 
de @(t) soit infinie. 
Revenant alors au cas où g est fixe, considérons la suite des points Ax, 
et les valeurs qui leur correspondent de l’angle polaire 0, — O (tn). Cet 
angle apparaît comme une somme de termes aléatoires indépendants dépendant 
d’une même loi symétrique et à valeur quadratique moyenne infinie. On sait 
que dans ces conditions, par leffet de ces grandes valeurs qui se trouvent: 
réalisées de temps en temps quand n augmente, Pordre de grandeur à prévoir 
pour 0, est, si n est grand, supérieur à celui de Vn. Ces grandes valeurs 
de temps en tenips réalisées correspondant à des arcs AnA» qui passent très 
près de l’origine, on voit que, si 04, est en général de l’ordre de grandeur 
‘de Vtm il y a parfois des valeurs plus petites [et aussi des valeurs plus 
grandes, comme le montre la formule (16)]. Il en résulte que, quand t tend 
vers zéro, 0A(t), qui doit finalement devenir nul, varie fort irrégulièrement ; 
la courbe a l’aspect d’une succession de boucles qui se ferment de plus en 
.plus près de Vorigine. : 
Il peut être intéressant de préciser @avantage. Disons seulement que 

la loi à valeur quadratique moyenné infinie que nous venons de considérer a, 
pour toute exposant a < 2, une moyenne d’ordre a finie; elle appartient au 
domaine d’attraction de la loi de Gauss. Par suite la on de langle 
polaire © sur un arc A(t’) A(t”), divisée par une fonction convenable de 
t/t”, dépend d’une loi qui tend vers la loi de Gauss réduite quand ce rapport 
augmente indéfiniment ‘ou tend vers zéro. 


6°. Les résultats qui précèdent 8 aiet évidemment à l'étude de la 
courbe Ç au voisinage du point correspondant à n’importe quelle valeur donnée 
det, soit a gauche, soit à droite de ce point (gauche signifiant ici du côté des. t 
décroissants; droite, du côté des ¢ croissants). Nous appellerons tangente : 
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en un point une droite passant par ce point et laissant d’un même côté un 
petit are de courbe contenant ce point; il peut y avoir des demi-tangentes, 
à gauche, ou à droite. Au point A(t) correspondant à une valeur de £ donnée, 
ou choisie au hasard par une expérience indépendante du choix de C, il n’ 7 
a presque sûrement ni tangente, ni demi-tangente. _ 

“Par contre, il existe presque ‘sûrement des points exceptionnels, où il y 
a des tangentes. Nous avons déjà observé que tout intervalle de variation 
de t'contient une infinité dénombrable de maxima et minima de Y(t) ; chacun 
d’eux correspond à une tangente parallèle à Paxe des y. Ce résultat s’appli- 
quant aux tangestes parallèles à n’importe quelle direction, il y a une infinité 
continue de tangentes. Il y a d’autre part une double infinité de demi- 
. tangentes: toute droite coupant la courbe est une demi-tangente à chacune des 
extrémités des intervalles extérieurs à la courbe. 

I n’y a presque sûrement aucune tangente double parallèle à Paxe des y; 
les maxima et minima de X(t) constituent en effet une infinité dénombrable, 
de sorte que l’ensemble des valeurs obtenues pour ces maxima et minima n’a 
aucune chance de contenir une valeur qui soit obtenue une seconde fois. La 
même remarque s'applique aux tangentes doubles parallèles à une direction 
donnée. Par contre, comme nous allons le montrer, W y a presque sûrement 
une infinité dénombrable de tangentes doubles (mais la probabilité que l’une 
d'elles ait une direction donnée d’avance est nulle). | 

Considérons à cet effet deux arcs A(t/)A(t) et A(t”)A(&”) de C, ` 
et désignons respectivement par I” et I” les plus petits contours convexes 
entourant respectivement ces arcs. Il existe de zéro à quatre tangentes 
communes à I” et I’; comme A(t’) et A(t) sont presque sûrement in- 
térieurs à I”, et que A(t”) et A(t”) sont intérieurs à I”, si to’, th’, to” et 
t” sont choisis au hasard, ces droites sont presque sûrement des tangentes 
doubles à C. i 

Sur wimporte quel arc dé C, nous pouvons cacy deux points distincts 
A(t) et A(t”), puis prendre tp’ —t’, t/—#, to” —t”, t”—#", assez petits 
pour que I” et I” soient extérieurs l’un à l’autre et que les droites que nous 
venons de considérer existent. Pour tout arc de Ọ, il y a donc des tangentes 
doubles; donc il y en a une infinité dénombrable au moins. : 

D’autre part, pour toute tangente double A(#)A(#”), on peut définir 
dans l’espace représentant l’ensemble des quatre nombres ty’, ty’, to”, h”, 
un domaine (E — ts, t/—#, t — t, t;”— t” positifs et assez petits) 
tel que, pour tout point de ce domaine, A(t’)A(t”) soit une .des quatre 
tangentes doubles (au plus) que Pon peut définir en partant de ce point. 
Il ne peut donc y avoir qu’une infinité dénombrable de tangentes doubles. 

Désignons par T le plus petit contour convexe entourant un. arc 
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A(t’) A(t”). Nous allons montrer qu’il est presque sûrement constitué par 
une infinité dénombrable de tangentes doubles, formant un ensemble partout 
dense autour de I, c’est-à-dire que n’importe quelle tangente à T est limite 
de tangentes doubles; il n’y a pas de point anguleux. En d’autres termes, 
si M est le point de T d’abscisse curviligne s, la tangente en M à T fait avec 
une direction fize un angle 6 qui est une fonction de s continue, évidemment 
monotone, et à dérivée presque partoul nulle. 

La démonstration va résulter de propriétés presque sûres de C et de T 
au voisinage du point de T où x est minimum: la courbure est infinie, mats 
` 0 varie d’une manière continue. Ces propriétés doivent de même être vérifiées 
au point de contact de la tangente définie par une valeur de 6 choisie au 
hasard ; elles ne peuvent donc être en défaut que pour des valeurs de 0 ayant 
une probabilité nulle d’être choisies, c’est-à-dire constituant un ensemble de 
mesure nulle. Il n’en serait pas ainsi s'il y avait sur T des points anguleux, 
ou si l’ensemble des points pour lesquels la courbure est positive et finie 
constituait sur T un ensemble de mesure linéaire positive. 

Considerons donc le minimum de 2, que nous pouvons évidemment sup- 
poser réalisé à l’origine, et pour é == 0. Il suffit aussi de considérer Pare voisin 
de ce point correspondant aux valeurs positives de & Nous avons étudié 
antérieurement les propriétés X(t) dans ces conditions (Processus, $ 9, 1° et 
2°): X(t), pour une valeur de { très petite et choisie au hasard, est en 
général de l’ordre de grandeur de Vt; les grandes valeurs, fortuitement, mais 
presque sûrement réalisées, sont de l’ordre de grandeur de V2é log | logt]; 


les petites valeurs sont de l’ordre de grandeur de Vi | log? |=", «e tendant 
vers zéro avec t. 


D’autre part la variation de Y(t) est indépendante de l’hypothèse que 
X(t) soit positif, Il est donc presque sûr que, dans la suite des valeurs de é 
tendant vers zéro et correspondant soit aux petites valeurs, soit aux grandes’ 
valeurs, de X(t), on trouve des valeurs arbitrairement grandes de Y(t)/ Vi 
(positives ou négatives), mais que Y({)/V 2t log | log t | est borné (et asymp- 
totiquement = 1). 

De la comparaison des inégalités ainsi obtenues pour X (t) et Y(t) résulte 
dune part que F(#)/X({) prend toutes les valeurs possibles entre — œ et 
+ œ, Tautre part que ¥?(t)/X(t) tend vers zéro. L’hypothèse d’une 
courbure finie et celle d’un point anguleux sont ainsi exclues, c. q. f. d. 
Indiquons enfin sans démonstration V’extension suivante des résultats 
précédents: dans le mouvement brownien à p dimensions, i est presque sûr 
que, pour n'importe quel arc de trajectoire, la plus pelite hypersurface convexe 
qui le contienne à son intérieur a un plan tangent bien défini en lout point et 


LE MOUVEMENT BROWNIEN PLAN. 7" 509 


qui varie d’une manière continue, mais ne comporte aucune parlie courbe et 
est constituée par une infinité dénombrable de faces planes. On peut donc, 
dans l’espace considéré, faire varier lorientation d’une variété linéaire à deux 
dimensions sans que pour aucune d’elles le mouvement projeté sur cette yariété 
mette en défaut les propriétés du mouvement brownien que nous venons 
obtenir. Ce résultat va beaucoup plus loin que celui qui consiste à dire que, 
pour chaque arc de chaque courbe C considéré isolément, elles ont une proba- ° 
bilité égale à l’unité. 

4. La notion d’oscillation brownienre. 1°. Nous nous placerons 
(Vabord dans le cas du mouvement linéaire, et supposerons que ¢ varie de zéro 
à un. Supposons cet intervalle divisé en un grand nombre  d’intervalles 
partiels dont le plus grand ait une longueur très petite en; désignons par At 
un quelconque de ces intervalles, par AX la variation correspondante de X(t), 
ct posons 
(26) a ba = X (AtL)?, B, = 2 (AX y’. 

On a évidemment 


E{Bn} = SAÏ = 1 

E{ (Ba — 1)?} = XE{[ (AX)? — At]?} = 3b, — 20, + bn 
et par suite 
(27) E{ (Ba —-1)?} = 2b, = 23At Max At = Ren. 


Si donc on fait varier le mode de division de Pintervalle (0,1) en intervalles 
partiels de manière que e» tende vers zéro, il y a convergence en moyenne 
quadratique de By vers l'unité; donc aussi convergence en probabilité. 

Nous allons compléter ce résultat par l’étude de cas où il y a convergence 
presque sûre. 

Désignons par B’p la valeur de B, lorsque l'intervalle (0,1) est divisé 
en 2” intervalles égaux et montrons d’abord que: B’, tend presque sûrement 
vers Punité. 

C’est une conséquence immédiate de la formule (27), qui s’écrit, dans le 
cas considéré 

E{ (Bp — 1)*} = 217. 
L’inégalité de Tchebycheff donne alors 
Pr{| Bop —1| 29/2} < 2/P, 


et, cette expression étant le terme général d’une série convergente, il existe 
presque sûrement une valeur de p à partir de laquelle on a 


e 
| B'a — 1 | < p72”, 


ce qui démontre et précise le résultat annoncé, 
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Il serait facile, en ne faisant que des raisonnements très simples, de 
généraliser ce résultat. Nous allons établir un théorème plus général, qui 
comprend toutes ces généralisations presque évidentes. 


2°. Considérons une suite de valeurs tı, a, ' >, lv, de t, comprises 
entre zéro et un, et formant un ensemble partout dense dans l’intervalle (0,1). 
Les #— 1 premiers nombres ty définissent une division de cet intervalle en n 
intervalles partiels, à laquelle nous associerons comme tout à l’heure la somme 
non aléatoire ba et la somme aléatoire By; bx, au plus égal au plus se des 
intervalles partiels, tend vers zéro pour n infini. 


Taéondire 5. Ona 
Pr{ lim B, = 1} = 1. 


noo 
Pour le démontrer, observons que, quand n augmente d’une unité, la 
variation de la somme b» provient d’un seul terme, que nous désignerons par 
tn7, qui se trouve remplacé par une somme ta? + Tr? = Th? — Btn Ta”. On 
en déduit 


(28) by — Das = Sr tn”, 
et, pour Ba, on a de même 
(29) B, S Bası cee En n”, 


Éx et é” étant les accroiseements de X (t) dans deux intervalles contigus, de 


longueurs respectives tn’ et tx”. Comme ils sont indépendants, on a 


(30) l ElEn En} mO, Cén En) = talon” 


‘Proposons nous antenin d'étudier Heseiarion de By aed n varie 
dans un 1 intervalle (p,q); nous poserons 


Tp = Max | Ba — Ba |. 
pSn<q 


Nous considérerons d’abord des probabilités, que nous: désignerons par 
des lettres accentuées (Pr ou €’), évaluées en supposant connus les termes 
. de Bg, c’est-à-dire que l’on connaît les valeurs absolues des accroissements AX . 
dont les carrés interviennent dans By, mais non leurs signes. D’après (29), - 
les ÉRRRRAUORE successives de Bo, Ba-», * * +, Bp dépendent des signes de 
Én En” * pour les valeurs q —1t,qg—2,:--,p de n. Ces signes sont indépen- 
dants; pour tout n inférieur à q, | én’ | et | éx” | étant connus, les deux signes 
possibles pour é'é” sont également probables; le choix d’un signé détermine 
| én | = | é +é” |, et Pon se retrouve dans les mêmes conditions pour 
déterminer | &: | par un nouveau groupement de termes. 
` Nous sommes ainsi dans les conditions voulues pour appliquer une 
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inégalité connue de A. Kolmogoroff, ou du moins son extension au cas de 
certaines sommes de variables enchaînées que nous avons indiquée antérieure- 
ment (Var. aléatoires, pp. AO , et il vient 

PH (Dye 955 5 Elesin a 
Or Pr et € sik respectivement les valeurs probables de Pr et €. Compte 
tenu des formules (28) et (80), il vient 


(31) PT ng <È S (bp eu < = À bp 
Posons nain 
T, = lim To T = lim Ty 
| q>% p00 | 
Ces limites, finies ou infinies, existent presque sûrement, à cause du carac- 
tère monotone de Ty, ‘et, bp tendant vers zéro pour p infini, on déduit de 
Vinégalité (31) 
(32) PT, = SA by. PT Emo. 


Comme cela est vrai quelque petit que soit «, il est presque sûr que T = 0, 
c’est-à-dire que B, a une limite B. Comme enfin il y a convergence en: 
probabilité vers Vunité, ona B=], c.q. f.d. 

Naturellement, comme dans tous les énoncés de cette nature, il y a con- 
vergence uniforme de la suite des Ba, sauf dans des cas de probabilité inférieure 
à un nombre arbitrairement petit. On peut en effet, d’après (32), déterminer 
pa (pour h =1,2,-:-) de manière que : 





` . t f 
o PT EE} S hbo < -h 
et par suite - | l . , 
| 1 1 . . 
Pre < b Ta go Tag Di 3 lan 


Comme, dans les cas de convergence vers Punité, Zn entraîne | Bu —1| Sfr 
le résultat énoncé est bien établi. - 

On remarque aussi que la-convergence obtenue est indépendanté du choix 
des ty, si l’on assujettit ce choix à la seule condition que en (donc aussi by») 
soit borné supérieurement ae une fonction donnog ‘den qui tende vers zéro 
pour ninfini > > ' s 


Be: Introduisons maintenant, le hasard dans le choix des tn. Nous. s sup- 
poserons ces nombres choisis guccessivement d’après des lois qui peuvent n'être 
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pas indépendantes les unes des autres; mais, pour chaque n, la loi à n variables 
ti, ta, * >, t est bien déterminée et indépendante des expériences qui détermi- 
nent Y(t); de plus ces lois doivent être telles que en tende presque sûrement 
vers Zéro. Tel sera le cas si, par exemple, pour n'importe quel intervalle At, la 
_ probabilité que #, soit dans cet intervalle a une borne inférieure indépendante 
de tı, ta," "ln, et qui soit le terme général d’une série divergente. 

La suite des B, dépend alors de deux séries d'expériences indépendantes 
Pune de l’autre. La première a pour objet de déterminer la suite des tn, et 
Jon sait (cf. Var. aléaloires, p. 22) qu’il est en tout cas possible de faire ` 
correspondre les différentes suites possibles aux différentes valeurs d’une 
variable T comprise entre zéro et un, et cela de manière que la probabilité de 
n'importe quel ensemble de suites possibles soit égale à la mesure de. l’ensemble 
des valeurs de T qui leur correspondent (si cet ensemble n’est pas mesurable, 
la probabilité est indéterminée entre une probabilité intérieure et une proba- ` 
bilité extérieure, égales respectivement à la mesure intérieure et à la mesure 
extérieure de cet ensemble) ; chacun des tẹ est une fonction mesurable de T. 
D’une manière analogue, on peut représenter par une variable unique U 
l’ensemble des choix qui déterminent successivement X (1), X(4), X(4), 
X(4),:::, et par suite toutes les fonctions #,({) du théorème 1; X(t) ‘est 
la limite presque sûre de X, (t), la convergence étant uniforme (en tet U) 
en dehors d’un ensemble de valeurs de U de mesure arbitrairement petite. 

Désignons par Æ l’ensemble des points T, U, du carré OS T=1, 
0 = U =1,.pour lesquels on ait limB,—1; par Æ l’ensemble complé- 
mentaire. Nous allons montrer que 


| THÉORÈME 6. L'ensemble F est mesurable et de mesure nulle. 


Si l’on admet que F est mesurable; la démonstration est immédiate: il est 
presque sûr que lime, == 0, et que cela entraîne lim B,=—1, En d’autres 
termes, sauf pour des valeurs de ¢ constituant un ensemble de mesure nulle, 
ensemble des points de Æ” situé sur la droite T — t a une mesure linéaire 
nulle. Donc Æ” a une mesure superficielle nulle. 

T reste à montrer que E est mesurable. On sait que la probabilité de la 
convergence d’une suite de variables aléatoires, et celle de sa convergence 
vers une limite donnée, sont toujours bien déterminées. Pour montrer que ce 
théorème est ici applicable, il faut montrer, non seulement que chaque inégalité 
Ba < B a une probabilité déterminée, c’est-à-dire que B, est une fonction 
mesurable du point T, U, mais qu’il ex est de même de toute combinaison en 
nombre fini d’inégalités de cette forme. e Ce second résultat est d’ailleurs une 
conséquence évidente, non du premier résultat considéré isolément, mais de ce 
résultat et du fait qu’une même représentation du résultat des experiences sur 
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le plan des 7’, U permet d’étudier toutes les fonctions B»; on sait en effet que, 
dans ce plan, la partie commune à plusieurs ensembles mesurables est un 
ensemble mesurable. On est donc ramené à démontrer que chaqnda fonction Br 
est une fonction mesurable des point T, U. 


Nous démontrerons un résultat plus général, qui aura ‘las loin une autrè 


application: st p(æ,%a,° * `, 2) est une fonction continue de l’ensemble g 
ses n arguments, l'expression 
(33) D = pI (t1), X (ta), ` x 


est une fonction mesurable du point T, U. 

La démonstration est immédiate, en utilisant la définition de X(t) comme | 
limite des approximations #,(t). En remplaçant X(t) par X,(t), ® se trouve 
remplacé par une expression ®y qui est une fonction continue de ti, te, ° *" ; tn 
et des 2 quantités X(h/2) qui interviennent dans la détermination de 
X(t). C’est une fonction continue d'un nombre fini de fonctions mesurables 
de T ou de U, donc du point T, U; cest donc une fonction mesurable de ce 
point. | 

I suffit donc de montrer que $, tend en mesure vers $, pour y infini; 
c’est-à-dire que eet d étant arbitrairsment petits, on peut HR y tel 
que, pone v > v, on ait 


Prffd—æ|>é}<e 
Or on peut d’abord déterminer M tel que 
Pr{Max | X(t)| > M} < à 


Les nombres X(t,),X(tz),- © °, X (tn), X(t), Xv(te),- © <, Xv(tn) étant 
ainsi bornés (en dehors de cas de probabilité inférieure à ¢/2), on n’a à con- 
sidérer qu’une région où la fonction ¢(#1,22,: * ', 2a) est uniformément 
continue: si done chacun des X (ta) — Xy(h)(h==1,2,:::,n) ne dépasse 
pas en valeur absolue un certain module de continuité y—#(é), on a 
| #— p| Se. Or nous avons vu qu’en négligeant des cas de probabilité. 
inférieure à un nombre arbitrairement petit (nous prendrons ici e/2), 
| X(t) —Xv(t)]| peut, pour tout ¢ entre zéro et un et tous les cas non négligés, 
être rendu inférieur à un nombre arbitrairement petit (ici 1); il suffit que v 
soit assez grand. Dans ces conditions on a bien | —&, | <<, sauf dans les: 
cas négligés dont la probabilité totale eet inférieure à «/2 + 2 — e, C. q. Ê d. 


CoRoLLAIRE. La partie Ba(t) de la somme B, qui ‘dépend des valeurs 
de X(u) dans l'intervalle (0,t) tend presque sûrement, pour n infin, vers 
` B(t) =t et cela uniformément quand t varie de zéro à un. 
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Le théorème précédent s’applique évidemment pour chaque valeur de t 
comme pour la valeur unt? En considérant alors un ensemble dénombrable 
de valeurs t'y de ¢, partout dense entre zéro et un, il y a convergence presque 
sûre de chacun des B,(v’”) vers B(t,); en effet, pour chaque t», il n’y a 
divergence que dans des cas dont la probabilité est nulle; la réunion de tous 
ces cas @ encore une probabilité nulle. En dehors de ces cas, il y a en tous Ies 
points ¢’, convergence de la fonction monotone B,(t) vers la limite B(t) 
continue, done uniformément continue, dans l'intervalle fermé (0,1). On 
sait qu’il y a alors convergence uniforme dans tout l'intervalle, c. q. £. d. 

4°, Arrivong maintenant à une application du théorème de Fubini, qui 


- est fondamentale. Désignons par F l’ensemble du plan des T, U, intérieur 


au carré 0 ST <1, 0< US 1, et correspondant à l’ensemble des cas où il 
y a convergence uniforme de B,({) vers ¢ dans l'intervalle (0,1). On peut 
indifféremment calculer sa mesure. en intégrant par rapport à T la mesure 


` linéaire de sa section par une droite U = const., ou en faisant l’inverse. C’est 


par la première méthode de calcul que nous avons déterminé cette mesure, et : 
montré que le complément de F est de mesure nulle. L’interversion de Pordre 
des intégrations nous donne immédiatement un résultat important. Pour 
Vénoncer simplement, nous dirons qu'une fonction X(t) est un modèle de 
mouvement brownien linéaire si, la suite des tn étant choisie au hasard, on 
obtient avec une probabilité unité une suite de fonctions Ba(t) ayant une 
limite non aléatoire B(t), fonction continue et croissante de t; pour chaque 
intervalle At, la variation AB(t) sera la mesure de Pareille boit brownienne. 
La conséquence annoncée du fait que l'ensemble, F ait pour mesure Pane 
#énonce alors ainsi: 


Taéorème 7. Le schéma stochastique du mouvement brownien linéaire 
réalise avec une probabilité unité un modèle de mouvement brownten linéatre ; 
de eae B(t) =t. 


Quelques remarques sont nécessaires pour bien comprendre le définition : 


` qui précède. Il est d’abord évident que, pour une fonction donnée X(t), il 


peut arriver que B,(t) ait une limite presque sûre autre que ż. Cette limite 
est nécessairement une fonction non décroissante de t Si elle est constante 


‘dans un intervalle, c’est que la fonction X(t) n’y est pas assez irréguliére pour 


pouvoir donner une idée du mouvement brownien. Il est peut-être aussi 
possible, si X(¢) a au voisinage d’un point une allure trop irrégulière, que 


19 On remarque d’ailleurs qu’en raison de l'indépendance. des oscillations de X(#) 
dans les deux intervalles (0,t) et (#,1), il ne peut avoir convergence presque sûre 
de B, = B, (1) vers une limite que si B aft) et By —B, (e) ont séparément des limites 
presque sûres. 
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B(t) y soit discontinu. C’est pour cela que nous avons supposé la fonction 
B(t) continue et croissante, et il est évident que dans ce cas il n’y a qu’à 
prendre cette fonction comme nouveau paramètre pour être ramené au cas où 
B(t) =t. 

I faut alors prendre garde que ce changement de paramètre modifie la 
loi de probabilité dont dépend le choix des én; c’est avec la loi de probabilité 
ainsi transformée que l’on pourra considérer la nouvelle fonction X(¢) comme 
un modèle de mouvement brownien linéaire pour lequel on ait B (t) = t. 

L peut être utile de préciser la loi dont dépend le choix des t» de manière 
que la définition de ce que nous appelons un modèle ne dépende d’aucun 
élément arbitraire. Le plus simple est de supposer que chaque t» soit choisi 
indépendamment des autres, et avec une probabilité uniformément répartie de 
zéro à un. On peut montrer que: la notion de modèle de mouvement brownien 
linéaire ainsi obtenue n’est pas changée si l’on remplace cette loi de répartition 
uniforme par une autre lot absolument continue pour laquelle la densité, de 
probabilité soit comprise entre deux nombres positifs. 

Nous n’indiquerons que le principe de la démonstration. Même si l’on 
suppose seulement que B,(1) tend presque sûrement vers B(1), il en résulte 
que, pour tout ¢ compris entre 0 et 1, Ba(t) a presque sûrement une limite 
B(t); autrement les oscillations de Ba(t), qui dépendraient du choix des 
points de division entre 0 et ¢ plus que de leur fréquence, ne seraient pas 
presque sûrement compensées par celles de Bn(1) — BA(t), qui dépendent 
des points de division choisis entre # et 1. 

Il en résulte évidemment que l’on peut augmenter dans un rapport 
déterminé la probabilité d’un des intervalles (0,4) et (¢,1) et diminuer en 
nn celle de Pautre; cela ne peut pas empêcher que B,(f) et 

Bx(1) — Bn (t) tendent respectivement, et presque sûrement, vers B(t) et 
B(1) — B(t); donc B,(1) vers B(1). 

On peut raisonner de la même manière pour n’importe quelle division 
de l'intervalle (0,1) en intervalles partiels, et un passage à la. limite facile 
conduit au résultat énoncé.. 

Par suite, même si l’on précise la définition de modèle de mouvement de 
brownien linéaire par la condition que pour le choix de chaque ¢ la probabilité 
soit répartie d’une manière uniforme, si l’on trouve pour B,(¢) une limite 
presque sûre T — B(t), pourvu que tous les rapports .AT/At soient compris 
entre deux nombres positifs, le changement de variable qui consiste à prendre 
T comme nouvelle Robe est légitime. On est ainsi ramené au cas où 
B®) =t. 


5°. La convergence de Ba = B,(1) vers B-=B(1) est bien entendu 
presque sûre, mais non sûre. Désignons par 8, la borne inférieure de Ba, et 
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par 8, sa borne supérieure, quand on fait varier les points de division. On a, 
au sujet de ces nombres, les résultats suivants: 


THÉORÈME 8. Pour n'importe quel modèle de mouvement brownien, By 
tend vers zéro, pour n infini. 


THÉORÈME 9. Pour la fonction aléatoire X(t) du schéma du mouvement 
brownien linéaire, il esi presque sûr que Bn augmente indéfiniment avec n, 


Le premier de ces théorèmes résulte de ce qu’un modèle de mouvement 
brownien linéaire est nécessairement une fonction continue X(t). On peut 
alors, si n est assez grand, définir entre X (0) et X (1) une suite de nombres 
croissants 21, Ta, * °°, En, telle que la somme des carrés des intervalles ainsi 
séparés soit arbitrairement petite, puis définir entre zéro et un des nombres 
croissants ti, tz * +, tn- tels que (ty) = zv (væ 1,2, : :,nm—1). On 
obtient ainsi pour Bn une valeur arbitrairement petite, c. q. f. d. 

On voit même aisément qu’on peut prendre pour les {, n'importe quel 
ensemble dénombrable et partout dense entre zéro et un, donné d’avance; il 
euft de les ranger dans un ordre convenable pour que B, tende vers zéro (ou 
vers n'importe quelle valeur donnée entre zéro et B). 

Pour démontrer le théorème 9, observons que, si les points de division 
sont assez nombreux, les valeurs des accroissements AX se répartissant suivant 
leur probabilité théorique, on aura avec une probabilité supérieure à 1 — «/2 
des intervalles de longueur totale supérieure à #0 pour lesquelles 
(AX)? > 2cAt (c étant arbitrairement grand, e arbitrairement petit, et k 
déterminé en fonction de c). Conservant ces intervalles, et subdivisant les 
autres, on arriver a de nouveau à trouver une fraction supérieure à k de la 
longueur de chacun d’eux pour laquelle on aura, pour les nouveaux intervalles 
obtenus, (AX)? > 2cAt, et cela en exceptant des cas de probabilité totale 
inférieure à e/4. Prenons alors pour p un entier tel que (1— k)’ < $. Après 
p opérations analogues, sauf dans des cas de probabilité inférieure à 


f2+f/4+- +ew<e, 


on aura obtenu une division de l’intervalle (0,1) en intervalles partiels pour 
laquelle plus de la moitié de la longueur totale sera constituée par des inter- 
valles partiels tels que (AX)? > 2cAt; done X(AX)° > c, ce qui démontre le 
théorème 9. 

Le résultat ainsi obtenu pour la fonction aléatoire X (t) n’est pas, comme 
dans le cas du théorème 8, applicable à tous les modèles de mouvement 
brownien linéaire. On peut définir de têls modèles pour lesquels on a toujours 
(AX)? = cAt, donc Ba Sc, c étant une constante suffisamment grande. 
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Prenons maintenant pour n une fonction lentement croissante n(h) d’un 
entier À (par exemple la partie entière de log log log h), et supposons qu’à 
chaque valeur de h on fasse correspondre n — 1 points de division choisis au 
hasard, d’où résultera une valeur de Bwen, — B'a. Le grand nombre d’expéri- 
ences ainsi faites pour une même valeur de n conduira à trouver, pour By, 
des nombres remplissant l'intervalle (Bn, Bn), et, si n(h) croît assez lentement, 
la suite des B'r aura presque sûrement pour valeurs limites tous les nombres 
de l'intervalle (0, 8), £ étant la limite de By. 

D’après cette remarque, même s’il est possible de généraliser le résultat 
obtenu au sujet de la convergence presque sûre de B, vers B, on ne peut pas 
Pappliquer sans aucune restriction relative au choix des modes de division de 
l'intervalle (0, 1) successivement considérés. Nous ne savons pas, notamment, 
s’il serait suffisant que le nombre des points de division soit constanrment 
croissant pour que la convergence de B, vers B soit presque sûre. 


6°. Lrexistence de fonctions qui soient des modèles de mouvement 
brownien linéaire n’était pas évidente a priori. Au point de vue idéaliste, elle 
résulte du théorème 7. Mais ce théorème ne nous donne aucun moyen de 
nommer une telle fonction; c’est ce que nous allons faire maintenant. 

Pour cela nous nous inspirerons de ce que fait le hasard ; nous chercherons 
à limiter. M. Borel a montré, qu’on ne peut pas, d’une manière générale, 
imiter le hasard ; si l’on imite certains caractères d’une suite de nombres choisis 
au hasard, on en omet nécessairement d’autres. Mais si l’on porte son atten- 
tion sur certaines conditions bien déterminées (ici celles qui interviennent dans 
la démonstration du théorème 7), on peut, à ce point de vue spécial, imiter le 
hasard. | 

Nous prendrons d’abord, comme modèle d’une suite de nombres 
Tis Ta,’ ` `, En, * `, Choisis au hasard entre zéro et un, la suite des parties 
fractionnaires des nombres n/log n. Ele présente ce caractère que, pourvu 
que (n’—n)/logn augmente indéfiniment avec n, les æy d'indices compris 
entre n et n’ se répartissent uniformément entre zéro et un, la fréquence de 
ceux qui sont compris entre zéro et n’importe quel nombre z compris entre 
zéro et un tendant nécessairement vers v. On imite le hasard en ce qui con- 
cerne l’uniformité de sa répartition; mais on ne l’imite pas dans ses caprices; 
une suite de nombres effectivement choisis au hasard ne serait pas constituée 
par des suites partielles de nombres croissant régulièrement de 0 à 1. 

L’uniformité de la répartition serait encore mieux réalisée si l’on prenait 
an au lieu de n/logn (il suffirait qe n’—n augmente indéfiniment pour 
obtenir une répartition uniforme des termes d’indices compris entre n et n’). 
Mais il y aurait entre £a et £e» (qui serait la partie fractionnaire de 24,) une 
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corrélation que nous évitons en prenant pour £a la partie fractionnaire de 
n/log n; il n’y a alors aucune corrélation entre +, et Ten ni, plus généralement, 
entre £y et Zw, pour n’ = 2n, 

Posons alors Za = F (én), F(E) désignant la fonction de répartition de 
la loi de Gauss’ la suite des é sera un modèle de suite de variables gaussiennes 
choisies au hasard. Or, d’après le 8 1, la détermination successive des nombres 


(1), 4($), X(4), IH, (4), X(#) 7, 


qui aboutit à la détermination de X(t), dépend de variables gaussiennes 
choisies successivement. Il suffit de choisir la suite des é qui vient d’être 
définie pour obtenir un modèle de mouvement brownien linéaire; nous nous 
coenen d’indiquer ce résultat sans démontration. 


| 7°. L’extension des résultats précédents au cas du mouvement Wiwa 
plan est immédiate. Nous désignerons ici par Al la longueur de la corde 

A(t) A(t + At), et ferons correspondre à chaque Den polygonele an , côtés 

inscrite dans Parc 4(0)A(1) la somme . Le 


(34). By = 3(41)? — 3[ (4X)? + (A7). 


Les résultats obtenus pour le mouvement brownien linéaire s'appliquant 
séparément à X(t) et Y(t), on voit qu'ici Ba tend vers 2 [ou vers 2t si Pon 
considère Pare A(0)A(+#)] sous les conditions qui, dans le cas du mouvement 
linéaire, assurent la convergence vers 1 (ou vers ¢). La limite obtenue paura 
toujours être appelée mesure de l’oscillaiion browntenne (plane). 

On peut aussi se proposer de définir des modèles de mouvement brownien 
plan. Il faut d’abord supposer qu’il y ait, pour chacune des sommes X (AX)? 
et 3(AY)? relatives à chaque intervalle (0, t), convergence presque sûre vers t. 
Mais cette condition est trop peu restrictive; elle pourrait être réalisée en 
prenant Y(t) — X(t). Le mouvement serait rectiligne, et donnerait une bien 
mauvaise idée du mouvement brownien plan. Il- est alors indiqué d’ajouter 
une condition d'indépendance de X(t) et Y(t); ce sera que XAXAY tende 
vers zéro, dans les conditions indiquées à propos de la convergence de %(AX)? 
vers l’unité [ou vers ¢, s’il s’agit de Parc A(0)A(#)]. Cette condition, équiva- 
lant à celle que la mesure de l’agitation brownienne linéaire soit toujours bien 
définie et ait la méme valeur en projection sur n’importe quelle droite du plan, 
est évidemment réalisée, avec une probabilité unité, dans le schéma He 
du mouvement brownien plan. | 

L'extension du théorème 9 au mouvement brownien plan est évidente. Il 
n’en est pas de même du théorème 8. La somme (34) n’est en effet très petite 
que si les deux termes X(AX)* et X(AY)? sont tous les deux très petits. Or, 
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gi dau d’eux peut être rendu très petit, il west pas évident qu’ils peuvent 
être rendus simultanément très petits. Nous ne Eroa que signaler ici aie 
question. 


5.  L’aire limitée par la courbe €. 1°, Etudiant maintenant te mouve- 
ment plan, nous désignerons par S(t) Paire comprise entre Pare A(0) A(t) 
et sa corde, les conventions de signes étant celles'que l’on fait pour représenter 
une aire par une intégrale curviligne étendue à son contour; nous écrirons 9 
au lieu de S(1). 

L'intégrale qui représente l’aire ne sera pas une ie au sens de 
Riemann, mais une intégrale stochastique, analogue à certains points de vue 
à l'intégrale B du §4. Elle pourra être définie, non comme limite sûre d’une 
somme, mais comme limite en probabilité, ou en moyenne quadratique, ou 
encore comme limite presque sûre. . | 

Comme pour l’étude de Ba, nous allons commencer par un cas simple en- 
considérant les lignes polygonales L’n, inscrites dans Pare A(0)A(1), ayant 
chacune pour sommets les points de -cotes multiples 2". Désignons par S's 
Paire comprise entre L’y-et I’n.:. Elle est la somme de 2* triangles, dont les 
aires sont des variables aléatoires indépendantes les unes; nous préciserons 
plus loin la loi dont elles dépendent; il suffit d’observer ici qu’elles ont une” 
valeur probable nulle, celle de leurs carrés étant 1/22##, On en déduit 


E{Sn}=— 0, ESR} — 1/28, 
La série Xa, qui représente S, est donc convergente en moyenne qua- 
_dratique. Quoique ces termes ne soient pas indépendants, il est facile d'établir 


sa convergence presque sûre. On peut, par exemple, utiliser l'inégalité de 
Tchebycheff, qui donne 


Pr{| S'a | > 1/209} < 1/22. 
Cette probabilité étant le terma d’une série convergente, il existe presque 
sûrement un nombre N tel que, pour n > N, on ait 
| Fa S 1/2 (+614 
ce qui établit le résultat annoncé. 


2°. Considérons maintenant, comme au 2° du §4, une suite de nom- 
` bres £, f2,° °°, Én, ' © *, compris entre zéro et un et formant un ensemble 
partout dense dans vet intervalle. Nous désignerons par La» la ligne brisée 
allant de A(0) à A(1) et ayant cgmme sommets intermédiaires les points : 
A,, Ag, ©, An [en écrivant As au lieu de A (tx) |, rangés dans l’ordre des ¢ 
croissants. Nous désignerons par S» Paire comprise entre La et la corde 
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A(0)A(1), et par Ta la différence Sus1 — Sn, qui est Paire d’un triangle ayant 
pour sommet An et pour base un côté 4,4,” de La. Nous désignerons sa 
longueur par ln, par ty’ et ta” les cotes de À,’ et Ax”, et poserons 


“at 
Tho ty _ ty’; Ta = tn = te’, Ta” = tn” = Én. 


Si €,_, désigne une valeur probable calculée en connaissant Ai, Ae,**-. An, 





Toma 
Ena{Tn} = 0, Exa{Tn?} = 2, 
4th 
( 3 5 ) š Tn Tn Ta Ta” 
E{Tn?} = ee EU} = —Q— = F (bn — Dass), 


bn ayant la niéme signification qu’au $ 4, de sorte que la formule (28) est 
toujours applicable. D’aprés la premiére de ces formules, on peut appliquer 
Pinégalité de A. Kolmogoroff à la somme XT%. Tl vient ainsi 


yi : ese (ae = = 
Pr{ ee | Sansa Sn | =a V bn bnp} =e 


et, en faisant augmenter p indéfiniment 


(36) Pr{Max | Sv—-8,| =e} Se (=: Vis, à) ; 

von © € 
Comme bn tend vers zéro pour n infini, e et é peuvent être rendus simultané- 
ment arbitrairement petits. La convergence presque sûre de la suite des Sp 
en résulte. 


3°. Montrons maintenant que, si l’on remplace la suite des ¢ par une 
autre suite analogue i, fe,- © : ,,: © +, les deux expressions S et § successive- 
ment obtenues pour Paire étudiée sont presque sûrement les mêmes. Il suffit 
évidemment de montrer que les aires polygonales Sw et S, dont elles sont les 
limites sont infiniment peu différentes en probabilité, et pour cela de montrer 
qu’elles sont Pune et l’autre infiniment peu différentes en probabilité de Paire 
Sa” limitée par la ligne polygonale inscrite dans C ayant pour sommets tous 
les points de cotes tı, ta: © -, tir, Fi, fe, fn- Pour Sn, par exemple, cela 
résulte de la formule (36), qui s’applique évidemment à tout mode de sub- 
division de Vare A(0)A(1) commençant par les points de cotes ti, te, :*", tut. 

On peut aussi, en utilisant les formules (35), montrer que les moyennes 
quadratiques de Sa — Sn” et Sn — Sa” sont infiniment petites. | 


11 Au point de vue de l'uniformité de la®convergeuce par rapport au choix de la 
suite des t,, il faut noter que e et e’ ne dépendent que de b,, lui-même borné supérieure- 
ment par e, [formule (27)]. z 
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4°. Introduisons maintenant le hasard dans le choix des ty. Les 
raisonnements étant identiques à ceux faits à propos des Ba, nous ne ferons 
qu’en. rappeler les grandes lignes. C’est pour n’avoir pas à les recommencer 
que nous avons introduit à propos de l’étude de Ba expression générale (33), 
dont B, n’était qu’une forme particulière; il faut seulement noter que Sn 
dépend des deux fonctions aléatoires X (t) et Y(t) ; mais cela ne change rien 
au raisonnement fait à propos de l’expression (33), et le résultat obtenu à cet 
endroit s'applique à Sn. Nous n’avons donc pas à craindre que nos raisonne- 
ménts introduisent des ensembles non mesurables ou des probabilités non 
déterminées. La probabilité de la convergence de Sn vers S est bien déterminée, 
et il importe ‘peu, pour la calculer, qu’on fasse d’abord les expériences qui 
déterminent les tn, puis celles qui déterminent C, ou l’inverse. Or nous savons 
que, pourvu qu’il soit presque sûr que la suite des ty est partout dense, il est 
presque sûr que S» tend vers S. En disant que, pour une courbe C déterminée, 
Paire S est stochastiquement définie si, les points t» étant choisis au hasard, 
8, tend presque sûrement vers une limite non aléatoire S, nous voyons que: 


THÉORÈMR 10. Le schéma aléatoire du mouvement brownien plan con- 
duit, avec une probabilité unité, à une courbe O pour laquelle l'aire S est 
stochastiquement bien définie? 


5°. La question se pose naturellement de déterminer la loi dont dépend 
8. Nous traiterons d’abord un problème plus élémentaire: déterminer la lot 
dont dépend l'aire d’un triangle inscrit dans C, ses sommets ayant des cotes 
données. 


Nous désignerons ces cotes par t— r, t,t ++”. L'aire est évidemment 
de la forme #V 7/7” T, la nature de la variable aléatoire T étant indépendante 
de t, r’ et +”. Si À désigne la longueur d’un vecteur gaussien réduit, et si y 
est une variable gaussienne réduite, la longueur du côté A(t—+)A(t) du 
triangle étudié est de la forme AV r’, et celle de la hauteur perpendiculaire à 

.ce côté est nV r”; X et y sont indépendants, et T == An. 
Calculons les moments de la variable aléatoire T. On a évidemment 


87 j Espn = E(TIP4} = EA] E {qr} — 0, 
(37) À Bay = E{XP}E(q??} — 20 (p+ 1): 1-3-5- (2p—1) = (2p) |, 


13 Bien entendu, laire S n’est définie que stochastiquement. On peut développer à 
ce sujet des remarques analogues à celles qui nous ont conduit au théorème 8. Le 
résultat est que © a presque sûrement If propriété suivante: on peut définir la suite 
des t, de manière que 8, ne tende pas vers 8, et même de manière que 8, ait n nimporte 

quelle limite donnée, finie ou infinie. | 
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et par suite 


(38) DE) = E (0?) = $ (at — (21 <1). 


1 
1+2? 
Le fait qu’on trouve pour (2) une série entière à rayon de convergence 
positif suffit, comme on sait, pour être assuré que la fonction caractéristique 
de la loi étudiée est bien celle définie sur tout laxe réel par le prolongement 
analytique de cette fonction. Il s’agit done de la première loi de Laplace, 
c’est-à-dire de la loi symétrique définie par 


(39) Pr{| T | > x} = 6%, (z > 0). 


6°. Etudions maintenant la loi dont dépend S(1) ; à cause de la simili- 
tude stochastique entre la courbe C et ses parties, S(t)/t dépend de la même 
‘loi. Cette variable aléatoire étant en corrélation avec la longueur L(t) —avVi 
de la corde A(0)A(t), nous étudierons la fonction de répartition à deux 
variables, évidemment indépendante de t 


F(a, p) = Pr{9(t) < at, L(t) < pVb. 

Nous commencerons par admettre que les trois premiéres dérivées de cette 
fonction sont définies et continues, sauf peut-être pour « == 0 ; cette hypothèse 
sera justifiée plus loin. Il résulte d’autre part de la manière dont S a été défini 
comme somme d’une série qui converge en moyenne PA (3:5; 1°), 
que ses deux premiers moments sont finis. 

Nous désignerons par ydi et »Vdt les deux composantes de 
A(t)A(t+ dt) suivant la direction A(0)A(¢) et la direction perpendiculaire ; 
é et n sont deux variables gaussiennes réduites, indépendantes l’une de l’autre, 
et indépendantes de Pare A(0)A(t), et par suite de S(t) et L(t). De 


(L + 8L)? = (L + éV dt)? + ndt 
[en écrivant L au lieu de L(t)], on déduit 


(40) SL == EV dt + wat + o(dt), 

tandis que la variation de 9 = S(t) est évidemment 

(41) | - W= ia Vdi + S,(dt), 

Sı(dt) dépendant de la même loi que S(dt), donc que di§(1); cette aire a 


sa valeur probable nulle, et est stochastiquement indépendante de Varec 
A(0)A(#), donc de S(t) et L(t), mais non de é et y. 
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~ „Pow former une équation vérifiée par la fonction F(a, p), nous allons 
calculer de deux manières différentes la probabilité 


LA 


(42) Pm Pr(S + 8 < ai, L-4- 8L < VO. 


Une première évaluation de P repose sur la. dus que, F (a, p) ne 
dépendant, pes. du ne on a 


! 


Pr{S + 58 < a(t + dt), L. + aL. < p Vt + dt 7 Fa pi) 
En définissant as a et pr par, les formules 


a(t + dt) — at, AVE dep Vi, 
où Pon tire - ae TAE 
a — & UD (dt 0), 
il vient nn ts fe 
(48) P=F(a,p,) =—F(a, p) —a $ Pi or ‘Fyt 0( db) 

La scouts manière d’évaleur P repose sur les expressions (9) et (41) 
de ôL et 89, qui montrent que HRDRANDRES, 


as $. ome “bv dt — a) (e T. ydi ` 
= Pr} s<(a ne à Joker) Vi 
est de la forme P<+o(dt). Pour évaluer P;, nous poserons 


- à | HE dt, 


Rpt | Me i | 





et re par Ps, 1, au lieu de P;, une probabilité Guiot évaluée en 
supposant connus é, ņ, et 8: (dt) ; P; est sa valeur probable. On a évidemment 


Pm f me Ea À = De 
-fi [P,a — À Ep, t E pro +R E Ja 


| K| ayant une borne supérieure, fonction de a’ et p’ seulement. Comme 


| Ef} = 0, Ei) == 1, Em |} < ©, : 
il vient Pe e 7 7 


Pa E(P) EPE eee MP ala AJ o(d). 
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` Tenant compte Pautre part de 


d 
F(a, p) = F(a, p) — SAG) p, ~(¢ eV + tr 
+e Fre + Ko(dt), 


1 - 
où |K, | est borné par un polynome en |é{,[n{| et | S:(dt)/t| dont la 
. valeur probable est finie, et où l’expression désignée par o(dt) n’est pas 
aléatoire, il vient 


di dt, dr à 
Pera P+ GP Ot fi Eala AJAA + o (di). 


Comme P, = P--o(dt), la comparaison de cette formule et de la 
formule (43) donne | 


(4)  2aP, Heir + f°X TT elada, 
d’où, en dérivant par rapport à p et posant Fp = G, 
(45) : (: + +) G+ RaGa + =) CA $ Ga” e + Fo m 0. 


7°, Tæéonèws 11, F(a,p) est la seule solution de l'équation (44) 
qua soit une fonction de répartition. 


L’équation (44) exprime en effet que 8(t)/t et L(t)/ a dépendent 
d’une loi à deux variables indépendantes de t. Si, pour ‘t= to > 0, on 
prenait une loi initiale quelconque, la variation de 9 et celle de L étant ensuite 
définies par les formules (40) et (41), on aurait, au lieu de-F(a, p), une 
fonction F(&, p, t)i vérifiant une équation analogue à l’équation (44), mais 
où il y aurait un second membre 2¢/":. Une solution de cette équation étant 
bien déterminée par sa valeur initiale (pour t= to), l'équation (44) exprime 
bien la condition nécessaire et suffisante pour que la loi de Pre considérée 
ne varie pas avec t. 

| Supposons alors qu’on ait deux solutions différentes F, et F; de l'équation 
(44), qui soient des fonctions de répartition. On peut leur faire correspondre 
deux systèmes de variables aléatoires Sı, Lı et S2, La qui, pour deux courbes 
C1 et C2: dépendant de processus convenablement déterminés pour ¢ variant de 
zéro à un, représentent l’aire comprise entre l’arc A (0) A (1) et sa corde, et la 
longueur de cette corde; le système Sı, L dépendra de la loi définie par F3; 
Ia, La de celle définie par Fa 
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Déplacons maintenant la figure sur laquelle est tracée une de ces courbes, 
de manière que, pour les deux courbes considérées, A (1) ait la même position, 
et A(1)A(0) la même orientation; les deux origines A» et A; des deux courbes 
seront ainsi sur une même demi-droite issue de leur extrémité commune A(1). 
Faisant ensuite varier ¢ à partir de la valeur 1, on prolongera O, et C3 par la 
même courbe C dépendant du schéma stochastique du mouvement brownien. 
Alors le système Si(t)/t, La(t)/VE ne cessera pas de dépendre de la loi 
définie par F,; le système S.(t)/t, Le(t)/ Vt, dépendra de même de celle 
définie par F2. 

Or | LZi—Le| est Boe supérieurement par la distance Ada, et 
R(S2— S1}/A142 représente la distance du point A(t) à 4,42; cest une 
variable gaussienne de paramètre Vt— 1. Il en résulte qu’asymptotiquement, 
pour ¢ infini, les deux systèmes de variables considérés sont confondus; en 
termes précis, les différences 2 
Sı — 8, Lı — d: 

oa Vt 
tendent en probabilité vers zéro. Ils dépendent done, à la limite, de la même 
loi de probabilité; done F, = FF, c. q. f. d. 


8°. Quoique les équations (44) et (45) résolvent théoriquement le 
problème posé, nous n’avons pas pu obtenir l’expression explicite de F(a, p) ; 
peut-être n’existe-t-il aucune expression simple de cette fonction. Dans ces 
conditions il peut être utile d’indiquer d’autres méthodes qui permettent 
d'étudier la variable aléatoire 9 et sa correlation avec L. Dans ce qui suit, 
Let S ne désigneront plus L(t) et S(t), mais L(1) et S(1). 

On peut d’abord calculer les moments de la loi à deux variables L et 8.¥ 
Le moment 








2 


E{LPS1} = Tyg 


étant évidemment nul si g est impair, il suffit de calculer ceux dont le second 
indice est pair. Le calcul repose sur la formule (41), formule exacte où l’on 
peut remplacer ¢ et dt par un. En désignant par L et L, les longueurs des 
cordes A(0)A(1) et A(1)A(2), et par ¢ leur angle, qui est une variable 
choisie au hasard entre —m et + ~, cette formule prend la forme 


(46): S(2) — 8 + Sı + $ LL, sing. 


18 Tous ces moments sont finis; cela résulte évidemment de 
- Bpa © t Bay 4 + Bo,2g)r 
et de ce que, comme nous le verrons plus loin, la protabilite des grandes valeurs de | 8 j 
décrott comme une exponentielle. 







4 


N 
p CENTRAL a 





526 > M. PAUL LÉVY. 


- Or $ d’une part, L et S d’autre part, Lı et S, en dernier lieu, constituent 
des groupes de variables indépendants les uns des autres; la loi à deux variables ` 
L et S.est la même que celle dont dépend Ln, Sa et, par un changement d’unité,. 
détermine celle dont dépend le système S(2),L(2). On a ainsi 


az E{S? (2) } = 4B os = E{S? + 8,3 + 4 LL? sin? $} > 
T nn IE (eir $} = Eon + 4, 

ot par suite | 

(47) Eos = E(P) — à, Ets (i) =f. 


On détermine ensuite Eza en ut EL (2)8(2)} à Paide de la 
formule (46) etde +: , 
(46°) : Le (2) a7 oe Ls 7 ALL cos $, 


‘puis Eh, 4, et ainsi de suite. Tous les moments sont ainsi biens: sans: difficulté, 

mais successivement, le calcul dépendant chaque fois de moments antérieure- 
. ment calculés. C’est donc une méthode de rence, et l’expression générale 
de ces moments peut être difficile à obtenir. 


9°, Une autre méthode repose sur les remarques sur Pinterpolation faites. | 
plus haut ($3,1°). La détermination de Parc 4(0)'4 (1) résulte de la déter- : 
. mination de vecteurs gaussiens indépendants, qui définissent ‘successivement 
; A(1), puis A(4), A(4), A(4), et ainsi de suite. Désignons par Co, Xo(t), | 
Fo(t), So cë que deviennent respectivement la courbe © et les grandeurs ` 
X(t), Y(t) et 8, quand on remplace par zéro la longueur du premier de ces 
vecteurs, sans changer les autres. Si Pon a orienté laxe des x parallslement à 
AOA on a évidemment 


Y(t) — Y(t), XO = Hol) + (0<t<1), 2 
- et par suite 7 NES 
| (48), ron i BAK) = 8 LL = f ¥ (Hdt. 


Comme Sp et R 1 ne , dépendent, que de’ Vorientation du premier des vecteurs. 
| sucċessivement choisis, et des vecteurs suivants, l’ensemble des variables So et. 
I, est indépendant de L. Dans ces conditions la formule (48) définit bier la. 
nature de la corrélation entre L et S; elle montre notamment que le moment 
conditionnel €’{S?}, calculé dans Viypothése L = À, est un polynome de degré 
p en À (évidemment nul si p est i impair, et pair si p est pair). 

Pour préciser ces renseignements, on, peut chercher à définir les lois dont 
dépendent I, et So et la corrélation entié"ces variables. Y(t) étant une somme 
. de termes gaussiens ‘de la forme nV dt, a IL est de Ía forme . S | 
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1 = 
q(1— t) Vdi 
les étant liés par la relation 


1 za 
Y,=F(1) =f nVdi —0. 
0 


Sans cette relation, la loi à deux variables Y, et 7, serait une loi de Gauss, 
bien déterminée par ses moments du second ordre 


E(t} — f° (1—4)2dt = 4, nr =f Gerad ES nt, 


Par suite, dans l'hypothèse Y, = 0, I, est de la forme m/2 V3, m étant une 
variable gausstenne réduite. Le produit ILL est alors de la forme T/2V3, 
T dépendant de la loi définie par la formule (39). 

our l'étude de la loi à deux variables Z:, So, on peut former une équation 
aux dérivées partielles analogue à l’équation (45) relative aux variables L et S. 
On peut aussi calculer ses moments. Observons seulement que, quand Y(t) 
et par suite J, sont connus, So dépend d’une loi symétrique; on a donc, si p 
est impair i 
(49) E{S6P1:9} = 0. 


On déduit ensuite de la formule (48) 
E{S?} == E{SS?} + + E{L?}E{L*}, 


et par suite 
id 5 
, i a a ee a 
(49) E{8o7} = PET) 2 Dé 


(On remarque que €{1.?}, calculé AR Y, =0, ala valeur 1/12, 
et non 1/3). 


10°. La dernière des méthodes que nous voulons indiquer repose sur la 
remarque que la loi étudiée peut être définie comme limite de celle dont dépend 
Paire S, comprise entre une ligne polygonale Z, inserite dans Pare A(0)A(1), 
et la corde de cet arc. Cela est bien évident, puisque Sn» tend en probabilité 
vers S(1). Or S, est une somme de n triangles A(0)A(t)A(¢+ dt) (t et 
t + dt désignant les cotes de deux sommets consécutifs de La); il suffit donc 
d'étudier la loi dont dépend cette somme. 


us 
16 La manière la plus simple de calculer le coefficient numérique ava est sans doute 
$ . 
de remarquer que €{[  [¥,(#) + t¥,]dt}?} a la valeur % obtenue pour €{1,°} 


0 
quand Y,—Y(1) n’est pas supposé connu. 
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Désignons toujours par éV dt et y V dé les composantes de A(t) A(t + dt) 
suivant la direction A(0)A(t) et la direction perpendiculaire, et considérons 
la loi conditionnelle dont dépend Sn, et à la limite S, lorsqu’on connaît les é 
et les | y |, et par suite toutes les valeurs successivement prises par L(t). 
L’aire Sa se présente sous la forme d’une somme #3 + L(t)| y | Vdt, les 
signes seuls étant indéterminés, et indépendants les uns des autres. En 
négligeant des cas très peu probables, le plus grand de ces termes est très petit 
par rapport à la somme (la vérification ne présente aucune difficulté). Il en 
résulte que la loi ‘conditionnelle limite obtenue pour & est la loi de Gauss, 
c’est-à-dire que J est de la formé of, o désignant la valeur quadratique moyenne 
de S, pour la loi conditionnelle; ¢ est une variable gaussienne réduite; elle est 
donc indépendante de o, et l’on est'ramené à étudier la loi dont epeng 


Pexpression 
dei 3 PTE +f I(t) dt, 


la convergence considérée étant une convergence en probabilité. 
Or 407 est la somme des deux intégrales 


T= [za t= f ¥?(t) dt, 
o 0 


indépendantes l’une de l’autre, et dépendant d’une même loi; on est ramené 
à étudier cette loi. 

On peut appliquer à l’étude de la loi à deux variables J et X — Æ(1) 
des méthodes analogues à celles appliquées à l’étude de 8 et L. Indiquons 
seulement sans démonstration qu’en posant 


2 P(X < a, J < p} — H(a,8), 


on obtient l’équation aux dérivées partielles 
(50) ` H+ aH’. + (48 — 207) Hp + He = 0, 


qui joue un rôle analogue à celui de Péquation (45), mais qui est du type 
parabolique. On peut montrer, ici encore, qu’elle détermine complètement la 
loi étudiée. 

On peut aussi calculer successivement les différents onde de la loi à 
deux variables J et X, et aussi montrer que la corrélation entre J et X est du 
second degré, c’est-à-dire que. 


J = Jo 421X + F, 
Io et Jo étant indépendants de X. 
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L’étude de J est ainsi analogue à celle de S, mais à certains points de vue 
plus simple ; on remarque que J est une intégrale de type classique, bien qu’elle 
dépende d’une fonction aléatoire; de plus elle ne dépend que de la seule fonc- 
tion X (t). Malgré cela la fonetion H ne nous a pas plus que G paru susceptible 
d’avoir une expression simple. 

La formule obtenue 


(51) S = o$ (407 == J + Jao > 0) 


wen est pas moins susceptible de donner des renseignements utiles sur la 
-nature de la variable aléatiore 9. Elle montre notamment que la fonction de 
répartition de & est continue et indéfiniment dérivable, sauf peut-être pour la 
valeur zéro de la variable; la loi qu’elle définit est symétrique. Ce sont en 
effet des propriétés qui sont nécessairement vérifiées par un produit de deux 
facteurs indépendants, si elles sont vérifiées par un des facteurs (ici £). 


11°. Nous allons indiquer une autre conséquence de.la formule. (51): 
au point de vue de l’ordre de grandeur de la probabilité des grandes valeurs de 
| S |, la lot dont dépend S est comparable à la première loi de Laplace. 


Majorons d’abord la probabilité des grandes valeurs deo. On a 
Pr{o > 8} S Pr{Max (J, J1) > 28} S2Pr{J > 287} 
< 2Pr{ Max | Z (t)] > sV2} < 4Pr(Max X(t) > V2), 
t ` l 
c’est-à-dire ; 
Pr{o > 8} S 8Pr{X > sV2}. 


Par suite, si eV? > 1, on a, pour s assez grand 
(52) Pr{o > 8} S eP? == Pr{crd > 8}, 


À désignant toujours la longueur d’un vecteur gaussien réduit. 

Or une inégalité de cette forme subsiste si l’on multiplie à la fois o et À 
par la variable | £ |, non négative, et indépendante d’elles. Elle exprime en 
effet qu’on peut établir entre o et À une corrélation telle que l’on ait toujours 
o< cà. On en déduit 


(53) Pr{|8|>s}SPr{e|T|>s3}  (cV2>1), 


T — M dépendant de la première loi de Laplace, comme nous l’avons vu plus 
haut. 

Pour obtenir au contraire ung borne supérieure du premier membre, 
remarquons que dans la formule (48), lorsque L et Y(t) (done I:) sont 
connus; X(t), et par suite So, dépendent de lois symétriques. Il y a donc 
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une chance sur deux pour que So soit du signe de J;, et que par suite on ait 
ll eo On en déduit 





Pr{| 8 | >) = HP I >= HP; > de 


et par suite, pour c assez petit (2¢V8 <1) et s assez grand 
(54) Pr{| S|>s}2Pr{¢|T|>s} | (2cV3 <1). 


I] est alors s probable qu’on peut déterminer une constante absolue (comprise 
entre 1/V2 et 1/2 V3 ou égale à un de ces nòmbres) telle que la formule 
(58) s'applique pour c > ¢ etla formule (54) pour c < c. 


12°. . Nous allons maintenant établir le résultat annoncé plus haut con- 
cernant l'existence et la continuité des dérivées de F(a, p). Nous n’avons pu 
` y arriver que par l’utilisation simultanée des différentes méthodes d’étude de 
cette fonction exposée ci dessus; mais nous ne serions pas surpris qu’il existe 
une démonstration plus simple. | 

Utilisons d’abord la formule (48) ‘où, quand Y(t) et par suite I, sont 
connus, So dépend évidemment d’une loi continue [et même'à fonction de 
répartition indéfiniment dérivable; on le voit aisément en: étudiant l'influence 
d’un des paramètres indépendants qui définissent X(t), par exemple Xo(4)]. 
I en résulte qu’il ne peut pas y avoir de relation linéaire entre 8, et I, qui 
ait une -probabilité positive d’être réalisée.. La probabilité conditionelle de 
g < a, quand L a une valeur connue p, est donc une fonction continue de « et p; - 
comme on obtient évidemment F(«, p) en multipliant cette probabilité con- 
ditionnelle par pe-/*dp, qui est la probabilité dé l'intervalle do, et en 
intégrant par rapport à p, il en résulte que la dérivée G — F5 existe, et, est 
une fonction continue de p; au facteur pe*/* près, elle représente la FRE 
concientelié de S <a, dans Vhypothése L — p. 

: Utilisons maintenant la formule (51). L'hypothèse L = p ne mädifie 
nae le‘fait que, dans cette formule, £ soit une variable gaussienne indépendante 
_ deo. La probabilité conditionelle de S <a, évaluée dans cette hypothèse, ` 
“est donc aussi, comme la probabilité non conditionnelle de la même inégalité, 

une fonction continue et indéfiniment dérivable de a, sauf peut-être pour, 
a—=0; G, et par suite F, sont done aussi indéfiniment re par 
a poet à. a, 

La formule’ (48), 5 een tenu de la remarque faite tout à Puente sur la 
loi dont dépend Sy quand I est ponnn, dénne aisément une autre démonstration 
de'ce résultat, , NOÉ 

“‘Réeportons nous maintenant au rañtonnéinent : par + lequel hous ayons: établi : 
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l'équation (44) ; nous allons montrer que la continuité de F et G et de toutes 
leurs dérivées par rapport à æ sont des hypothèses suffisantes pour que ce 
raisonnement subsiste, avec quelques modifications. La dérivée Fp? a d'abord 
été introduite par le développement de l’expression 


dt jdt Edt 1, 
F(ap—6 4) - =F (a, p) ENS Pot Py p+ o(di). 
Si Pon n’est pas assuré a priori de son existence, il résulte seulement de la 
comparaison des deux expressions obtenues pour P et P, = P-+-o(dt) que 


ef rare rc 0 +6 Er.) 


est de la forme kdt/8t + o(dt), k = k(a, p), qui ne dépend que des valeurs 
de F(a, À) pour À très voisin de p, étant ce qu’on peut appeler une dérivée 
seconde généralisée. On peut alors écrire l'équation (44), à condition de 
remplacer F” p? par k. 

D’après cette équation, k est une fonction continue de « et p. La différence 


(ae) =F (ap) — f” kla, À) (p—A) ar 


a une dérivée seconde généralisée nulle. Il en résulte qu’elle est linéaire en p. 
Si en effet il wen était pas ainsi, on pourrait trouver une valeur po de p telle 
que la courbe représentative de la fonction K (pour æ constant) soit intérieure 
à une parabole qui la touche au point d’abscisse po. L'expression 


K(a,p) — K (a, po) — (p — po) K’h(&, po) 


serait done de signe constant et supérieure en valeur absolue à c(p —.po)*/2, 
c étant positif ; la dérivée seconde généralisée apparaîtrait done, d’après sa 
définition, comme une valeur moyenne d’une quantité de signe constant et 
supérieure à c en valeur absolue, et ne pourrait pas être nulle, comme nous 
Vavions supposé. : La fonction K(«, p) est donc linéaire en p, et F admet une 
dérivée seconde F”p?, égale à k. 

I s’agit d’ailleurs d’un résultat général: st une Fr f(x) admet une 
dérivée seconde généralisée, définie pour chaque point x par une formule du type 


Pæ) = dim m{ Zetre aro], 


M désignant une moyenne pondérée {par rapport à k), et les petites valeurs 
de h intervenant seules à la limite, "et si f(x) continu, c est une dérivée 
seconde au sens ordinaire. DE l AE at cee ay 8 
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Revenons à léquation (44), dont l’exactitude est maintenant établie. 
Tous les termes autres que F” p étant dérivables par rapport à p, ce terme Pest 
aussi. T’équation (45) en résulte, et, comme elle est du type elliptique, les 
fonctions F(a, p) et G(a,p) sont continues el indéfiniment dérivables, saut 
peut-être pour «= 0. | 

Le résultat subsiste d’ailleurs pour a==0. Si l’on se reporte à la formule 
(51), on peut remarquer que, si l’on remplaçait o par la longueur d’un vecteur 
gaussien, le produit of dépendrait de la première loi de Laplace, pour laquelle 
la dérivée seconde de la fonction de répartition est discontinue à l’origine. 
Mais il suffit, pour écarter la possibilité d’une telle discontinuité, de montrer 
que, dans le cas qui nous occupe, les petites valeurs de o sont trés peu probables. 
Cela résulte aisément de la définition de 407, somme de ‘termes tous positifs; 
la probabilité qu’ils soient tous très petits est excessivement petite (et cela 
aussi dans l’hypothèse où Z est supposé connu). 

Nous laisserons au lecteur de soin de préciser ce raisonnement, ce qui 
peut être fait de deux manières différentes. On peut montrer que, quel que 
soit c positif, on a | 

Pr{o < s} = 0(s°) (s— 0), 


et en déduire directement le résultat annoncé pour le produit of. On peut 
aussi établir seulement le résultat annoncé pour c == 2, et en déduire la con- 
tinuité des dérivées qui figurent dans les équations (44) et (45). En raison 
du type elliptique de cette dernière équation, cela: suffit pour conclure que 
G(a, p) est holomorphe pour toutes les valeurs réelles de æ et toutes les valeurs 
positives de p. 


6. La mesure superficielle de la courbe C. L'objet de ce paragraphe 
est de démontrer le théorème suivant. 


THEOREME 12. La courbe O est un ensemble de pointe dont la mesure 
superficielle esb presque sûrement nulle. 


Le 1° de ce paragraphe est consacré à un résultat préliminaire ; le 2° con- 
tient la démonstration du théorème 12. Le 3° et le 4° contiennent des remarques 
qui nous semblent de nature à faire comprendre, mieux peut-être que la 
démonstration, la véritable nature de ce théorème, et en tout cas préparent les 
généralisations qui seront l’objet du paragraphe suivant; le 3° contient en 
outre un théorème important par lui-même; il donne une condition nécessaire 
pour qu’une courbe remplisse une aire. 


1°. IT suffit de considérer un arc ‘(0)A(t) de la courbe. C. Comme 
c’est un ensemble fermé, il a une mesure superficielle bien déterminée p(t). 
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Nous nous proposons d’abord de démontrer que p == p(1) est une vartable 
aléatoire [donc aussi »(¢)]. En d’autres termes, u est une fonction mesurable 
de la variable U dont le choix au hasard entre zéro et un équivaut au choix 
de C, la notion de probabilité équivalant à la mesure sur l’axe des U. 

La démonstration repose sur ce que p est la limite en probabilité d’une 
suite de variables aléatoires; on sait qu’une telle limite est une variable 
aléatoire. 

Désignons à cet effet par w(p) la mesure de l’ensemble des points 
intérieurs à l’un au moins des n +1 cercles de centres A(0),A(1/n), 
A(2/n),- ::,4A(1), et de même rayon p. (C’est évidemment une variable 
aléatoire; nous allons montrer que, p tendant vers zéro, si n est une fonction 
de p de croissance assez rapide, ya(p) tend en probabilité vers p;7* il en 
résulteral que » est une variable aléatoire. 

D’une part pa(p) est borné supérieurement, d’une manière non aléatoire, 
par #(p), mesure du lieu des points M dont la distance 8(4f) à Pare A(0)A(1) 
ne dépasse pas p. Or, en vertu d’une propriété connue des ensembles bornés 
et fermés, ä(p) tend vers » quand p tend vers zéro. ` 

Désignons d’autre part par ly la longueur du ème côté de la ligne 
polygonale A(0)A(1/n)A(2/n)- - : A(1), et par pa le plus grand de ces 
côtés. On a évidemment . 


Pr{pa > p} S ¥Pr{ly > p} = ner"? = e(p) 


et il suffit que n croisse assez rapidement quand p tend vers zéro pour que e(p) 
tende vers zéro. Or, quand pa =p, l’ensemble des n -+ 1 cercles considérés 
contient la courbe à son intérieur, et w(p) = p; il en est donc ainsi sauf dans 
des cas dont la probabilité est au plus égale à é(p). 

Finalement, w(P) est au moins égal à p, sauf dans des cas dont la 
probabilité tend vers zéro, et dans tous les cas au plus égal à z(p), qui tend 
vers p; la convergence en probabilité de (Pb) vers p est ainsi établie. 

On démontre de la même manière que, pour tout point M, 8(M), distance 
de ce point à la courbe, est une variable aléatoire; la probabilité que 6(Af) — 0, 
c’est-à-dire que M soit sur la courbe, est aussi bien déterminée, et est une 
fonction mesurable ¢(M) du point M. Ces fonctions §(M) et b(M) sont en 
effet limites de celles obtenues en remplaçant la courbe par l’ensemble des 
cercles considérés dans le raisonnement précédent. 


16 La condition que n croisse assez rapidement joue un rôle essentiel dans la 
démonstration du texte. Mais, une foisele théorème 12 établi, on constate aisément 
qu’elle est inutile; #,(#) tend en probabilité vers # quand p tend vers zéro, et cela 
d’une manière uniforme par rapport à n. ` 
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2°. Pour démontrer que » est presque sûrement nul, observons d’abord 
que sa valeur probable m est finie. Cela résulte aisément de ce que 


Pre) < Pr( Max [Æ()|, | F(@)| > n <a? 2 f “eedu: | 


‘Par suite, d’après le caractère d’homogénéité stochastique ‘de la courbe, 
-les mesures des arcs A(0)A(1), A(1)A(2), et A(0)A(2), arcs que nous 
désignerons respectivement par Cı, O2 et C’, ont respectivement pour valeurs 
probables m, m, et 2m. Comme on a évidemment: 


| E{mes 0’} = E{mes Cx} + Efmes Cs} — E{mes CC} 
(C10: étant l’ensemble des points communs à C, et C2), il en résulte que ` 
W = E{mes CCa} = 0, 


c’est-à-dire que la mesure de CC; est presque sûrement nulle. , 
Nous allons en déduire que p est presque sûrement nul. Observons à cet 

effet qu’on ne change rien à » en supposant A(1) connu et en construisant Ci 
_et Cz on partant de ce point: ce sont deux déterminations indépendantes l’une 
de Vautre et dépendant de la même loi de probabilité. Les probabilités qu’un 
point M.appartienne à Ci, Ca, et C02 sont alors (M), ¢(M) et ¢*(M), 


et l’on a 
JS $ (f) dedy, K= f f PAD iey, 


(les intégrations étant étendues à tout le plan) ; # = 0 entraîne donc p = 0; 

Pun et Pautre équivalent à: (M) est presque partout nul. 

- Le théorème 12 est ainsi démontré. On remarque le rôle que joue, dans 
‘la dernière partie du raisonnement, l’indépendance stochastique de C, et Cz 
[une fois le point A (1) connu]. Il importe d’avoir ce point présent à Pesprit 
pour des extensions que nous indiquerons plus NE sans reprendre tout le 
raisonnement. 

Observons d’autre part que (M ), qui est seal une fonction u(r) 
de la distance r du point M au point A (1), est non seulement presque partout 
nul, mais est nul pour tout r positif. Pour le démontrer, il suffit de démontrer 
que c’est une fonction non croissante de r; cela résulte évidemment de la 
similitude stochastique des arcs A(0)A(1) et.A(1— k?) A (L); d’après cette 
similitude, un point situé à la distance kr du point A(1) a la probabilité y(r) 
apatis au second de ces arcs, et par suite, si & < 1, une probabilité 
y(kr) = y(r) dappartenir au premier. On a donc bien #(kr) = y(r), dong 
y(r) == 0 pour tout r positif. 
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Quant au point A(1) lui-même, il est évidemment sur la courbe; mais 
on voit aisément que la probabilité qu’il soit double est nulle. Quoi qu’il y 
ait une infinité non dénombrable de points doubles, les points doubles sont sur 
C des points exceptionnels ; leurs cotes constituent un ensemble de mesure nulle. 


3°. Indiquons maintenant une condition nécessaire pour qu’une courbe 
remplisse un ensemble de mesure superficielle positive. 


THÉORÈME 13. Pour qu'une courbe continue remplisse un ensemble de 
mesure superficielle positive m, il faut que, pour des points de division con- 
venablement choisis, mats arbitrairement denses sur la courbe, la somme 


(34) Ba = 3(Al)? 


soit au moins égale à cm, c étant une constante (supérieure à 3V 83/7; nous 
ne connaissons pas la meilleure valeur possible pour cette constante). 


Quand nous disons que les points de division sont arbitrairement denses, 
nous voulons dire que, quelle que soit la suite de nombres croissants entre 
zéro et un, to = 0, di, ta, - +, tx = 1, il y a au moins un des point considérés 
dans chacun des arcs A (tr-1) A(t»). 

Désignons par pa la mesure superficielle de cet arc A (ta-1)4(ta). Ona 
évidemment Syr = m. 

Or si un ensemble fermé a une mesure superficielle ur, on peut trouver 
dans cet ensemble deux points dont la distance soit au moins égale au diamètre 
An du cercle d’aire yx. On vérifie en effet facilement qu’un contour dont 
aucune corde n’atteint cette longueur ne peut pas entourer une aire égale ou 
supérieure à pm. 

Soit donc A(tx)A(tx”) une corde de Pare A (t1) A(t) de longueur au 
moins égale à Ax; on peut supposer ty’ < tx”. Pour la ligne polygonale 


ACOA (AADA) A(t”) : + A(1) 


inscrite dans C, la somme (34) est au moins égale à 
seta Sen 
ku m 


ce qui établit le théorème 13, sauf en ce qui concerne la valeur de la constante. 

On peut améliorer la valeur de cette constante, et en même temps obtenir 
un autre résultat d’un certain intérêt, en mettant en évidence trois points 
A(ty’)A(tx”)A(tx’”) de chaque arc A(tr:1)A(t»). On voit aisément qu’on 
peut toujours les choisir de maniéte que l’aire du triangle dont ils sont les 
sommets soit au moins égale à c'u (C = 3V3/4x) ; dans le cas d’une aire 
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circulaire, cette valeur représente laire du triangle équilatéral inscrit, et ne 

peut pas être dépassée; en dehors des cas des aires circulaires elliptiques, elle — 

peut sûrement être dépassée. 
Les triangles 


AAAG), A(MAL)A(), ACHAT) AGE), 


forment alors une chaîne de triangles inscrits, analogue à Paire 8’, du 8 5, 1°, 
et dont Vaire totale est au moins c’Sua = cm. Il s’agit, bien entendu, de la: 
somme des aires de ces triangles, prises en valeur absolue. L’eristence d’une 
telle chaîne pour laquelle cette somme soit au moins égale à c'm est donc une 
condition nécessaire pour que la courbe i un ensemble de mesure 
superficielle au moins égale à m. 
Considérons alors la ligne polygonale inscrite dans C ayant pour sommets | 


-. tous ceux de ces triangles. Dans un triangle, la somme des carrés de deux 


côtés est au moins égale à quatre fois la surface. La somme B, relative à 
cette ligne polygonale est donc au moins égale à 4 c’m; on obtient ainsi la con- 
stante 40 = 3 V 3/r indiquée dans l’énoncé du théorème 18. Il est d’ailleurs 
évident qu’elle n’est pas la plus grande valeur possible pour la constante c de 
ce théorème. Il serait intéressant de déterminer cette valeur maxima; il ne 
nous a pas paru au contraire utile d’allonger les raisonnements pour obtenir 
une valeur un peu plus grande que 3\/3/z, mais qui ne serait pas la valeur 
maxima. . 
~ _Indiquons d’autre part sans démonstration que, si l’on introduit le hasard 
dans le choix des points de division comme nous Pavons fait pour définir la 
notion d’oscillation brownienne, au moins ‘pour une représentation para- 
métrique convenable de la’ courbe étudiée (la probabilité étant mesurée par la 
variation du paramètre), et pour les valeurs assez petites de c, on a 


lim sup Pr{B, = cm?} = a > 0. 
N->CO $ 


Les modes de division qui réalisent la condition By = cm? n'apparaissent 
, donc pas comme exceptionnels. La probabilité & ne peut que croître quand 
on prend pour c des valeurs de plus en plus petites; maïs il n’est pas du tout 
certain qu’elle varie d’une manière continue; on peut se demander si l’on 
-West pas en présence d’un de ces cas, tréquents dans la théorie des probabilités: 
dénombrables, où la probabilité ne peut pas être comprise entre zéro et un; 
elle passerait brusquement d’une de ces valeurs à l’autre pour une valeur 
`” déterminée de c: 

Revenant aux résultats établis d’une manière sûre, nous voyons qu’il a 
dégage deux idées. L’une c’est que: Faire totale des chaînes de triangles 
inscrits analogues à Vatre S’n du $ 5,1° est en quelque sorte une approxima- 
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tion de la mesure superficielle de la.courbe ; pour que la courbe puisse remplir 
une aire, il faut qu’elle ne soit pas très petite. D’autre part il faut que, aù 
moins pour une représentation paramétrique convenable, la longueur AJ de la 
corde A(t) A(t + dt) soit en général de l’ordre de grandeur de V dt ou plus 
grande, ce qui entraîne cette conséquence indépendante de la représentation 
paramétrique que By n’est pas très petit; st Ba n’est pas très petit, la courbe 
fait assez de détours infiniment petits pour pouvoir remplir une aire, et les 
grandes valeurs prises par Ba, pour n infini, mesurent assez bien ce qu'on 
pourrait appeler la possibilité pour la courbe de remplir une aire. 


4°. Introduisons maintenant une nouvelle idée. Nous considérons 

spécialement les courbes pour lesquelles, comme pour le mouvement brownien, 

. la corde de l’are décrit pendant le temps dt est de l’ordre de grandeur de V dt, 
de sorte que les sommes Ba relatives à un-are fini ne sont, ni très petites, ni 
très grandes. La courbe fait ainsi exactement assez de détours infiniment 
petits pour pouvoir remplir une aire. Mais.ce n’est qu’une condition néces- 
gaire, non suffisante: pour que la courbe remplisse exactement une aire, il faut 
de plus une organisation de ces détours infiniment petits que le hasard wa 
aucune chance de produire. Seule une loi mathématique précise peut guider 
le cheminement du point mobile dans des zônes déjà en grande partie 
recouvertes de manière qu’il ne se déplace que dans les vides, et finisse par les 

remplir. Les courbes To et T, dont nous parlerons au PARAARES suivant 
donnent dés exemples de cette circonstance. 

Lorsque le hasard joue un rôle suffisant, si la courbe comporte assez de 
détours infiniment petits pour remplir une aire m, on doit donc s’attendre à 
ce qu’elle remplisse seulement une aire m’ = m/k < m, les différentes parties 
de cette aire étant en moyenne remplies k fois (k > 1); si k est fini, m’ est 
positif; si k est infini, m’ est nul. D’après le théorème 12, c’est la seconde 
circonstance qui est réalisée. Nous allons présenter quelques remarques qui 
pourraient conduire à une nouvelle démonstration, mais qui, sous la forme 
résumée que nous leur donneront, ne sont que des raisons intuitives assez 
sérieuses de croire que & est infini. | 

A cause de la similitude stochastique des différents arcs do courbe, au lieu 
d’examiner un même arc à des échelles de plüs en plus petites, nous pouvons 
examiner des arcs de plus en plus grands à une échelle déterminée. Cela nous 
conduit par exemple à étudier la ligne brisée A(0)A(1)A(2) : : : A(n)::: 
indéfiniment prolongée; nous supposerons les sommets de cette ligne marqués 
sur une feuille de papier quadrillé, les côtés des carrés du quadrillage étant - 
égaux à l’unité de longueur ou un peu plus petits que cette unité, de manière 
que deux points consécutifs A(n) et A(n + 1) aient des chances appréciables 


6 
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de ne pas étre dans le méme carré. Montrons d’abord qu’il y a une probabilité 
unité pour que le carré du quadrillage qui contient A (0) contienne une infinité 
d’autres sommets de cette ligne brisée. 

I suffit à cet effet de montrer que, quel que soit A(n) supposé connu, 
on peut déterminer N de manière que l’un au moins des points A(n +1), 
A(n-+2),---,A(n-+N) soit dans le carré du quadrillage qui contient 
A(0), et cela dans des cas dont la probabilité ne soit pas très petite. En effet, 
sauf dans des cas peu probables, ces V points sont à une distance de A(n) ne 
dépassant pas cV N, c étant une constante convenablement déterminée. Ils 
ne peuvent donc se répartir qu'entre des carrés du quadrillage dont le nombre 
ne dépasse pas (cV N + V2)?, soit sensiblement rc?N, et, pour chacun de 
ces carrés on peut borner inférieurement la probabilité qu’il contienne un de 
ces points (comme il s’agit de principes bien connus, nous n’insistons pas sur 
les détails de la démonstration). Il suffit alors que cV N dépasse la distance 
A(0)A(n) pour que cette conclusion s’applique au carré du quadrillage con- 
tenant A(0); il y aura une probabilité supérieure à un nombre fixe « qu’il 
contienne un point A(v) d’indice compris entre n et n + N. 

- On peut alors sûrement déterminer une suite d’entiers croissants 
aa, "Ta," *, tels que, une fois les na premiers points A(v) connus, il 
y ait une probabilité supérieure à « qu’un des Na = na — na suivants soit 
dans le carré du quadrillage qui contient ‘4 (0) ; Nx, dépendant du point A (na), 
est aléatoire, mais sûrement borné. On sait que, dans ces conditions, il est 
presque sûr que l’on obtiendra indéfiniment des points A(y) situés dans le 
carré qui contient A (0).16 

Le même résultat s’applique naturellement à n’importe quel carré du 
quadrillage, et l’on en conclut aisément que l’ensemble des points A(n) forme 
presque sûrement un ensemble partout dense dans le plan; il en est de même, 
a fortiori, de la ligne polygonale ayant ces points pour sommets, et de la courbe 
O elle même. 

On peut alors se représenter de la manière suivante Paspect de cette ligne 
polygonale limitée à ses n premiers côtés, n étant grand. La plus grande 
distance de deux de ses points sera de l’ordre de grandeur de Vn, et elle ne 
recouvrira certainement pas avec une grande densité la plus petite région 
convexe qui l’entoure; il y aura des vides, et il y aura des parties de cette 
région À où la ligne considérée ne passe qu’une fois ou un petit nombre de fois. 
Mais le fait que les remarques précédentes s’appliquent à n’importe quel carré 


- 19 Les remarques que nous venons @exposer dans les deux derniers alinéas repro- 
duisent.& peu près des considérations exposées par M. G. Pólya dans une conférence 
faite au Colloque sur les principes du calcul des probabilités tenu à Genève en octobre 
1937 et publiée chez Hermann. 
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du quadrillage intérieur à cette région et dont on sait qu’il contient au moins 
un sommet À (v),*" prouve que la plupart des carrés qui contiennent un sommet 
en contiennent un grand nombre. Si alors, pour avoir une idée de l’aire 
recouverte, on considère, soit l’ensemble des carrés du quadrillage contenant 
au moins un sommet, soit la chaîne des triangles A (%v) À (Ry + 1) A (2v +2), 
on voit que laire ainsi définie ne sera pas, comme on pourrait s’y attendre à 
première vue, de l’ordre de grandeur de n; elle sera petite par rapport à n, 
et composée en grande partie de régions recouvertes un grand nombre de fois. 
Il en résulte nécessairement qu’il y a des vides, dont les plus grands seront 
une fraction non négligeable de la région R,’ et qui seraient seulement 
recouverts par le prolongement de la ligne polygonale étudiée au delà de ses 
n premiers côtés. 

Utilisons maintenant la similitude stochastique des différents ares de la 
courbe C. Les résultats précédents peuvent s'appliquer à l’étude de la ligne 


A(0)A(1/n)A(2/n)  : - A(1), 


pour une valeur très grande de n. Nous trouvons d’abord un résultat qui 
rejoint les remarques du § 3, 5°, in fine: la courbe n’atteint un point, en 
général, qu'après avoir passé près de lui un grand nombre dé fois; la distance 
A(t) A(t + Tr), quand r tend vers zéro, est en général de l’ordre de grandeur 
de Vr; mais elle est parfois plus grande et parfois plus petite, ce qui donne 
à la courbe l’aspect d’une succession de bouches de plus en plus petites et de 
plus en plus voisines du point A(t); à une échelle excessivement petite, on 
pourra voir le point A (t + 7) approcher de A({), puis s’en éloigner, et cela 
un grand nombre de fois avant que la distance A(#)A(t ++) cesse d’être 
appréciable. 

D’autre part, en ce qui concerne Paire, nous voyons qu’une chaîne de 
triangles inscrits comme celle désignée au $ 5 par S'n, bien que la somme des 
aires de ces triangles prises en valeur absolue ait pour n infini une limite 
positive, ne recouvre qu’une aire de plus en plus petite, mais recouverte un 
nombre de fois de plus en plus grand. Cette aire pouvant être considérée 
comme une approximation de celle recouverte par la courbe, on est conduit à 
conclure que la mesure superficielle de la courbe est nulle. Une extension 
convenable du théorème 13 permettrait de rendre ce raisonnement rigoureux, 


17 La nécessité de cette restriction est évidente: un point pris au hasard dans R 
est À une distance de À (0) qui est de l’ordre de grandeur de Vn. Ce n’est donc qu'après 
avoir placé un nombre de sommets A(x) grand par rapport à n qu’on a une grande 
probabilité d'en trouver un qui soit voisin du point donné. 

. Il faut bien en effet que pour un point pris au hasard dans les régions vides 
on puisse appliquer le raisonnement de la note précédente. Or on ne le peut pas pour 
un point qui serait à une distance d'un des A(») petite par rapport à Vn. 
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et conduirait à une nouvelle démonstration dn théorème 12. La démonstration 
initiale est évidemment plus simple; mais les remarques qui précèdent nous 
ont paru utiles pour montrer que: la courbe C, tout en comportant assez de” 
détours infiniment peltts pour recouvrir une aire, a cependant une mesure 
superficielle nulle, parce que l'allure désordonnée du point mobile ne permet 
pas le balayage méthodique d’une aire; il est infiniment peu probable que ce 
balayage soit réalisé, 


7. Généralisations diverses. 1°. Un des caractères essentiels du mouve- 
ment brownien est la similitude stochastique de deux arcs quelconques de 
trajectoire. Ce caractère est indépendant du nombre de dimensions de l’espace 
considéré, et subsiste par une transformation affine, c’est-à-dire qu’à à la loi 
de Gauss isotrope on peut substituer la loi de Gauss non isotrope. Mais 


' les lois stables autres que celles de Gauss conduisant à des courbes presque 


sûrement discontinues, il ne semble pas que l’on puisse trouver d’autres 
schémas présentant ce caractére et conduisant à des courbes continues.” 

Par contre il est facile de définir des schémas très variés pour lesquels la 
courbe décrite quand ¢ varie de zéro à un est une réunion d’arcs stochastique- 
ment semblables à la courbe entière. Pour nous limiter, nous n’étudierons 
que les courbes pour lesquelles les deux arcs A(0)A(#) et A($)A(1) sont 
stochastiquement semblables à la courbe entière; chacun de ces arcs se décom- 
posant à son tour dans les mêmes conditions, et ainsi de suite, nous voyons 
que chacun des arcs A(h- 2) A[(A + 1)2-"] est stochastiquement semblable 
à la courbe entière. Les points dont les cotes sont de la forme h- 2 sont 
alors des points particuliers de la courbe; l’allure de la courbe en un tel point 


‘ne ressemblera pas à son allure en un point quelconque. Les lignes poly- 


gonales L’,, ayant ces points pour sommets, et les chaînes de triangles inscrits 
désignées par 8’, au début du § 5 se distingueront essentiellement des autres 
lignes polygonales inscrites et des autres chaînes de triangles inscrits; on doit 
s'attendre à trouver pour les L’, et les 8’, des propriétés simples non sus- 
ceptibles d’être étendues sans modification aux autres lignes inscrites Lm et 
aux autres chaînes de triangles inscrits. 

D’autre part, ce qui n’était pas possible (en dehors du cas du mouvement 


„rectiligne et uniforme) lorsqu’on exigeait la similitude de n’importe quel arc 


de courbe avec la courbe entière, devient ici possible: il peut s'agir de simili- 
tude véritable, et non de similitude stochastique. On retrouve ainsi des 
courbes dont nous avons fait une étude systématique dans un mémoire récent 
(Journal de V Ecole Polytechnique, 1938) ; deux de ces courbes seront con- 


13 En tout cas cela est évident si l’on se borne aux schémas pour lesquels les 
déplacement successifs du point mobile sont stochastiquement indépendants. 
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sidérées dans la suite, et désignées par To et T4; F, est la courbe bien connue 
qui remplit Paire d’un triangle rectangle isocèle; pour la courbe Te, nous 
renvoyons à notre mémoire de 1938 pour la démonstration de ses principales 
propriétés. 


2°. Nous allons considérer en premier lieu les courbes T pour lesquelles - 
le triangle A(0)A(4)A(1) est un triangle rectangle isocèle dont 4(0)4 (1) 
est l’hypoténuse. Chacun des 2* triangles qui constituent laire S’, sera aussi 
un triangle rectangle isocèle ; si Pon prend A(0)A(4) pour ünité de longueur, 
les côtés de l'angle droit de chacun de ces triangles auront la longueur 


g{g=—=1/ V2). L’hypoténuse étant placée, le sommet de l’angle droit a deux 
positions possibles, et Paire du triangle, égale en valeur absolue à 2-0 pour 
les triangles de S'a, sera positive ou négative suivant le sommet choisi. La 
courbe sera donc bien définie par la donnée d’une succession de signes; nous 
désignerons par en % = + 1 le signe lié au ième triangle de S's. Nous 
supposerons ¢) == 1, ce qui n’est pas une restriction essentielle. 

Nous étudierons spécialement les deux courbes non aléatoires Ty et D, | 
` définies, la première par en‘ == 1, la seconde par e, = (— 1)”, et les deux 
courbes aléatoires T% et Fs pour chacune desquelles les signes seront déterminés 
par des tirages au sort à chances égales pour les deux signes; mais pour I’; 
le signe ne dépendra que de n et un même tirage au sort déterminera l’orienta- 
tion de tous les triangles de S’n; pour Ts il y aura un tirage au sort pour 
chaque triangle. 

Bien entendu, des règles quelconques ne donneraient pas des courbes 
composées d’arcs stochastiquement semblables à la courbe entière. L’énuméra- 
tion complète des courbes T pour lesquelles il y a similitude (effective, ou 
stochastique) entre chacun des ares À (0) A (4) et A(4)A (1) et la courbe entière 
serait assez longue. Il existe en outre des courbes T composées de quatre 
(ou huit, ou seize) arcs stochastiquement semblables à la courbe, et non deux; 
tel serait le cas si l’on admet qu’un même tirage au sort détermine l’orienta- 
tion des triangles de 8’, pour deux (ou trois, ou quatre) valeurs consécutives 
de n. 

Pour la courbe To, Paire totale des kiona de S'n, qui sont tous orientés 
positivement, a la valeur $, aire du triangle initial. Les différents triangles 
de 8’, ne se recouvrent jamais, de sorte que Yaire totale de 8”, est $. Cela 
conduit à penser que la courbe I, recouvre une aire égale à 4; c’est ce que, 
nous avons démontré dans le travail cité tout à Vheure. | 

D’autre part Paire So + 8”, +: --+ $/,:, aire comprise entre L'o et 
L’, comptée en affectant chacune de ses parties d’un coefficient numérique qui 
indique combien de fois elle est entourée, est égale à n/2; elle augmente 
indéfiniment, et Paire comprise entre la courbe To et sa corde est infinie. 
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On se l’explique bien en observant que la courbe est composée de boucles qui 
tournent toujours dans le même sens. On en déduit aisément que, pour un 
choix quelconque des points de division, on aurait toujours une aire infinie. 
Dans le cas de la courbe Ty, chacune des aires 8’, recouvre exactement 
Paire du triangle initial; à la limite, la courbe remplit le triangle: pour Fe 
‘et T4, une loi mathématique précise, pour des raisons évidentes dans le cas de . 
T, et beaucoup plus cachées dans le cas de To, réussit à faire ce que le hasard 
ne peut pas faire: la courbe remplit une aire sans qu'aucune partie de cette 
aire soit recouverte plus d’une fois. 
_ En tenant maintenant compte des signes, Paire comprise entre L'o et L'n 
se présente sous la forme | 
de sorte qu’elle est égale à zéro si n est pair et à $ si n est impair. On se rend 
bien compte de ce fait, géométriquement, en observant qu'après suppression 
de segments rectilignes dont chacun est parcouru une fois dans chaque sens, les 
‘lignes D’, se réduisent à I’, on à I’, suivant la parité de n. L'aire comprise 
entre la courbe T, et le segment initial A(0)A(4) apparaît ainsi comme 
indéterminée entre zéro et un. ` l 
Pour la courbe T:, Paire 8”, a la valeur «/2. Laire comprise entre la 
courbe et le segment initial A(0)A(1) est alors comparable au gain d’un 
joueur dans une partie de pile ou face indéfiniment prolongée; elle est indé- 
terminée, non entre deux limites fixes, mais entre — œ et -+ œ. D’autre part 
un raisonnement identique à celui fait à propos de T, dans notre mémoire 
cité ci-dessus permet de montrer que les différents triangles d’une même aire 
8’, ne se recouvrent pas: si l’on part d’un réseau de triangles recouvrant le 
plan, chaque succession de signes 6, €z, `  ‘ , €n conduit à un réseau d’aires S'n 
recouvrant exactement le plan une fois et une seule, et, à la limite, on obtient 
un réseau de courbes Ts, infiniment enchevêtrées les unes dans les autres, mais 
recouvrant le plan une fois et une seule; il y a lieu de penser que chacune 
recouvre une aire égale à celle du triangle initial. | 
* Le fait que le même tirage au sort définisse les orientations de tous les 
triangles d’une même aire 9’, suffit à constituer cette loi précise qui fait co 
que le hasard ne saurait faire: deux triangles de S'a ne peuvent pas se 
recouvrir. Fe ns 
I n’en est plus de même pour la courbe Ts, dans la définition de laquelle 
le hasard joue un rôle beaucoup plus grand. D’abord chaque aire §’,, compte 
tenu des signes de ses triangles, est assimilable au gain d’un joueur après 
2" coups de pile ou face, l’enjeu à chaque coan étant 2°), (C’est une 


5 On démontre du moins aisément que chacune a une mesure superficielle au moins 
égale à celle de ce triangle, 
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variable asymptotiquement gaussienne, dont Vécart quadratique moyen est 
gi (g == 1/ V2). La série 38’, est donc une série à termes indépendantes, 
qui converge en moyenne quadratique, donc presque sûrement. L'aire § com- 
prise entre la courbe est sa corde est donc stochastiquement bien définie, dans 
les mêmes conditions que pour le mouvement brownien; on peut aussi montrer 
qu'avec des points de division choisis au hasard il y a, dans les mêmes con- 
ditions que pour le mouvement brownien, convergence presque sûre vers la 
même limite S; mais il ne s’agit pas d’une aire définie au sens de Riemann. 

Comme cest une somme de termes aléatoires indépendants, on définit 
facilement, par sa fonction caractéristique, loi dont elle dépend. Cette 
fonction caractéristique est 


(55) il (cos <f A = sin z II (= sin oe 


0 





La deuxième expression, correspondant à un groupement évident des facteurs 
de la première, donc aussi au groupement correspondant des triangles dont S 
est la somme, montre que S est la somme de variables indépendantes ayant 
chacune une fonction caractéristique de la forme (A/z) sin z/d, c’est-à-dire que 
cette variable est choisie arbitrairement entre — À et + A avec une répartition 
uniforme de la probabilité. Elle dépend ainsi d’une loi absolument continue, 
et il en est de même de 9. 

Montrons maintenant que: la mesure superficielle de la courbe T's est 
nulle. Le principe du raisonnement est la même que dans le cas du mouve- 
ment brownien ($6, 1° et 2°). Mais ici, au lieu d’un facteur $(Af) qui 
intervient deux fois, il faut introduire deux facteurs ¢ġı (M) et (1) qui 
représentent respectivement les probabilités que M appartienne aux arcs 
A(0)A($) et A($)A(1); is sont respectivement égaux à w(r,8) et 
y(r, n/2 — 0), n désignant la distance A(4)AZ et 6 Vangle A(0)A(4)A. On 
sait que le produit y(r, 8)#(r,x/2 — 8) est presque partout nul; il s’agit de 
montrer que chacun des facteurs est presque partout nul. Ce qui était évident 
lorsque les deux facteurs étaient égaux ne l’est plus ici. 

Mais un artifice très simple va nous permettre d’arriver au résultat. Il y 
a une chance sur quatre pour que «4% = € = —1; dans ce cas les points 
A(4) et A(#) coincident, et les arcs ‘A(+)4(4) et A(4)A(#) sont deux 
déterminations indépendantes d’une méme courbe aléatoire, et la probabilité 
qu’un point M. appartienne à l’un ou à l’autre de ces arcs a une même valeur 
(M). Si alors Ts avait une mesure superficielle positive, et cela dans des 
cas de probabilité positive, il y aurait aussi une probabilité positive que les arcs 
A(4)A(4) et 4(4) 4 (4), stochastiquement semblables à T's, aient des mesures 
superficielles positives, et que de plus A(4) et A(#) coincident. L/indé- 
pendance de ces arcs, une fois les points A ($), A ($) et A(#) placés, permet 
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de terminer le raisonnement presque comme dans le cas du mouvement 
brownien: la mesure de ensemble des points communs aux deux arcs con- 
sidérés pourrait être positive, dans des cas de probabilité positive. Il en serait- 
de même des points communs aux ares Á (0)A(4) et A($)A(1). Or, par la 
première partie du raisonnement, qui subsiste sans modification, on sait que 
cest impossible; ce qui établit le résultat annoncé. 

On voit que, si ce résultat a pu être obtenu, c’est parce que la part du 
hasard est bien plus grande que pour la courbe Ts; pour cette courbe, les arcs 
A(4)A(4) et A(4)A(4) sont égaux; pour Ty, ils sont stochestiquement 
indépendants [une fois le point A(4) placé]; cette indépendance joue un rôle 
essentiel dans le raisonnement qui précède. | 


_ 8°. Pour terminer létude des courbes T, nous allons présenter quelques 
remarques relatives à la somme 


(34) Ba = 3 (A1), 


qui, étendue aux côtés d’une des lignes L’,, a la valeur non aléatoire 2. Si on 
Vétudie dans le cas d’une ligne LZ, dont les sommets sont choisis au hasard 
entre zéro et un, des considérations analogues à celles exposées à propos du 
mouvement brownien montrent que, si les points de division, une fois choisis, 
sont conservés, il est presque sûr que pour n infini, B, est infiniment . peu 


différent de sa valeur probable! Mais cette valeur probable n’est plus une 


constante; elle est de la forme P (logn) + en, P(A) étant une fonction 
périodique, et es tendant vers zéro. La suite des Bn présentera donc, sur -` 
l'échelle logarithmique, des oscillations asymptotiquement périodiques. 

Pour établir ce résultat, considérons d’abord la valeur probable pê? == $?(7) 
de (Al)?, AZ étant la longueur d’une corde pour laquelle At a une valeur donnée: 
T, et la cote ¢ de son origine étant choise au hasard entre 0 et 1—4. Sir devient 
deux fois plus petit, a? devient à peu près deux fois plus petit; on obtient 
évidemment ¢7(r)/2 comme valeur probable de (Al)? pour At=7r/2 et t 
choisi au hasard entre 0 et (1—r)/2 ou entre $ et 1—7+/2. La valeur 
probable de (Al)*, pour At==7/2 et ¢ choisi au hasard entre 0 et 1— 7/2 
sera donc de la forme ¢7(r)/2 [1 + O(r)] {on le voit aisément en observant 
que Al est toujours O[p(r)]}. Si l’on donne alors successivement à r les : 
valeurs r, 7/2, r/4,: : +, on voit que (2?/r) $7 (1/2?) tend vers une limite, pour 
p infini, ce qui revient 4 dire que 


(56) PO _ pi(logr) + elr), ` 
e(r) tendant vers zéro avec r, et P, ayant la période log 2. 


# Pour les schémas aléatoires T, et T, il est bien entendu qu’il s’agit de la valeur 


+ 
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Considérons maintenant les points t, ta ia 5 Ína choisis au hasard entre 
zéro et un, qui sont les cotes de sommets de La. On sait que chacun des 
intervalles At — r séparés par ces points dépend de la loi définie par 


Prim > 2} = (1—2 nes (n= ©), 


et que, si n est grand, les différentes valeurs possibles de nr sont réalisées avec 
des fréquences très probablement très peu différentes de leurs probabilités (on 
peut même préciser ce résultat au sens de la loi forte des grands nombres). 
U y a d’ailleurs, asymptotiquement, indépendance stochastique entre l’origine 
t et la longueur r des intervalles considérés, de sorte que, pour un intervalle 
de longueur + connue, la valeur probable de (Al)? est bien #?(r). On en déduit 
que lon a asymptotiquement l 


E{Ba} = f ” P, (log 1)e®Tn?rdr + en 


= [7P (iog z) edz + en = P (log n) + ens, 
© 


ey tendant vers zéro et P(logn) étant une fonction périodique de période 
log 2, c. q. f. d. na l | 

On remarque que P(logn), étant une moyenne entre les différentes 
valeurs de. P,(loga/n), ne varie qu’entre des limites assez voisines l’une de 
Pautre. Comme Ba, si n est, grand, diffère très probablement (même presque 
sûrement très peu) de sa valeur probable, on ne peut pas parler d’une oscilla- 
tion brownienne bien définie comme dans le cas du mouvement brownien, mais 
cette oscillation est indéterminée entre deux limites voisines l’une de l’autre; 


cette indétermination n’a d’ailleurs pas un caractère aléatoire: Ba diffère en. 


effet très probablement (et même presque sûrement, dans les mêmes conditions 
que pour le mouvement brownien) da la fonction non aléatoire P (log n). 


On peut d’ailleurs échapper à ces oscillations. périodiques en modifiant le 


choix des points de division de la manière suivante: nous choisirons un point 
de division # au hasard entre zéro et un, avec répartition uniforme de la 


« 


probabilité; puis deux points 4% et 7," respectivement dans les deux inter- . 


valles (0, to) et (to, 1) ; puis quatre nouveaux points dans les intervalles ainsi 


_probable a priori €{B,}, et que c’est en tenant compte à la fois du choix de la courbe 
et de celui des ¢, que nous disons que B,— €{B,} tend presque sûrement vers zéro. 

3 C’est done par erreur que, dans ma Note du 12 décembre 1938, javais indiqué 

les courbes I, et T, (désignées dans cette Note par ©, et 0,) comme modèles de 

mouvement brownien. Du moins il semble que ce soit une erreur. Il n’y à première 

vue aucune raison de penser que la/fogction périodique P,(log7r) se réduise à une 

constante; mais je wai pas démontré que cette hypothèse est exclue. ` Or remarque 

d’ailleurs qu’il est a priori possible qu’elle soit constante pour I’, et variable pour I, 

ou inversement. 
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' distingués, et ainsi de suite. Après p opérations analogues, on aura défini une 
ligne polygonal Ly” à 2? côtés, inscrite dans I. On peut penser que l’oscilla- 
tion périodique signalée ci-dessus disparaît ici simplement parce qu’on ne con- 
sidère que des valeurs entières de p= log n/log 2; mais il se produit aussi 

‘une autre circonstance remarquable. Les n= 2? valeurs de r = At corres- 

. pondant aux côtés de L,” ne sont pas ici pour la plupart de l’ordre de 

grandeur de 1/n; les n valeurs des produits nr se répartissent sur un intervalle 

beaucoup plus étendu que dans le cas précédent, et, pour n’importe quel 

intervalle fini sur l’échelle des log 1/7, la probabilité tend vers zéro pour n 

infini et tend à s’y répartir avec une densité constante. On aura alors à con- 

sidérer, au lieu de P(logn), une moyenne entre tes différentes valeurs de 

P(log 1/r) qui se réduira à la limite à 


ya ars P,(u) du, 


et les valeurs de Ba = B,” correspondant aux lignes polygonales L” tendent 
en probabilité, et même presque sûrement, vers B. Il faut remarquer qu’il 
n’y a aucune raison de penser que B a la même valeur 2 que dans le cas des- 
lignes Ly’; c’était une valeur particulière tenant au rôle particulier qui jouent 
les lignes L dans la définition des lignes T; ici il s’agit d’une moyenne, 
presque sûrement réalisée dans les conditions où nous nous sommes placés. 
On aurait d’ailleurs la même valeur limite pour By si l’on partait d’une 
division initiale de l’intervalles (0,1) en À intervalles égaux (ou choisis au 
hasard), dont chacun serait subdivisé ensuite comme il vient d’être indiqué. 
Dans les remarques qui précèdent, on pourrait s'attendre à trouver comme 
limite, au lieu de la constante B, une fonction périodique de log h. Il n’en 
est rien, et cette constante B semble donner, pour chacun des types de courbes . 
T, une bonne mesure de ce qu’on peut appeler loscillation brownienne 
généralisée; c’est une limite généralisée, ou limite en moyenne par rapport à 
la variable logn, de la suite des By obtenus par le premier des processus 
indiqués. 

Des considérations analogues, dans le cas de la courbe I',, peuvent gap- 
pliquer à l’aire comprise entre la courbe et sa corde; on pent définir une aire 
stochastique généralisée qui serait nécessairement égale à la moitié de Paire 
du triangle initial. Dans le cas de la courbe Ts, il y a presque sûrement une — 


227] faut remarquer que nous n’avons pas exclu lhypothèse quil: -y alt une aire 
stochastique non généralisée. Si c'était le cas, cela n’empécherait pas que pour des 
lignes polygonales inscrites L, convenablement choisies Paire comprise entre L, et 
A(0)A(1), ne convergerait pas vers cette aire Stochastique, et rien n’empéche de penser 
que les lignes L”,, soient précisément de telles lignes exceptionnelles. Disons seulement, 
en répétant une "dée exprimée dans la note précédente, que le mode de définition de la ` 
courbe implique la périodicité sur l’échelle logarithmique, et que nous ne voyons aucune 


LE MOUVEMENT BROWNIEN PLAN. 547 


aire stochastique généralisée, mais variable avec cette courbe (tandis que 
Poscillation brownienne généralisée ne dépend pas du choix de la courbe). 


4°. Etudions maintenant les courbes obtenues en prenant pour 
.A(0)A(4)A(1) un triangle isocèle de base A(0)'A(1) et d’angle au sommet 
a; nous les désignerons par l(œ); pour æ == ~/2 elles se réduisent à celles 
que nous venons d'étudier. Nous désignerons par Ta(«) (h = 0,1,2,3) la 
courbe pour laquelle l’orientation des triangles des aires S'n est définie comme 
pour la courbe Tu. 

Le rapport de similitude (effectif, ou stochastique) de chacun des arcs 
A(0)A(H) et 4(4)4(1) eb de la courbe entière est g =z- + man Si 
a > r/?, on a g? < +; il en résulte immédiatement que, si r est très petit, la 
longueur des cordes A(t)A(t-+ 7) est o( Vr); Voscillation brownienne est 
nulle. La courbe a alors une mesure superficielle nulle. D’autre part Paire 
` comprise entre la courbe ‘et sa corde est bien définie, au sens de l'analyse 
ordinaire. Il est inutile d’insister davantage sur ce cas simple; le cas où 
a < 7/2, donc q? > $, est moins simple. Il faut bien entendu, pour que la 
suite des lignes LZ’, successivement définies convergent vers une courbe, que 
Von ait q < 1. Nous supposerons donc maintenant œ compris entre r/2 et 
7/3, et étudierons la courbe I',(@) pour laquelle Vorientation de chacun des 
triangles de chaque aire 9’, dépend d’un tirage au sort indépendant des autres. 

Vaire d’un triangle de 8’ est + q*"s (s étant Paire du triangle initial). 
L’aire totale de S’n, compte tenu des signes, a pour valeur quadratique moyenne 
2"/2q%ns. La éondition pour que la série XS”,, qui définit Paire S, soit con- 
vergente en moyenne quadratique, et par suite presque sûrement convergente, 
est donc 2q* < 1, c’est-à-dire « > a’, a’ étant l’angle compris entre 7/3 et 
1/2 pour lequel 8 sin «’/2 = 1. Pour ces valeurs de a, Paire © est stochas- 
tiquement définie; pour a S g’, la série 39’, est essentiellement divergente, 
et l’on ne peut pas, même par des procédés de moyennes, définir S. 

Au point de vue de la mesure superficielle de la courbe T,(«), les con- 
sidérations exposées à propos de T, subsistent en ce sens que, si n est grand, 
les portions du plan recouvertes par S’, ont chance de l’être un grand nombre 
de fois. Mais en même temps la somme des aires des triangles de S’n, prises 
en valeur absolue, augmente proportionnellement à (2q?)"; la courbe fait done 
d'autant plus de détours infiniment petits, a d'autant plus de chances de 
pouvoir remplir une aire, que « est plus petit. On est donc en présence de 
deux causes agissant en sens contraire, et Pon ne sait pas à première vue 
laquelle Pemporte. On peut seulemæt observer qu’une des causes varie avec 


raison de penser que les oscillations que cette périodicité laisse prévoir n’existent pas 
effectivement dans l'étude de l'aire. Le calcul d’une moyenne sur un intervalle assez 
étendu les fait en tout cas disparaître. 
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a, et cela ne semble pas être le cas pour Vautre. D’autre part la probabilité 
que la mesure superficielle de T,;(æ) soit positive, n'étant pas modifiée par le 
résultat d’un nombre fini d'épreuves, ne peut être que zéro ou un. Il y a done 
lieu de penser qu’il existe un nombre œ” (peut-être égal à a’) tel que cette 
probabilité soit nulle pour « > &” (ou peut-être a = «”) et égale à Punité 
dans le cas contraire. 


5°. Etudions maintenant un exemple de schéma dans lequel le rapport 
de similitude stochastique de chacun des arcs A(0)A(4) et A(4)A(1) sera 
aléatoire. Nous prendrons à cet effet pour A(4) un point choisi au hasard 
sur la circonférence de diamètre ‘4 (0) A(1), ou sur l’une ou l’autre des demi- 
circonférence limité à ce diamètre; de même chaque triangle de chaque aire 
8’, sera un triangle rectangle ayant pour hypoténuse le côté de L’, qui lui 
sert de base. Pour mettre orientation de ces triangles en évidence, nous 
supposerons dans tous les cas qu’on choisisse le sommet indéterminé sur le 
demi-circonférence située à gauche de ce diamètre {les sommets de L’, étant 
parcourus dans le sens des ¢ croissants), et toujours avec une répartition 
uniforme de la probabilité sur cette demi-circonférence (on pourrait d’ailleurs 
adopter d’autres règles). On conservera le point choisi, ou bien on le rem- 
placera par le point symétrique, situé sur l’autre demi-circonférence, suivant 
le signe d’un nombre «% qui sera déterminé comme pour les courbes I. 
Nous désignerons par I” les courbes ainsi obtenues, et par Lo, 11, l'a, I”; les 
courbes, toutes aléatoires, qui correspondent respectivement aux courbes 
Ty, Da, Ta, Te | | , 

On remarque que, dans tous les cas, la somme B, = X (Al)? étendue aux 
lignes L'n, est égale au carré de la longueur A(0)A(1), carré que nous sup- 
posons toujours égal à 2. Au point de vue de loscillation brownienne, nous 
pouvons répéter ce qui a été dit pour les courbes I. Si l’on choisit des points 
de division au hasard, B, est indéterminé entre deux limites positives; mais 
le fait que, pour les lignes L’,, on ait Ba = 2, suffit à montrer que la courbe 
fait juste assez de détours infiniment petits pour pouvoir remplir une aire, 
si son tracé était guidé par d’autres lois que celles du hasard. 

' Evaluons d’abord Paire S'a. Un triangle de cette aire, si son hypoténuse 
est Al, a pour aire (Al)? sin &, ¢ étant un angle choisi au hasard entre 0 et 7; 
sa valeur probable, sans tenir compte des signes, est done (Al)?/2x, et, pour 
l’ensemble des triangles de S'n, comme B,=%, la somme de ces valeurs 
probables est 1/7. Il s’agit d’ailleurs d’une somme de termes tous très petits; 
on vérifie aisément qu’il y a convergence en moyenne quadratique, et même 

` convergence presque sûre, vers cette valeur probable. 

Pour les courbes Ia, I’, et I’, on se trouve alors dans les mêmes con- 
ditions que pour To, I4, et I}: chaque aire 8’, étant une somme de triangles 
ayant la même orientation, la série 38”, qui définit 9 est asymptotiquement 
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de la forme % + 1/7 ; elle est divergente, et Paire S n’est pas stochastiquement 
définie; du moins, à première vue, elle ne semble pas l’être. 

Pots la courbe I”,, au contraire, les aires des triangles de chaque 9’, ont 
_ des signes variables. La valeur probable de 8’, est nulle, et celle de 9’,* est 


E{(S'n)?} =F BEL (Al)* sin? 6} == È E (AT) 4} 
< 5 3(Al)*€{Max (Al)?} — 75 €(Max(al)’}. 


Cette expression est le terme général d’une série convergente. D'autre part, 
quand tous les sommets de la ligne L’, (et par suite 81, 8°2,- © - , 1) sont 
connus, Paire S'a dépend d’une loi symétrique. On sait que ces conditions 
entraînent à la fois la convergence en moyenne quadratique et la convergence 
presque sûre de la série S == 28”, ; Paire 9 est ainsi presque sûrement définie. 


6°. Démontrons maintenant que: les courbes I” ont presque sûrement 
ung mesure superficielle nulle. Ise raisonnement qui suit, 'en grande partie 
identique à ceux faits pour le mouvement brownien et pour la courbe Ts, 
s'applique indifféremment aux différents types de courbes I’. 

Les courbes I” étant dans une région bornée, leur mesure superficielle 
est bornée; elle a donc une valeur probable m, positive ou nulle, mais finie. 
Si A(0), A(4) et A(1) sont connus, les valeurs probables des mesures super- 
ficielles des ares A(0)A(4) et A(4) A (1) sont m cos? a et m sin? a, a désignant 
Vangle 4(1)4(0)A(+$) ; leurs valeurs probables a priori, A(1) étant inconnu, 
sont donc égales à m/2 (on remarque que, même si l’on adoptait pour a une 
loi de probabilité absolument quelconque, la somme de ces valeurs probables 
est toujours m). On en déduit, exactement comme dans le cas du mouvement 
brownien (§6,2°), que l’ensemble des points communs aux deux arcs con-' 
sidérés a presque sûrement une mesure superficielle nulle; il en est de même 
a fortiori, quel que soit r entre zéro et 4, de l’ensemble des points communs 
aux arcs A(4—7)A(4) et A(4)A(4 +7), qui n’est qu’une partie du 
précédent. 

Nous prendrons r == 1/64 (pour la courbe I’, on pourrait prendre 1/16). 
On voit aisément que, si A (0), A (4) et A (1) sont connus, les positions possi- 
bles pour chacun des points A($—r) et A(4-+7) recouvrent une aire 
entourant complètement le point A ($), et cela avec une densité de probabilité 
admettant au voisinage de ce point une borne inférieure positive. 

Désignous par O, une quelconque des formes possibles de Pare A(4)A(1). 
Les différents arcs A(4)4(4 +7) possibles obtiennent en choisissant une 
position possible pour A($ +7) et un are Ci; A($)A($ +7) sera Parc 
semblable à C, allant de 4(4) à A(4 +7). Supposant l’origine placée au 
point A(4), et représentant les points du plan par leurs affixes X + iY, nous 
désignerons par U(t)(4<t< 1) Vaffixe du point A(t) de Pare C, et par 
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V(t) ($ <t< $ +r) celle du point A(t) de Pare A(4)A(4 + 7) semblable 
à O, et aboutissant en un point A ($ +7) d’affixe Vi. On a évidemment — 


PV Hu =: AI NAT (0 <u<l). 


Supposons que C, ait une mesure superficielle positive; si V est donné et 
assez petit, et que l’on détermine V, par cette relation, les points d’affixes V, 
décrivent, quand u varie, une courbe transformée de C, ayant aussi une mesure 
superficielle positive, et de plus très voisine de A(4); il y aura donc une 
probabilité positive que A (4 + 7) soit sur cette courbe, et que par suite lare 
A(4)A($-++7) contienne le point M ®affixe V; c’est un point quelconque 
dans le voisinage de A ($). 

Si alors il y avait une probabilité positive que I’ ait une mesure super- 
ficielle positive, il en serait de même pour C1; la circonstance que nous venons 
d’examiner se produisant avec une probabilité positive, la probabilité (M) 
que, A(4) étant connu, M appartienne à Parc A ($) A(4 + r), serait positive 
pour M assez voisin de A(4); il en serait de même de la probabilité ¢)(M) 
relative à Pare '4(£—r)A(&). Or, une fois A(4) choisi, ces arcs sont 
indépendants, et la probabilité que M appartienne à la fois à ces deux arcs 
aurait la valeur (M) = ¢.(M)¢1(M) positive au voisinage de A(4). Son 
intégrale dans tout le plan, qui est la mesure superficielle probable de Ven- 
semble commun à ces deux arcs, quand A(+#) est connu, serait positive. Cette 
conclusion étant vraie quel que soit le point A ($), sauf s’il occupait une des 
positions extrèmes A (0) et A (1), ce qui est infiniment peu probable, la valeur 

_ probable a priori de cet ensemble serait nulle, ce qui est en contradiction avec 
le résultat obtenu plus haut. Le résultat énoncé est donc établi. 

Nous avons ainsi vérifie une fois de plus un fait évidemment très général : 
quand Voscillation brownienne n'est pas infinie, pour que la courbe étudiée 
recouvre une aire, tl faut une organisation de ses détours infiniment petits que 
le hasard n'a aucune chance de produire. Le cas général est celui où la mesure 
superficielle de la courbe considérée est nulle. ‘ 

Nous avons, dans les trois cas étudiés dans ce travail (courbes C, Ts et 
I’) utilisé l’indépendance, au moins lorsque certains éléments aléatoires sont 
connus, d’an arc précédant ce point et d’un arc suivant ce point. Il est évident 
qu’une relation. aussi précise que l’indépendance stochastique n’est pas néces- 
gaire. Ainsi les probabilités (M) et ¢:(@) pourraient n’étre pas indé- 
pendantes; si (M), quoique dépendant de Pare 4(0)A(4) supposé connu, 
avait une borne inférieure positive, la conclusion subsisterait. 

Tl y a donc lieu de penser que le principe général que nous venons 
d'indiquer peut s’appliquer à beaucoup d’autres schémas que ceux étudiés dans 
ce travail. | 


Pants, 


A GALOIS THEORY OF LINEAR SYSTEMS OVER COMMUTATIVE 
FIELDS.* : 


By REINHOLD BARR. 


N. Jacobson ? has recently succeeded in extending the Galois Theory from 
commutative fields to non-commutative fields. In accordance with the now- 
adays customary point of view he considers the Galois Theory. as the theory 
of finite groups of automorphisms of commutative fields and of their fields of 
invariants. This theory contains the classical correspondence theorem of 
Galois as a simple special case. His fundamental condition which makes it 
possible to carry the commutative theory over to the non-commutative case 
is the restriction to finite groups of automorphisms without inner auto- 
morphisms 41. His method consists in the application of the theory of 
simple rings without making much use of the commutative Galois Theory. 

In this paper we give a different approach to Jacobson’s theory. Our 
intent. is to use the commutative Galois Theory ruthlessly and it turns out 
that beyond doing this one needs hardly more than the fact that a non-singular 
matrix with coefficients in a commutative field possesses an inverse; in par- 
ticular we do not need any deeper facts concerning linear transformations 
or ideals. 

Our method makes it possible to extend the theory in several directions. 
First we may investigate instead of non-commutative fields linear systems over 
commutative fields which need not have a finite basis over this field of reference 
and with one exception all the theorems of Galois Theory proper hold true in 
this framework. The exception which does not hold true may be easily derived 
in the case of fields from certain theorems concerning linear systems and is 
actually wrong for linear systems. Secondly we can prove that—at least for 
infinite linear systems—Jacobson’s condition, properly phrased, is necessary 
for the validity of a Galois Theory. The rephrasing consists in substituting 
the concept “ central-automorphism ” for “ inner automorphism ”; and this is 
necessary, since only the former can be defined in the case of linear systems. 
This is the reason why we need the stronger hypothesis to obtain even those 
results which Jacobson is able to establish on the basis of his weaker con- 
dition. Thirdly we may prove quite general theorems which permit the 

e 


* Received August 4, 1939. 
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transfer of the Galois Theory of any class of groups of automorphisms of 
commutative fields to linear systems whenever there exists a commutative 
Galois Theory so that in particular the Galois Theory of infinite algebraic 
extensions may be extended to linear systems; and these general “ transfer ” 
theorems are actually the starting point of our investigations. 

Finally we give the elements of a theory of crossed products of non- 
commutative fields with finite groups of automorphisms. The standard 
theorems may easily be carried over. Only when proving a generalization of 
E. Noether’s “ Hauptgeschlechtssatz im minimalen” do we have to assume 
that the field in question be finite over its central so that we may use the 
theorem that central-automorphisms are inner automorphisms, a theorem that 
otherwise has no place in our theory. In this chapter as in the others 
Jacobson’s work and ours overlap in many respects though the methods em- 
ployed are rather different—his being strictly non-commutative, ours strictly 
commutative—and though neither obtains all the results of the other one. ` 


CHAPTER I. Fundamentals and transfer. 


1. If the set L is a commutative group with regard to an operation 
which is written as addition, if L contains elements different from 0, if F 
is a commutative ® field, and if there exists to every element f in F, x in L 
a uniquely determined element fs — gf in L so that 


(a) f(o+y) = fet fy for fin F, x,y in L, 

(b) (+g)z= fr + ge for f,g in F, z in L, 

(e) (gr) = (fg) for f,g in F, z in L, 

(d) it for v in L and 1 the unit element in F, 


then L is called a linear system over the field F. 


Note that (d) assures the absence of zero-divisors. 

_ Dependence and independence of subsets of L with regard to F may be 
defined as usual. Every independent set is contained in a greatest independent 
set, every greatest independent set is a basis and any two greatest independent 
sets contain the same number of elements. These remarks clearly concern L 
as an abelian operator group with operators in F. However it is not this aspect 
of the matter that interests us primarily. 


3 It may be noted that in some parts of Chapter I it is not necessary to assume 
that F be a commutative fleld, that the property of being a field will be sufficient for 
these considerations. 
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If 8 is any subset of L, then we denote by Z (8) the set of all the elements 
z in F so that zs is in g for every s in S. The subset S of L is said to be 
complete in L, if 
(i) S isa subgroup of the addition group L, 
(ii) 80, 
(ili) Z(S) is a subfield of F. 
- Most of the complete sets which we shall have to consider will satisfy an 
additional property: 
(iv) If f is an element in F so that there exists an element s £ 0, satisfying: 


fs is an element in 8, then f is an element in Z(8). 


If S is a subset of L, U a subset of F, then US is the subgroup of the 
additive group L, which is generated by all the elements us for u in U and 
sin 8. | 

lf 8 is complete in L, then the subset V of F is said to be independent 


over 8, if Š vis: = 0 for v, in V and s; in J implies that all the s, are 0. 
tel j 

(1.1) If S ts complete in L, and if the subset V of F is independent over 9, 
then V ts independent over Z (8). 

Proof. Suppose that Š av = 0 for v; in V and z; in Z (8). There 

i 
exists in § an element s 34 0. Hence all the elements s; = z;s are in S. Thus 
we have Ÿ SiV; = 8 $ #it4 = 0. Since V is independent over 8, this implies 
izl tl à 


that 0 == s; = z;s. Hence z; 0, since 50. 
The converse of (1. 1) is in general not true. Hence we define: 


The subset T of L is the direct product U X S of the subfield U of F 
by the subset S of L which is complete in L, if 


(1) . Z(8) SU, 


(2) subsets of U are independent over Z (S) if, and only if, they are in- 
dependent over 8, i 
(3) ; T = US. 


It is a consequence of (1.1), that the conditions (2) and, (3) may be 
condensed into the following condition: 


(0) A subset V of U is a basis of U over Z(S) if, and only if, it is a basis 
7 
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| of T over S, i.e. every element t in T may be represented in one and only 
one way in the form: 


t— 2 s(v) v for s(v) in S, 


where all the s(v)—apart from a finite number of exceptions—are 0. 


That T is generated in “adjoining” V to S is equivalent to (3); and 
the unicity of representation of elements in T is equivalent to (2). 

A simple method for constructing subsets T of L so that L is the direct 
product of F and T is contained in the following statement. 


THEOREM 1.2. Suppose that L is a linear system over the commutative 
field F, and that the subset T of L is complete in L. Then L is the direct 
product of F and T if, and only if, every basis of the operator group T over 
Z(T) ts a basis of the operator group L over F. 


Proof. Suppose first that L is the direct product of F and T. Let B be 
some basis of the operator group T over Z(T) so that T may be written: 
T= Z:Z(T). Suppose furthermore that f1,: : :,f, are elements in F, 


in 


k 
by,* + +, bg elements in. B so that © f4b4==0. Let A be some (linear) basis 
i 
of F over Z(T). Then there exist elements u; in A, zı; in Z(T) so that 
fi = 2 z4juy and thus we find that 0 = 2 24704 = = tyu; where ty = > 24404 
43 


is an cements in T, since the b; are in T and the 24; ae in Z(T). Since L is 
the direct product of F and T, and since the u; are elements in F which are 


y k . 4 
independent over Z (T), it follows that 0 = t; = $ 2,;b;. Since the elements 
1 


b, are part of a basis of the operator group T over Z (T), they are independent 
- over Z(7') so that all the elements zı; are 0, since they are in Z(T). Thus all 
the f; are 0, i.e. B is independent over F too. Since L = FT, every element 
in L depends on B (with coefficients in F') so that B is a basis of the operator 
group L over P. 

Suppose now conversely that B is some basis of the operator group T over 
Z(T) which is at the same time a basis of the operator group L over F. Then 
clearly L == FT. Suppose now that the elements f,,- - -,f; in F are (linearly) 


k 
independent over Z(1°), and that the elements t in T satisfy: 0 == È tifi 
Since B is a basis of T, there exist elements ziy in Z(T), b; in B ha “es 


i= = 2430, and hence 
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m k 
0 X, bife = D [ X zag fe lbs. 
4, j=l t=1 


k : 
Since the elements b; are part of a basis of L over F, it follows that 0 = 3 z;jf4, 
421 


as these are eléments in F; and since the f; are independent elements in F 
over Z (T), it follows that the elements 2;; in Z(T) are 0. Thus all the t, 
are 0; and this implies that every set in F which is independent over Z (T) is 
at the same time independent over T. Hence L is the direct product of F and 
T ; and this completes the proof. As a matter of fact we have proved slightly 
more namely the 


COROLLARY 1.3. If L is a linear system over the commutative field F, 
and if the subset T of L is complete in L, then the following propositions are 
equivalent. ` 
(A) | L=FXT. 


(B) There exists at least one basis of F over Z(T) which ts a basis of L 
over T. 

(C) Every basis of the operator group T over Z(T) is a basis of the operator 
group L over F. | 

(D) There exists at least one basis of the operator group T over Z (T) which 
ts a basis of the operator group L over F. | 


2. In this section we shall introduce the concept of automorphism of 
a linear system which, of course, will differ from the concept of automorphism 
of an operator group. 


(2.1) If Lisa linear system over the (commutative) field F, then there exists 
to a given automorphism g of the additive group L, at most one automorphism 
h of the field F so that 


(f£): = ftes for f in F and x in L. 


Proof. Suppose that h-and k are two automorphisms of the field F so that 
firs == (fr)s == ftes for f in F and x in L. There exists in L an element 
w <0; and wë 0, since g is an automorphism of D,. Hence fhws — frs 
or (f* — f*)ws — 0 and this implies f* == f* for every fin F or h == k. 


Consequently we define: The transformation g of L is an automorphism * 
of the linear system L over the field F, if | 
ene ne aaae . 


“These automorphisms of linear systems over fields are often termed semi-linear 
transformations; cp. e.g. N, Jacobson, Annals of Mathematics, vol. 38 (1937), 484-507. 
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(*) gisan automorphism of the additive group La, and if 
(**) there exists one (and only one) automorphism h of the field F so that 
(fa) = ftre for f in F and z in L. 


If g is an automorphism of the linear system L over the field F, then we 
say of the uniquely determined automorphism h of the field F which occurs | 
` in (**) that it is induced by g and put h -== g’. If G is some set of auto- 
- morphisms of L, then we denote by G’ the set of all the g’ for g in G. 

If u, v are both automorphisms of L, then (uvy == wv’. If G is a group 
of automorphisms of L, then a homomorphism of G upon the group G’ is 
‘defined’ in mapping the element g in G upon the element g’ in G’; and this 
homomorphism of G upon G’ is an isomorphism between G and G’ if, and 


` only if, g’ = 1 implies g — 1 (for elements g in G). 


An automorphism which leaves all the elements in some set Y invariant 
is called a Y-automorphism. If the subset S of the linear system L over the 
field F is complete in L, and if g is an S-automorphism of L, then g’ is a 

4(8)-automorphism of D. . 
i If G is a group of automorphisms of L, then (L,G) consists of all the 
elements in Z which are left invariant by every automorphism in G; and 
(F,H) is defined accordingly. 

If § is a subset of L, then (S < L) is the group of all the S-auto- 
morphisms of L; and (T < F) is defined accordingly. These ea 
satisfy (as neue): $ 


G< ((L,6) <L) ad SS (L, (8< L)). 


Since furthermore G = H implies (L, H) = (L,G), and since SST ni 
(T < L) = (8S < L), it follows that ! 


(5 < L) = ((L, (8 < L)) < L) and. (1,6) = (L (2,6) < L)). 


(2.2) Let G be a group of automorphisms of the linear system L over the 
commutatwe field F. 


. (a) If every (F, G’)-automorphism of F is induced by at most one (L, G)- 
automorphism of L, then (L,G) 0. 
(b) If (L,G) 0, then 
(b.1) Z((L6))= (F,6), | 
: (b.2) (L,G) is complete tn L, 


(b.3) the element f in F belongs to Zd (L,G)) whenever there exists an` 
element t 0 in (L,G) so that ft is.in (L,G). 


~ A GALOIS THEORY OF LINEAR SYSTEMS OVER COMMUTATIVE FIELDS. 6557 


Proof. Assume that every (F,G’)-automorphism of F is induced by at 
most one (L,G)-automorphism of L and that (L, G) —0. If f is an element 
= 0 in F, then an automorphism of L is defined by zë fx for x in L, since 
(rv) == frg = roe for r in F. Hence g is an (L,G)-automorphism of L 
satisfying g’— 1, and this implies g == 1 so that f— 1, i.e. F consists of 0 
and 1 only. Consequently every g’== 1 so that every g—1; and hence 
(L, G) = L £0 which is a contradiction. 

Assume now that (Z,G)-£0, and that f is an element in F, {340 an 
element in. (L, G) =T and that ft is in T too. If g is any automorphism 
in'G, then ft = (ft}s— fft so that f = fë for every g’ in G’. Hence f is an 
element in (F,G’) and in particular 7(T) < (F,G’).—If z is an element 
in (Ff, G’), t any element in T, then (zt)s—#t for every g in G so that zt 
is in T and consequently z is in Z(T). Hence Z(T) = (F, G’) so that Z(T') 
is a field and T is complete in L. 


THEOREM 2.3. Suppose that L is a linear system over the commutative ` 
field F, and that the subset T of L is complete in L. 


(a) If L is the direct product of F and T, then every AE oom nen 
of F ts induced by one and only one T-automorphism of L. 

(b) L= FT if, and only tf, the identity is the only T-automorphism of L 
which induces the indentity in F. | 
(c) If Z(T) = (F,(T < LY), then every independent subset of the opera- 
tor group T over Z(T) is independent over F too. 


Proof. If L is the direct product of F and T, and if B is a basis of F 
over Z(T), then B is a basis of L over T. If v is any element in L, then 
there exist therefore uniquely determined elements {(x, b) in T so that only 
a finite number of t(x, b) are different from 0, and so that i 2 at (te b)b. If 


g andh are two T-automorphisms of L so that g’ = h’, then of > t(x,b)bs 

= À ae b)b¥ == q* so that g = h. If conversely u is a Z (T}- a a A 
of 'P, he a T-eutomorphiem v of L is defined by dr 2, t(x,b)b® and 
clearly v = u. This proves (a). 

L* == FT ig in any case an admissible subgroup of the operator group L 
over F. Thus every basis of the operator group L* over F is contained in 
some basis B of L over F, If L* < L, then there exists in B an element w : 
which is not contained in L*. As T +0, there exists in T an element 140; 
and there exists one and only one @utomorphism g so that g’ == 1, b = be for 
bw in B, wt=—w-+t. Since g is a T-automorphism, and since g 1, 
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it. follows that FT 54 L implies the existente of a T-automor rphism 341 which 
induces the identity, in F; and'this proves (b), since g is the- HR on FT, 
 ifgisa T-automorphism such that g’ = 1. l 

Suppose now that Z (T) = (F, (T < L)’) and that 8 is a subset of, T 
which is independent over Z (T). If § would not be independent over F', 
then § would contain a finite subset which is dependent over F, and amongst 
* these there would be a smallest one, say S,° °°, There exist therefore 


* “elements fi not all of them 0, 80 that o= À se and since the s form a 

‘smallest dependent subset of 5, none of T ft is 0. If g is any T-auto- 
morphism of L, then -0 == Žž sf and consequently 0 = > silff — fef) | 
and this aa fft = ir F, since the s; form a Bodia e F] dependent 
subset of S. Hence fiıfı™ is invariant under all the g’ in (T< LY and belongs 

| therefore to Z(T). Since f0, we find therefore that o= Sales) 


where all the coefficients are elemenis Æ0inZ(T). Hence the s; ant there- 
fore S would be FE over Z (T) and this is epee aie so that (e) holds . 
true too. ~ 


3. The problem of a Galois Theory of linear systems will be roducod. 
by means of the theorems in this section to the CE en of 
Galois Theory in commutative’ fields. _ 


THEOREM 3.1. The subset T of the ‘aba bn L over the commutative 
field F satisfies 


(a) T = (L,(T < L)) and 

(b) 1isthe only automorphism g in (T < L) so that g: = T 

tf, and only if, 

(i) T ts complete in L, 

(ii) Z(T) = (F, (Z(T) < F)), 

(ili) L ts.the direct product of F and T. | , 

Proof. Suppose first that the conditions (a) and (b) are satisfied by T. 
Then it follows from (2.2) that T is complete in L (condition (i)!) and 
that Z(T) = (F,(T < LY). Since therefore (T < Eo e < F), it 
follows that b Nu 
Z(T) = (F, (Z(T) <F)) <ir, wen) —Z(P) 
and this proves (ii). Me à 
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. Suppose now that B is a basis of the operator group T over the field Z(T). 
Then it follows from Theorem 2.3, (c) that B is independent over F so that 
B is a basis of the operator group L over F, since it follows from (b) and 
Theorem 2.3, (b) that L == FT. Hence L is the direct product of F and T 
by Theorem 1. 2. 


Suppose now conversely that the conditions (i) to (iii) are satisfied by T. 
Then it is a consequence of Theorem 2.3, (a) that (b) holds true, and that 
(Z(T) < F)=(T < LY. Let now B be a basis of the operator group T 
over Z (T). Then B is by Theorem 1.2 and condition (iii) a basis of the 
operator group L over F. If w is any element in (L, (T < L)), then there 


k 
exist elements f; in F and elements b; in B so that w = J; fbı. If v is any 
| 1=1 
automorphism in (Z (T7) < F), then there exists a T-automorphism g of L 
k k 
so that g’ =v; and hence we find that D fiv = wt = w = > fibi so that 
4-1 421 


fı = fs for every v in (Z(T) < F). All the elements f; are therefore in 
(F, (Z(T) < F)); and since this field is equal to Z(7) by condition (ii), 
it follows that all the f; are in Z(T) and that w is in T, since all the b; are 
in T. Hence TS (L, (T< L)) ST, i.e. (a) holds true too. 

THEOREM 3.2. The group G of automorphisms of the linear system L 
over the commutative field F satisfies 
(a) G= ((L,G6) < L) and 
(b) lis the only automorphism g in G so that g’ —1 
if, and only tf, 
(i) C = ((F, 6’) < F) and 
(ii) every (F,G’)-automorphism of F is induced by one and only one 
(L, G)-automorphism of L. 


Proof. Suppose first that (a) and (b) are satisfied by G. Put T = (L,G) 
so that G == (T < L) and T = (L, (T < L)) by (a). Hence it follows from 
(b) and Theorem 8.1 that T is complete in L, that Z (T) == (F, (Z(T) < F)) 
and that L is the direct product of F and T. Now it is a consequence of 
Theorem 2. 3 that every Z(Z')-automorphism of F is induced by one and only 
one T-automorphism of L and hence (ii) holds true, since 

Z(T) = (F, (Z(T) < F)) = (F, (T < £)’) 
by (2.2). Finally it follows now from (a) that 
G'=((1,6) < LY = (T < LY = (FC) < F) 


so that (i) holds true too. 
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Assume conversely that G satisfies (i) and (ii). If the automorphism w 
is in ((L,G) < L), then w is in (Z((L,G)) < F); and since Z((L,6G)) 
_ == (F, G’) by (ii) and (2.2), w is in ((F, G6’) < F) which group equals G” 

by (i). Hence it follows from (ii) that w is in G so that (a) and (b) are 
consequences of (i) and (i). 


COROLLARY 3.3. Suppose that the group G of automorphisms of the 
linear system L over the commutative field F satisfies the conditions (a) and 
(b) of Theorem 8.2, and that H is a subgroup of G. Then H = ((L,H) < L) 
tf, and only tf, W = ( (F, H’) < F). 

This follows from Theorem 3.2, since H satisfies condition (b) of 
Theorem 3.2 as a subgroup of G, and since (F, G’) S (F, H’) implies that 
H satisfies condition (ii) of Theorem 3.2 as G satisfies this condition. 


Txeorem 3.4. Suppose that L is a linear system over the field F, that T 
is a subset of L, that T = (L, (T < L)), that 1 is the only REER g 
of L with g’ == 1, and that B ts a set between T and L. ; 


(A) B= (L, (B< L)) tf, and only if, there exists a field R between Z (T) 
and F so that R = (F, (R < F)) and so that B = RT. 

(B) If R is a field between Z (T) and F so that R= (F, (R < F)), then 
R= Z(RT) and RT is the direct product of R and T. 


Proof. Suppose first that there exists a field R between Z(T) and F 
so that R = (F, (R < F) ) and so that B = RT. Then R<Z(B). Letz be 
an element in Z (B), t an element £ 0 in T and v an R-automorphism of F. 
Then there exists by Theorem 3.1 and 2.3 a T-automorphism g of L so that 
g =v. Since g leaves all the elements in T invariant, and since v leaves all 
the elements in # invariant, g leaves all the elements in ÆT invariant. Since 
T < RT, and since z is in Z (RT), 2t is an element in RT so that zt == (at) ë =— 2*t 
and z = 2" since t 3&0. Hence z is in (F, (R < F)) = È so that R = Z (TR). 
Hence (B) holds true, since subsets of À = F that are independent over Z (T) 
are independent over T. 

Let now & be a basis of the operator group T over Z(T). It follows from 
Theorem 3.1 and Theorem 1. 2 that S is a basis of the operator group L over 
F. If w is an element in (L, (RT < L)), then there exist elements f; in F 


k 
and elements 3; in 8 so that w = 2 fass. If v is any R-automorphism of F, 
then there exists again a T-antomorphism g of L 80 a g =v; and g is an 
meron of L. Hence z fisi- = V = Ww = 2 fisi 80 that fa f, 
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since the s; are in T and are independent over F. The elements f, are there- 
fore in (F, (RE < F)) = EÈ so that w is in RT, ie. RT = (L, (RT < L)). 

Assume now conversely that the set B between T and L satisfies 
B= (L,(B<L)). There exists no B-automorphism of L, inducing 1 in F, 
except 1, since there exists no T-automorphism 41 of L which induces the 
identity in F. Hence it follows from Theorem 3.1 thet B is complete in L, 
that Z(B) = (F, (Z(B) < F)) and that L is the direct product of F and B. 
Consequently B* = Z (B)T = B; and it follows from what has been proved 
in the first two paragraphs that Z(B*) = Z (B) and that B* = ( L, (B* < L)). 
It is a consequence of Theorem 3.1 and Theorem 2.3 that every Z(7)-auto- 
morphism of F is induced by one and only one T-automorphism of L so that 
(B < LY = (Z(B) < F) = (Z(B*) < F) = (B* < LY and therefore 
(B< L) =(B* < L) and finally B*— (L, (B* < L)) = (L, (B < L}) 
== B and this completes the proof of (A). 


THEOREM 3.5. Suppose that T is a subset of the linear system L over 
the field F, that T == (L, (T < L)), that the identity is the only T-auto- 
morphism of L which induces the identity in F, and that the set B between T 
and L is complete in L. Then B satisfies: 


(a) B = Bs for every T-automorphism g of L, 

(b) every T-automorphism of the linear system B over the field Z (B) is 
induced by some automorphism of L, 

(c) (F, (B < LY) =Z (B) 

if, and only if, the following conditions are satisfied by B: 

(i) (B< L) is a normal subgroup of (T < L), 


(ii) every Z(T)-automorphism of Z (B) is induced by automorphisms of F, 
(iii) B = (L, (B < L)). 


Proof. We note first that (Be < L) =g1(B < Lg for every auto- 
morphism g of L. This shows that (i) is a consequence of (a). If conversely 
(i) and (iii) are satisfied, then Bs < (L, (Be < L)) = (L, (B < L)) =B 
for every T-automorphism g of L. This implies that B S B© for every 
T-automorphism g of L and therefore we have B = Bs for every T-auto- 
morphism g of L, i. e. 
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(a) is a consequence of (i) and (iii). 


If - (iii) is true, then it follows from Theorem 3.4 that Z(B) 
= (F, (Z(B) < F)) and that B—Z(B) XT. It follows from Theorem 
2.3 that every Z(T)-automorphism of 7(B) is induced by one and only one 
T-automorphism of B. If now g is any T-automorphism of B, then g is a 
Z(T)-automorphism of Z(B). If (ii) holds true, then there exists an auto- 
morphism u of F which induces g’ in Z(B). There exists by Theorem 2.3 
one and only one T-automorphism h of L so that h = u. It is a consequence 
of (a) that h induces an automorphism k in B. Since clearly k’ = g, it 

follows that k = g, as every Z(T)-automorphiem of Z(B) is induced by one 
and only T-automorphism of B. Thus (b) is a consequence of (i) to (iii). 

Suppose now that u is a Z(B)-automorphism of F. As u is a Z(T)- 
automorphism of F, there exists one and only one T'-automorphism v of L 
so that v = u, Since B == Z(B) X T, it follows that v is a B-automorphism 
of L, and this shows that (Z(B) <F)—(B<L). Since we already 
proved that Z(B) = (F, (Z(B) SAN; condition (c) is a Pense of 
(i) to (iti). 

We assume now that conditions (a) to (c) are satisfied 7 B. Let S be 
any basis of the operator group T° over Z(T). Then S is a basis of the 
operator group L over F so that S is independent over Z (B). Hence S is 
contained in a basis S* of the operator group B over Z (B). But it follows 
from (c) and Theorem 2.3, (c) that S* is independent over F too. Con- 
sequently, S == S* and B is the direct Prone of Z (B) and T, as follows 
from Theorem 1. 2. 


Since (B < LY S (Z(B) < F), and since therefore 
Z(B) = (P, (Z(B) < P)) S (F, (B < bY’) =Z (8) 


by (c) or Z(B) = (F, ER < F)), it follows now from Theorem 3.4 that 
B= (I; (B <L)).- 


Suppose finally that u is a Z(T')-automorphism of Z(B). Since B is 
the direct product of Z(B) and T, there exists by Theorem 2.3 one’and only 
one T'-automorphism v of B so that w’—u. It is a consequence of (b) that 
there exists an automorphism g of L which induces v in B., Then the auto- 
morphism g’ of F induces v = u in Z(B). This completes the proof of the 
fact that (i) to (iii) are consequences of. (a) to (e). ` 
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CHAPTER. II. Galois Theory. 


4, i this section we state the finite, commutative Galots IRON in the 
form most convenient for our purposes... 


(4.1) Suppose that K is a subfield of the (commutative) field F. Then 
there exists.a finite group H of automorphisms of the field F so that 


$ K = (F, B) 
if, and only if, F is finite, normal and separable over K. 


(4.2), If H is a finite. group of automorphisms of the GRR) field 
F, then” de 
H = (EH) <F). 


(4. 3) _ If F is finite, normal and separable.over its subfield K, then . 
= (F, (B<F)) | 


for every field B between K and F, i.e. Fis ‘finite, normal and LS over 
every field B between K and F. 


(4.4) If F is Finite, normal and separable Rd 6 
field between K and F, then a necessary and sufficient condition for B to be - 
finite, nornial and separable over K is that 


: (B < F) is a normal subgroup of (£ <F), . 
and then Œ <B) and (K < F)/(B < F) are essentially the same. 


(4.5). If Fis finite, normal and separable over its subfield K, then there 
exists in Fan element b so that the elements b* for h in (K < F) form a 
basis of F over K. (Existence of a normal basis).® 

It my. finally be ss that finite and separable extensions are 


? 


“hat. from the. .text-books on modern ice one should consult the following 
papers in which the theory bas been presented in a form similar to the one sketched 
here. R. Baer, Mathematische Zeitschrift, vol. 33 (1931), pp. 451-479; R. Baer, Ameri- ` 
can Journal, of Mathematics, vol. 59 (1937), pp. 869-888; W. Krull, Mathematische 
Annalen, vol. 100 (1829); pp. 687-698; E. ‘Steinitz, Algebraisohe’ Theorie der Körper. 
Neu. herausgegeben ` und mit einem Anhang: Abriss der Galoisschen Theorie versehen i 
von Reinhold Baer und Helmut Hasse. Berlin, 1930. 

° A complete proof of this theorem has first been given, by, M. Déuring, Mathe- 
matische’ Anndien. © All the proofs published so ‘far ‘use extensively thé theory of 
representations. There exists however an unpublished proof by E. Artin which, ‘uses 
but elementary means from the theory of fields so that this theorem may now be con- 
sidered a part of Galois Theory proper. RE 
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simple extensions, and that the degree of a finite, normal and separable . 
extension is exactly the order of its group, and that the matrix (b45) possesses 
an inverse, if the b; form a basis, the g the group of a finite, normal and 
separable extension ; finally that every automorphism of a subfield of a normal 
extension is induced by an a DRE of the extension field. : 


5. In this section some remarks, concerning matrices and linear equa- 
tions, shall be given which will prove useful in the future. 

Let L be a linear system over the field F. If B is a matrix of n rows 
and n columns with coefficients in F, and if X is a matrix of n rows and one 


column with coefficients in L, then BX (= (bu) (2s)) = ( È bum) is a 
matrix of n rows and one column with coefficients in L. 

If A and B are two matrices of n rows and n columns, both with coeffi- 
cients in F, and if X is a matrix of n rows and one column with coefficients 
in L, then one verifies readily that 


A(BX) — (AB). 


If in particular # is the unit-matrix in F, then EX == 
It is now possible to write the system of linear cae 


n % 
(+) Z buts — o for t= 1, ``, n, ba in F, c in L, 


in the matrix form: (bix) (2x) — (cs). The solutions sẹ of (+) should be 
looked for in L. If in particular the matrix (b) — B is non-singular, i. e. 
if the determinant of B is different from 0, then there exists the inverse 
matrix B- to B; and the system (+) of linear equations has one and only 
one system of solutions a, in L, since (bix) (tx) == (cs) if, and only if, 


B> (c1) — B (dix) (ax) = E (2x) = (ax). 


6. Since the Galois Theory of finite groups of automorphisms is fully 
developed, it is possible to derive stronger theorems in the case of finite groups 
of automorphisms than the theorems of section 8. 


THEOREM 6.1. Let T be a subset of the linear system L over the com- 
mulative field F. Then there exists a Gums group G of automorphisms of L 
so that 
(1) the identity is the only automorphism in G which induces the identity 
in F, : e 
(2) i T = (L,G) 
if, and only if, 
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(i) T is complete in L, 
(ii) Lis the direct product of F and T, 
(iii), F ts finite, normal and separable over Z(T). 


Proof. Assume first that the finite group G of automorphisms of L 
satisfies the conditions (1) and (2), and that T — (L,G). Then G and 6’ 
are isomorphic finite groups. Hence F is finite, normal and separable over 
(F,G’) by (4.1). 

Now let b,,- > <, On be a basis of F over (F,G’). Then G and G’ both 
contain n elements; and there exists an inverse M to the matrix (b7 ) where 
the row-index g’ runs over all the elements in G’.. If u is any element in L, 
then the system i 


(+) LS byl =w for g in G 
| 4=1 ` i 


of linear equations possesses one and only one system of solutions x, in L, 
namely—in matrix-notation— (s) == M (w). If h is any automorphism in 
G, then (2;*) satisfies 


À by Nr — uss for g in G. 
tı i A 


Since gh runs over all the elements in the group G when g takes all the values 
in G, it follows that the x,* are another solution of (+); and since (+) 
possesses but one solution it follows that z; = z; for every h in G so that 
the z, are actually elements in T == (L,G). This implies in particular that 
L — FT.—It one applies this result concerning (+) on u= 0, then it follows 


that the b; are independent over T, since Š bit; = 0 with t in T implies that 
1 


the equations > bist, = 0 for g in G are satisfied, and since the only solutions 


of these. cos are i — 0. 

Since L = FT, T 0; and it follows from (2.2) that Z(T) — (F,G’), 
that T is complete in L, wd therefore from the results of the first paragraph 
of the proof that L is the direct product of F and T. The conditions (i) 
to (iii) are therefore satisfied by T. : 

` Assüme conversely that the conditions (i) to (iii) are satisfied by T. 
Then it follows from (4.38) that Z(T) = (F, (Z (T) < F)) and it follows 
ao from Theorem 8.1 that (T < L) satisfies condition (1) and that 

= (L, (T < L)). ‘Since (T < L) satisfies (1), (T < L) and (T < LY 
are isomorphic groups, so that (T <eL) is finite, since F is finite over Z (T), 
and since (T < LY is a subgroup of (Z(T) < F). Thus the existence of a 


566 7 REINHOLD BARR. : 


finite group G of automorphisms of L, ge a and C) is a con- 
sequence of (i) to (iii). 
An. alternative proof for this last inference may be given, & as REA sd 
_ proof does not use Theorem 3.1. If (i) to (iii) are satisfied by T, then it 
- follows from (4.3) that Z(T) = (F, (Z(T) < F)). Let b be an element 
‘in F go that the elements bs for g in (Z(T) <.F) form a basis of-F over Z(T) 
(cp. (4.5)1). The elements bs form a basis of L over T—by (ii)-—and it 
follows from Theorem 2.3 that (T < L) satisfies (1) and that {Z(T) <F) 
= (T<L) so that (T <L) =G is finite. If finally z is an element in 
(L, (T < L)); then there exist elements t(g) in T so that s= Dilg). 


Consequently £ = > KO) beH for every h in G; and this implies: that all the 
t(g) are equal oe a ‘fixed element ¢ in T so that T1 P bs’ — tz for 4. in 
. me ‘Hence ¢ is in T and consequently T= (L; (T < T. 


CororzarÝ 6.2. Suppose that the subset T of the linear system L over 
the commutative field F is complete in L, and that F ts finite, normal and 
separable over Z(T). . 


(a) If L is the direct product of F and T; then (T < i is a “finite group 
of automorphisms of L, the identity is the only T-automorphism of L which 
induces the identity. in F and T = (L, (T <L)).. 


(b) L is the direct product of F and T if, and only if, every Z(T)-auto- 
morphism of F ts induced by one and only one T-automorphism of L. 


Proof. (a) has already been verified in the proof of Theorem 6. 1.—That 
the condition of (b) is necessary, follows from Theorem 2.3. If on the other 
hand every Z (T)-automorphism of F is induced by one and only one T-auto- 
morphism of F, ‘then (T < LY = (Z(T) < F) and therefore Z(T) 
—(F,(Z(T) < F)) = (F, (T < LY) by (4.3). Hence it follows from 
Theorem 2: 3, (b), (c) that L is the direct product of F and T. 


THEOREM 6.8. If the identity is the only automorphism in the finite 
group G of automorphisms of the linear system L over the commutative field 
F which induces the identity in F, then G = ((L, G) < L). 


Proof. It is a consequence of Theorem 6.1 that (L, G) ‘is complete in 
L, that F is’ finite, normal and separable over Z((L,G)) and that L is the 
direct product of F and (L,G). . Hence it follows from Corollary. 6.2, (b) 
that every automorphism in (Z((L,G9) < F) is ‘induced by an’ (L, G)- 
automorphism of L so that ((L,G) <L)'=(Z((L6))<F), and it is a 
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consequence of (2.2) that Z((L,G)) = (F,6/). Now it follows from (4. 2) 
that G’ = ((F, CI FY = (Z((L, 6)) < F) = ((Z, G) <.L)'. : Since 
G” is finite, and since every Z ( (L, G))-automorphism of F is induced by one 
and only one (Z,G)-automorphism of L, this implies that G = ( (L,G) < L). 


| THEOREM 6.4. Suppose that L is a linear system over the commutative 
field F, that the subset T of L is complete in L, that L is the direct product 
of F and T, that F is finite, marae lie ewe over Z Fi > and that B. 48 
a set between T and L. 


(A) B= (L,(B<L)) if, and only i, there exists à ies E between zit) 
and F so that B= RT.. 


(B) f R is a field between Z (T) and F, then R =Z (RT). 


This is a consequence of Corollary 6.2 and of Theorem 8.4, since > every 
field Æ between Z(T) and F satisfies R= (F, (R < F)) by (4. 8). 


THEOREM 6. 5. bois that L, F, T and B satisfy the Le of 
Theorem 6. 4, and that Bis complete in L. Then B satisfies ` 


(a) (B<L)isa normal subgroup of (T < L), and 


(b) ` B=(L, (B< L)) 

if, and only if, | 

(i) B Be for every T-automorphism g of L, and | 

(it) i (F; (B < L)’) —Z(B). , 

_ Proof. Every Z(T)-automorphism of Z (B) is induced by some -auto- 
morphism of F, since Z(B) is between Z (T) and F, and since F is finite and 
normal over Z(T). Thus the above conditions (a) and (b) imply. the con- 
ditions (i) to (iii) of Theorem 3.5 and consequently the above conditions 
(i) and (ii).—If conversely the above conditions (i) and (ii) are satisfied, 
then (a) is a consequence of (Be < L) — g> (B < L)g. Since (T < L) is 
finite, (B < L) and (B < L)’ are both finite, and hence it follows from (ii) 
and (4.2) that 


(B < L) < (Z(B) < F) = ((F, (È < LY’) < F) = (B < L)' 


eee 


or. 

(B< LY = (4(B) < F). 
By (i) every T- + g of L induces a T-automorphism g* in the | 
linear system B over Z(B). If g* #1, then g’ is in (Z(B) < F) and there- 
fore in (B < LY so that g is in (B <.L), ie. g*—1. The group G* of 
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these automorphisms g* is finite, satisfies condition (1) and (2) of Theorem 

6.1, since (B, G*) == (L, (T <:L)) =T. Hence it follows from Theorem 

6. 1 that B — Z (B) T, and it follows from Theorem 6.4 that B == (L, (B < L)). 
The following theorem is some sort of a converse to Theorem 6. 3. 


THEOREM 6.6. Suppose that the linear system L over the commutative 
_ field F contains an infinity of elements, that G is a finite group of auto- 
morphisms of L and that (L,G) 540. Then G = ((L,G6) < L) if, and only 
if, the identity is the only automorphism in G which induces the identity in F. 


Proof. The sufficiency of. the condition is a consequence of Theorem 
6. 3.—Suppose now that the identity is not the only automorphism in G which 
induces the identity in F. Then denote by W the set of all the elements w in 
G so that w = 1. Clearly Wis a normal subgroup of G. . Let V = (L, W). 
Then Z (V) == F and the automorphisms in G induce in V a finite group G*. 
of automorphisms of the linear system V over F. This group G* is essentially 
the same as G/W. Since by the construction of V the identity is the only 
automorphism in G* which induces the identity in V, it follows from Theorem 
6.8 that G* = ((V,G*) < V) and it may be noted that (V, G*) = (L,G) 
=T, V = FT. 

Since W 41, Y < L. Since V is an admissible subgroup of the operator 
group L over F, there exists a basis B of L ovér F which contains a basis U 
of V over F. Clearly U < B. Now we distinguish two cases. 


Case 1. V contains an infinity of elements. Let d be some element in B 
that is not contained in U. If v is an element +40 in V, then an auto- 
morphism g == g(v) is defined by the conditions: g’ = 1, d5 == d -+ v, b= bs 
for b=4d in B. Each g(v) is a V-automorphism and therefore a T-auto- 
morphism of É. Since there exists an infinity of automorphisms g(v), it 
follows that (T < L) is infinite so that the finite group G is certainly smaller ' 
than ((L,G) < L). 


Case 2. Y ERT but a finite number ot elements. Then there exists 
in B an infinity of elements d which are not contained in V and therefore not 
in U, since it follows from the finiteness of V and from V =< 0, that the field 
F contains but a finite number of elements. Let v be some element = 0 in F. 
If d is an element in B that is not contained in U, then an automorphism ' 
h —h(d) is defined by the conditions: h’ = 1, d* == d + v, b == b* for b Hd 
in B. All these automorphisms are different. They are V-automorphisms and 
therefore they are T-automorphisms of I. Consequently (T < L) is infinite 
and therefore different from the finite group G. Hence we have proved that 
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G = ((L,6) < L) implies that the identity is the only. automorphism in G 
that induces the identity in F, provided L is infinite. 


Remark. If L is a finite system, then every group of automorphisms of 
L is finite. In this case—using the notations of the proof. of the preceding 
theorem—G == ((L,G@) < L) if, and only if, W = ( (L, W) < L). 


7. In tbis section a short discussion is given of possibilities of extending 
Theorem 6.4 and Theorem 6.5. The following assumptions will be made 
throughout this section. L is a linear system over the commutative field F; 
G is a finite group of automorphisms of L so that the automorphism g in G 
satisfies g’ = 1 if, and only if, g = 1; T — (L,G). Then we prove: 


There exists a set W between T and L so that. 
(a) | T<WSl, 
(b) | Z(T)=2(W), 


(c) Wis nets in L and Z(W) contains every element f in F to which 
there exists an element w >< 0 in W so that fw is in W 

if, and only tf, the order of G is greater than 2 and the rank of the 
operator group L over F is greater than 1. 


Proof. Let B be any basis of the operator group T over Z(T). Then 
it is a consequence of Theorems 1.2 and 6.1 that B is a basis of the operator 
group L over F. It is a consequence of Theorem 6. 1 and of (4.5) that there 
exists an element g in F so that the n elements q® for g in G form a basis of 
F over Z(T') ; and these elements form by Theorem 6.1 a basis of L over T. 

Suppose now that the above conditions are satisfied. Then there exist 
in B two different elements b, and bz and there exists in G an element v341 
so that gt is not in Z(T'). That this is possible is clear since 2 ge =q 2 qe} 


ig an element £0 in Z(T) and q is not in Z(T). The éléments 1, v do not 
exhaust G. Put w.— qb, + qb: and denote by W the set of all the elements: 
t+ ew for t in T and z in Z(T). If g is an element in G which is different 
from 1, then w8 = gb + gb, 0, since gg", and since b, ba are 
` independent over F. Hence TKW SL. 

It is obvious that W contains the sums of any two of its elements and that 
Z(T) <Z(W). Suppose now. that t + zw — r € 0 and that f be an element 
in F so that fr is in W. Then there exists an element s in T ind e an element h 
in Z (T) so that i 


8 
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s + hgb, + hab. = fr = fl + fegb, + fegr'bs 


(—) ft—s— (h—fz)qbs + (h—fa) qr be. 
Suppose now that f is not in Z (T). 


Case 1. ft—3—0, Then.it follows from the properties of T and from 
the fact that both s and £ are in T that t== 0. Hence r= zw and r 0 
implies z =£ 0. Consequently k — fz 5£0 and therefore gb, + gb; — 0, But 
this is impossible. ; 
Case. ft—s-Æ0, Then de 0. Suppose first that¢—0. Then 
we find for any element, g in G: 


(h — fz) (g8'b, + QUE Da) == — 8 = (h — fe) (gb: + gba), 
since s is in T and h, z are in Z(T). Then 
(h— fz) ge = (h—fz)q and (h—fe2)qr@ = (h — fe) g” 


and from h— fz40, it follows that h—frez— (h— fr) 0 so that 
gE g? = qe q or gi" = (g )e' for every g'in G’. Hence q1” is an element 
in Z (T) and this is impossible by our choice of v, so that t 540. 

Every element v in T has the form: s == > aa, ye for z(x, b) in Z(T). 


Then it follows from (—) that 


fz(t, b) —z(s, b) = 0 for b Æ b; 
fa(t, bı) —2(3, bı) RE (h — fz)q, 
fe(t, bz) — 4(s, bz) =. (h — fz) g”, 


since the elements b in B form a basis of the operator group L over F. Since 
f is not in Z (T), we find z(t, b) = 2(s,b) = 0 for b £b. Since ¢ 0, this 
implies that not both z(é,b;) are 0. Eliminating f from the remaining two 
equations we find that 


fle (t, 61) + 2q] = 2 (8, b1) + hg, 
flz(t, ba) + zg] = 2 (s, b2) + hq” and therefore 
qiz z(s, be) — h z(t, b2)] + g” [h z(t, b1) — z 2 (s, bi) ] 
== z(t, b1)z(8, b2) + z(t, b2)2(s, 01). 


Since the right side of this last equation is invariant under all the g’ in 6”, 
and since there are g’ in G’ which are différent from both 1 and v’, it follows 
now that ° 

22(8, bz) — k z(t, b:) and zz(s, bı) =h z(t, bı); 
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and consequently z3£0, since not both 2(t,0;) are 0. Since all the 2(t, b) 
and z(s,b) for b 4b; are 0, it follows that zs = ht and therefore we find 
from (—) that 


(h — fz) [qbi + gbe] = ft — s 24 (fa—h)t 


or that — zt == gb, + gb, since h — fz 340. But this is impossible, since 
zt belongs to T and w = qb, + qb. does not. Hence it is impossible that 
f is not in Z (T); and this shows that W satisfies (a) to (c). 

Assume now conversely that W is a domain between T and L which satis- 
fies (b) and (e). To prove the necessity of our conditions we have to discuss 
two cases: 


Case I. The order of G is =2. Since there is nothing to prove, if 
G—1, we suppose that G consists of two different elements 1 and g. If w 
is any element in W, then w == iq + t.g® with t, in T. Since T contains 
tæt,(g + gs), W contains w — t = (t,—1t,)q®. Since ta —+, is in T, and 
since g® does not belong to Z (T), it follows from (b), (c) that t, = t; so 
that w == 1 is an element in T, i.e. W = T. 


Case II. The rank of ‘the operator group L over F ig 1. Then let ¢ be 
any element s4 0 in T. If w is an element in W, then w = ft for some f in F 
and it follows from (c), (b) that f isin Z(T') so that w is in T, i. e. W == T. 


8. There exists a comparatively complete extension of the Galois Theory 
of finite groups of automorphisms (of commutative fields) to groups of auto- 
morphisms G which satisfy the condition : 

(F) The set of elements fs for g in G is finite for every f. The theory of 
these groups may be described as follows." 

(8.1) Let K be a subfield of the commutative field F. Then there exists a 
group G of automorphisms of F so that 

(1) the set f€ (of all the elements fs for g in G) is finite for every ele- 
ment f in F, 

(2) K = (P,G) 


if, and only if, F is algebraic, normal and separable over K. 
(8.2)If F is algebraic, normal and separable over its subfield K, then 


(a) conditions (1), (2) of (8.1) are satisfied by (K < F}; 
(b) F is algebraic, normal and separable over every field between K and F; 


T Cp. footnote *. 
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(e) every K-automorphism of a field between K and F is induced by.8 an 
automorphism of F; 


(d) every finite set of elements in F is contained in a feld between K and F 
that is finite, normal and separable oyer K. 


The groups G which satisfy condition (1) of (8.1) and G == ( (F,G) < F) 
may be characterized by a certain closure property which we need not state, 
as we are not going to make any use of it. The following result is however 
of some importance for us. 

(8.3) Every subgroup S of the group G of automorphisms of F which has 
the property (1) of (8.1) satisfies S — (CF, S) < F) if, and only if G is 
finite. 


9. We shall now develop the Galois Theory of groups of automorphisms 
of linear systems which are subject to the above-mentioned condition Fo in 
analogy.to the theories of sections 6 and 8. 


THEOREM 9.1. The subset T of the tines system L over the commuta- 
tive field F satisfies: 


(a) the identity is the only T-automorphism of L which induces the iden- 
„tity in F, 
(b) xT<D isa finite set for every element x in L, 


(c) ….. …. Tæ(L(T<L)) 
. if, and only if, i i 
(i) T is complete in L, 


(ii) F ts algebraic, normal and separable over Z (T), 
(iti) L ts the direct product of F and T. 


Proof. Suppose first that the conditions (a) to (c) are satisfied by T. 
Then it is a consequence of Theorem 8.1 that T is complete in L, that 
2(T) = (F, (Z(T) < F)) and that L is the direct product of F and T. 
There exists- therefore an element t 40 in T. If f is any element in F, then 
(ft) T<D = f(T<D't ig a finite set so that f(T<2)' is a finite set. Since L is 

. the direct product of F and T, it follows from Theorem 2.3 that (T < LY 
= (Z (T) < F) so that finally every set fZ <P for f in F is finite. Now 
it follows from (8.1) that F is algebraic, normal and separable over Z (T) 
so that (i) to (iii) are consequences of (a) to (c). f 

If conversely the conditions (i) to (fii) are satisfied, then it follows from 
(8.1) that Z (T) = (P, (Z(T) < F)) and it follows therefore from Theorem 
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< 3.1 that T = (L, (T < L)) and that the identity is the only T-automorphism 
of L which induces the identity in F. Furthermore it follows from (8.1) 
. that every set f(?<1)’ for f in F is finite, since by condition (iii) and Theorem 
2.3 we have (T < LY = (Z(T) < F). By (iii) there exist to every element 


æ in L elements t; in T, fi in F so that x = $ tifi Consequently z(T<D) is a 
1 


subset of the set Š hfi T< which is finite so that (a) to (c) are con- 
#1 
sequences of (i) to (iii). 


Taxorem 9.2. The group G of automorphisms of the linear system L 
over the commutative field F has the properties: 


(a) the identity is the only (L,G)-automorphism of L that induces the 
identity in F, and i 


(b) sL <L) is a finite set for every element x in L 
if, and only if, the following conditions are satisfied by G: 


(i) tf S is a normal subgroup of finite indez in G, and if S is the cross cut 
of G and ((L,S) < L), then every automorphism in G which induces the 
identity in Z((L,S)) belongs to S; 


(ii) 2© ts a finite set for every element x in L. 


Proof. It is clear that (ii) is a consequence of (b), since G< 
( (L,G) < L). If T= (L,G), then it is a consequence of (a), (b) and of 
(L,G) = (L, ( (L,G) < L)) that conditions (a) to (c) of Theorem 9. 1 are 
satisfied so that T is complete in L, F is algebraic, normal and separable over 
Z(T), and L is the direct product of F and T. If S is any subgroup of G and : 
B = (L, S), then B = (L, (B < L)) and it follows from Theorem 3.4 that. 
B is complete in L and that B= Z(B)T. If g is an automorphism in G 
so that g’ is a Z(B)-automorphism, then g is a T-automorphism and therefore 
a B-automorphism of L so that g belongs to the cross-cut of G and 
((L,S) < L); and this contains (i) as a special case. | 

Suppose now that conditions (i) and (ii) are satisfied by the group G. 
If u is any element 3£0 in L, then denote by S the set of all those auto- 
morphisms in G which leave every element in the finite set uf invariant. Then 
S is a normal subgroup of finite index in G; and G/S is essentially the finite 
(transitive) group of permutations which G induces in uë. Since U = (L,S) 
contains u&, it follows that S is the cross-cut of G and (U < L)—an auto- 
morphism, leaving every element Sn U invariant, has in particular every 
element in uf as a fixed element—so that the conditions of (i) are satisfied 
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by S. Hence S contains every automorphism in G which induces the identity 
in Z((L,S)) —Z(U)—Since U 4 0—for U contains u4 0—it follows 


from (2.2) that U is complete in L and that Z (U) — (F, S’). G induces 


in the linear system U over Z (U) a group G* of automorphisms, since S is a 
normal subgroup of G, since (L, S)€ == (L, g4Sg), and since therefore every 
automorphism g in G induces an automorphism g* in U. If g is in G, and 
if g* == 1; then g-is in the cross-cut of G and (U < L) and therefore in 'S. 
The group G* of all the automorphisms g* for g in G is therefore essentially 
the same as G/S so that G* is in particular a finite group of automorphisms 
of the linear system U over Z (U). If g*’== 1, then g’ leaves every element 


in Z(U) invariant so that—as has been remarked before—g belongs to S, i. e. 


g*—1. Finally T = (L, G) SU — (L, S) and therefore (U, G*) = (L, G) 
== T. Hence it follows from Theorem 6.1 that T is complete in U, U is the 
direct product of Z(U) and T and Z ee is finite, normal and separable 


` over Z(T). 


Special consequences of this last result—as applied to every u in L—are 
that T is complete in L and that L = FT; and this implies that (8) holds 
true. ? | | 

If ¢540 is an element in T, f any element in F, then (ft)© _ 704 is a 
finite set of elements in L; and consequently f@ is a finite set of elements in F. 
Since Z(T) = (F,G’) by (2.2), it follows from (8.1) that F is algebraic, 
normal and separable over Z(T) so that Z (T) == (F, (Z(T) < F)). 

Suppose now that the elements b,,---,b, in F are independent over 
Z(T). Denote by S the set of all the automorphisms g in G so that g’ leaves 
every element in every set b,° invariant. Since all the sets f’ for f in F are 
finite, it follows that S satisfies all the conditions of (i). Hence it follows 
from what has been proved in the second paragraph of the proof that (L, S) 
is the direct product of Z((L, S)) and T, and Z((L,S)) = (F,S’) con- 
tains all the elements b. Since T = (L,G) — ((L,S),G*)—in the nota- 


‘ tion of the second paragraph of the proof—this implies that the b, are 


independent over T too. Hence L is the direct product of F and T and now 
it follows from Theorem 9.1 that the condition (b) of our theorem is satis- 
fied by. G.$ 

Combining the results of Theorems 9.1 and 9.2 we find the ee 
corollary which takes the ne of Theorem 6.1 in this section. 


#It might be worth noting that the zondiyfens (i) to (iii) of Theorem 9.1 have 
been derived here from the conditions (i) and (ii) .of this Theorem 9.2 without any 
recurrence to Theorem 3. 1. 
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COROLLARY 9.8. Let T be a subset of the linear system L over the field F. 
Then there exists a group G of automorphisms of L which satisfies condilions 


(i) and (ii) of Theorem 9.2 and which satisfies: 
T = (L,G) 


if, and only if, T is complete in L, F is algebraic, normal and separable over 
Z(T), and L is the direct product of F and T. 


COROLLARY 9.4. Suppose that the subset T of the linear system L over 
ihe commutative field F is complete in L, and ihat F is algebraic, normal and 
separable over Z (T). Then L is the direct product of F and T if, and only 
if, every Z(T)-automorphism of F is induced by one and only one T-auto- 
morphism of L. 


Proof. It is a consequence of Theorem 2.3, (a) that every 2(T)- 
automorphism of F is induced by one and only one T-automorphism of L, 
if only L is the direct product of F and T. Assume conversely that every 
Z(T)-automorphism of F is induced by one and only one T-automorphism 
of D. Then it is a consequence of Theorem 2.3, (b) that L == FT. Since 
(T < LY = (Z(T) <F), and since F is algebraic, normal and separable over 
Z(T), it follows from (8.1) that Z (T) = (F, (Z(T) < F))=(F,(T < LY) 
and consequently it follows now from Theorem 2.3, (e)'that every basis of 
the operator group T over Z(T) is a basis of the operator group L over F; 
and hence it follows from Theorem 1. 2 that L is the direct product of F and T. 


THEOREM. 9.5. A group G of automorphisms of the linear system L over 
the commutative field F such that «© is a finite set for every element x in L, 
and such that condition (i) of Theorem 9.2 is fulfilled by G, satisfies 


G = ((L,G) < L) if, and only if, 6 = ((F, G’) < F). 


Proof. It is a consequence of Theorem 9.2 and of the conditions im- 
posed on G, that the identity is the only (L, G)-automorphism of L which 
induces the identity in F and that every set æ((46)<2) ig finite for every x 
in L. Hence it follows from Theorem 9.1 that (L, G) = T is complete in L, 
that F is algebraic, normal and separable over Z (T) and that L is the direct 
product of F and T. It is a consequence of (2.2) that Z (T) = (F, C’), and 
it is a consequence of Theorem 2.3, (a) that every (F, G’)-automorphism of 
F is induced by one and only one (Z,G)-automorphism of L. Now our 
theorem is a consequence of Theorem 3. 2. 

It may finally be mentioned that Theorem 6.4 may be extended to our 


i 
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case with hardly any change. Another immediate consequence of the theorems 
of this section and of (8.3) is the following statement: 


Suppose that L is a linear system over the commutative field F, and that 
the group G of automorphisms of L satisfies: 


(a) a ts a finite set for every x in L; 

(b) if S ts a normal subgroup of finite index in G, and if S contains all the 
(L, S)-automorphisms in G, then .S contains every automorphism in G which 
induces a Z((L,S))-automorphism in F. 


Then every subgroup T of G satisfies T = ((L,T) < L) tf, and only if, 
| G is finite. ` | 


10. Applications of the theory of linear systems to the theory of rings 
and non-commutative fields shall be given in this section. If R is a ring, if the | 
commutative field F is part of the central of R, and if R and F have the same 
identity, then it is clearly possible to consider R as a linear system over F, 
_ since this only means restricting one’s attention to the addition in R and to 
the multiplication of elements in Æ by elements in F. 


Lemma 10.1. If the field F is contained in the central of the ring B, 
and if S is a subring of R which contains the unit-element of F and R, then 
it is necessary and sufficient for the completeness of S in the linear system R 
over F that the cross-cut of. F and S be a field.—If furthermore the linear ` 
system R over F is the direct product of F and S (in the sense of section 1), 
then every S-automorphism of the linear system R over F is at the same time 
an automorphism of the ring R. 


Proof. The first statement of the lemma is clear. If the linear system Æ 
over the field F is the direct product of F and of its subring 9, then let B be 
a basis of # over the, cross-cut Z(S) of S and F (Z(S) is a subfield of F). 
There exist to every element x in R uniquely determined elements s(z,6) in 
S—all of which with a finite number of exceptions are 0—so that x = 2, bs(zx,b). 


If g is an automorphism of the linear system R over F, then g applied on F 
alone is an automorphism of the field F. Suppose now that g is an S-auto- 
` morphism of the linear system # over F. Then 


(y LS, bde(x, b)s(, dr X bedes(x, b)s(y, d) = eye 


‘and this completes the proof. 
This lemma shows in particular that the automorphisms, constructed in 
Theorem 2.3, (a), are ring- promo: piasins i in our case. 
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As we are giving preference to the subfield F of the central of the ring R, 
we consider as automorphisms of R only such ring-automorphisms of R which 
map F upon itself, a hypothesis that will be satisfied for all the ring-auto- 
morphisms of R, in case we assume F to be the central of R. 

Consequently we use the following notation: If G ig a group of auto- 
morphisms of the ring R, then G’ is the group of automorphisms which the 
automorphisms in G induce in F. If T is a subset of R, then (T < R) 
consists of all those automorphisms of the ring R which map F upon itself and 
leave every element in T invariant. | 

Now it has to be remarked that it is impossible to make use of Theorem 
2.3, (b), since it may very well happen that there are no T-automorphisms 
1 of the ring È which induce the identity in F whereas there may exist 
T-automorphisms + 1 of the linear system R over F which induce the identity 
in F. On the other hand it is obvious that Theorem 2.3, (c) may be used, 
since ring-automorphisms are at the same time automorphisms of the linear 
system. 

Let now T te a ‘subring of R which contain the unit-element of F. No 
other subsets will be considered. Then an element f in F satisfies fT S T 
if, and only if, f is in T too; and for this reason we denote by Z(T) the 
cross-cut of F and T. T is complete in R if, and omg if, Z(T) is a sub- 
field of F. 

_ Now it is easy to derive the ROUE statements from Theorems 6.1 
and 6.3. 


Tagorem A. Suppose that the cross-cut Z(T) of the subring T of the 
ring R and of the subfield F of the central of the ring R is a subfield of F. 
Then there exists a finite group G of automorphisms of the ring R—all of 
which map F upon itself—such that the identity is the only F-automorphism 
in-G; and. such that T = (R,G) if, and only if, F is finite, normal and 
separable over Z (T) and the linear system R over F ts the direct product of 
T and F. 

THEOREM B. If G is a finite group of automorphisms of the ring R 
all of which map the subfield F of the central of R upon itself, and if the 
identity is the only F-automorphism in G, then 


G— ( (F£, G) < B). 


The statements we are going to derive from Corollary 9.3 and Theorem 
9: 5.concern.gioups G of aitomer poems of the ring R with the following 
properties: «°° = ‘ 


' (1) - F= Fe for every g in G; 
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(2) 2 is a finite set of elements for every x in À; 
(8) if S is a normal subgroup of finite index in G, and if S contains all 
those automorphisms in.G which leave all the elements in (R, S) invariant, 
then every Z ( (R, S))-automorphism in G belongs to S. 

Note that a finite group G satisfies these conditions, if its automorphisms 
map F upon itself, and if the identity is the only F-automorphism in G. 

THEOREM A’. Suppose that Fisa subfield of the central of the ring R, 
and that T is a subring of R whose cross-cut with F is a subfield Z (T) of F. 
Then there exists a group G of automorphisms of the ring R which satisfies 
the above conditions (1) to (3) so that 

T = (R,G)- 


tf, and only tf, F is algebraic, normal and separable over Z(T), and the linear 
system R over F ts the direct product of F and T. 


THEOREM B’. If the group G of automorphisms of the ring R satisfies 
conditions (1) to (3), then G’ = ((F,G’) < F) ts a necessary and sufficient 
condition for G = ((R,G) < R). 

The following. important and obvious consequence of Theorem A’ may' 
be stated for future reference. 


Lamma 10.2. Suppose that the central of the ring R is a field F, and 
that the group G of automorphisms of the ring R satisfies conditions (2) 
and (3). 

(a) F ts the centralizer of (R,G) in R. 
(b) Z((R6G)) ts ane central of (R, G). 


(b) is a consequence of (a); and (a) follows from the fact that by 
Theorem A’ the linear system F over F is equal to F (R, G), and that F is 
exactly the central of R. 


Finally it may be noted that the following statement may be derived 
from Theorem 8. 4. 


Taeorem C. Suppose that F is a subfield of the central of the ring R; 
and that the group G of automorphisms of the ring R Haa conditions 
(1) to (3). 

(a) The set B between (R,G) and R satisfies Bem (R, (B < R)) if, and 
only af, there exists a field S between Z ( (R, G)) and F so that B = 8 (R, G). 
(b) If S isa field between Z((R,G))°and F, then 8 = Z(S(R,G)). 


It is the main-objective of this section to show that in case of (non-- 
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commutative) fields. it is possible to prove an essentially stronger theorem 
than Theorem C. 


Lena 10.8. If the central F of the ring R is a field, tf G is a finite 
group of automorphisms of R such that the identity is the only automorphism 
in G which leaves every element in F invariant and such that (E,G) ts a 
subfield of R, if W is a ring between (R, G) and R whose cross-cut with F ts 
the same as the cross-cut of (R,G) and F, then W = (R,G). 


Note that every automorphism of À maps F upon itself, since F is the 
central of F, and that (R, G) though a field need not be a commutative field. 


Proof. It is a consequence of Theorem A that F is finite, normal and 
separable over the cross-cut Z (K) of K == (R,G) and F, and that the linear 
system À over the commutative field F is the direct product of F and K. 
There exists therefore by (4.5) an element b in F so that the elements bs for 
g in G form a basis of F over Z(K), since the elements in G induce in F 
an isomorphic group of automorphisms, and since (F,G)—Z(K). There 
exist furthermore to every element sz in Æ uniquely determined elements 
c(z,g) in K so that z = 2 cls, g)bs and + belongs to K if, and only if, 

gin 


all the elements ¢(z, g) are equal. 

Assume now that K < W. Then there exist in W elements which are 
not contained in K; and amongst these there is one, w, so that the number 
of coefficients c(x, g) 5&0 is as small as possible. Since w is not in W, w s£ 0 
and there exists an automorphism v in G so that c(w, v) 0. 

Let now ¢ be any element in K. Then 


twe(w,v)*—we(w,v)*t— > [tc(w, g)c(w, v)“*— o(w, g)c(w, v)*t] bs 
ging 
would be in W and the number of its coefficients 4 0 would be smaller than 


for w. Hence this element is in K so that all its coefficients are equal. Since 
at least one of these coefficients is 0, all the coefficients are 0 so that 


tc(w, g)c(w, v) = c(w, g)c(w, v*)t for every À in K, g in 6. 


Now it follows from Lemma 10.2 that z(w,g) = ¢(w,g)c(w,v) is an 

element in F, and since z(w, g) is an element in K, it is in the cross-cut Z (K) 

of K and F. Hence w= > 2(w,g)bec(w,t) = fe(w,v). Since w~0, 
ginc 


f is an element in the cross-cut of W and F; and it follows from our hypothesis 
that f isin Z(K). Thus w would be in K and this is a contradiction so that 
finaly W = K. 


THEOREM 10.4. IfGisa finite group of automorphisms of the (non- 
commutative) field Q such that the identity is the only automorphism in G 
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which leaves every element in the central F of Q invariant, then every ring R 
` between (Q,G) and Q whose cross-cut with F is a field satisfies: 


R= (Q, (R < Q)). | 
Note that every (Q, H) is a subfield of Q so that the condition imposed 
on R is necessary and sufficient and implies that F is a field. ; 


Proof. Denote by Z (R) the cross-cut of R and F. Z(E) is a subfield 
of F which contains the cross-cut Z(K) of K—(Q,G) and F. Put 

Z(R)K. Then KS&8< R; and it follows from Theorem A and 
Theorem C that Z (S8) = Z(R) and 8 = (Q, (9 < Q)), since F is finite, 
normal and separable over Z (K) and since therefore the field Z (R) between 
Z(K) and F satisfies: Z (R) — (F, (Z(E) < F)). This implies in par- 
ticular that 8 is a field. Since (S < Q) SG, it follows now from Penne 
‘10.3 that § = R and this completes the proof. - 


- Tarorem 10.5. If K is a subfield of the field Q such that the identity 
is the only K-automorphism of Q which leaves every element in the central 
F of Q invariant, such that the sets 2'X<® are finite for every element.a in 
Q, and K == (Q, (K < Q)), then every ring R between K and Q whose cross- 
, cut with F ts a field satisfies: 


R= (Q,(B<Q)). 


Proof. It is a consequence of Theorem 9.1 that F is algebraic, normal 
and separable over the cross-cut Z (K) of K and F, and that the linear system 
Q over F is the direct product of F and K i.e. the group (K < Q) satisfies 
the conditions (1) to (3). If the cross-cut Z(R) of the ring R between KE 
and Q is a field, then Z (F) is a field between Z(K) and F; and it follows 
from (8.1) and (8.2) that (F, (Z(E) <F))—Z(R). It is then a con- 
sequence of Theorem C that the domain S =Z (R) K satisfies: S ==(Q,(8 < Q)) 
and Z(S)==Z(R). Since (S < Q) = (K < Q) it follows that the identity 
is the only S-automorphism of Q which induces the identity in F and that 
every set z(#<9) js finite. Finally it is clear that S is a. field which is con- 
tained in R. 

Suppose now that u is any element in R. Denote by U the set of all the 
S-automorphisms of Q which leave all the elements in w'S<®) invariant. and 
put V == (Q, U). It is clear that U is a normal subgroup of finite index in 
(S < Q), that u< < V and that therefore U—((Q,U) <Q). It isa 
consequence of Theorem 9.2 that an S-automorphism of Q which induces the 
identity in R(V) leaves every element in V invariant. Thus (9 < Q) induces 
in the field V with central R(V) a finite group G of automorphiams so that 
the identity is the only automorphism in-G which induces the identity in 
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Z(V) and so that (V, G) — S. ‘Denote now by D the cross-cut of V and R. 
D is a ring between S and V which contains u and whose cross-cut with Z (V) 
is just Z(R) —Z(8). Hence it follows from Lemma 10.3 that D =S 80 
that in particular u is an clement in S. Hence S—R and this completes 
the proof. 

CHAPTER IIL Crossed Products.° 


11. The extension of the concept of crossed product we are going to 
give here concerns itself with a (not necessarily commutative) field Q whose 
central may be denoted by F and a group G of automorphisms of the aed Q 
which is subject to the following conditions: 


(I) Q ts the direct product of F and the subfield K = (Q,G) of Q; 


(IT) K — (Q, (E < Q)). 

Two important inferences of (1) may be stated at once. 
(1) The identity is the only F-automorphism in G. 
(I) F is the centralizer a K in Q; and the central a) K is tts cross-cut Z - 
with F. 


Given condition (I), one verifies. that (II) is s equivalent to the fol- 
lowing condition: 
(T) ` Z= (F, <P). 


It may be noted furthermore that a consequence of (I) is 


(I*) Every Z-automorphism of F is induced by one and only one K-auto- 
morphism of Q. ` 

Upon occasion we shall have to use the further restriction : 
(qm) = (K < Q). l 

In denoting by g’ the automorphism of F which is induced by g in Q, 
it is a consequence of (I*) that (III) is equivalent to the following assumption. 
(II) GC (Z<F) | | 

Conditions (I) to (III) are satisfied by all those finite groups G of 
automorphisms of Q whose only F-automorphism is the identity. —Conditions 
(I) and (II) are satisfied by the more general class of groups which satisfy 
the conditions (2), (8) stated in section 10. 


Now. we connect with every element g in G an indeterminate u(g) and 
consider the system QG of all the linear forms: 


Fore presentation of the classical theory of crossed products ep. e.g. H. Hasse, 
Transactions ‘of the American Mathematical Society, vol. 34 (1932), pp. 171-214. 
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2, u(8)4(e) 


where the g(g) are elements in Q all but a finite number of which are 0. 
It is clear how to add two such forms and how to multiply them by elements 
in Q [from the right]. 

In this linear system QG over Q a multiplication shall be defined which 
is subject to the following rules: 


(A) qu(g) —u(g)gs for q in Q and g in G; 
(E) if g and h are two elements in G, then there exists an element (g, h) 
in Q so that u(g)u(h) = u(gh) (g, h). 


The elements (g,h) are called a factor-set and the linear system QG 
enriched by this multiplication is termed the crossed-product 


(Q, G, (g, h)). 
(11.1) The multiplication in (Q, 6, (g,h)) is associative if, and only if, 
(i) every (g,h) is in F; 
(ii) (4, st) (8, t) — (rs, t) (r,s)! for r,s,t in G. 


Proof. Suppose first that the multiplication is associative. If g is any 
element in Q and g, h are elements in G, then 


u(gh) (g,h)q—u(g)u(h)g = q@)u(g)u(h) = qe u(gh) (g, h) 
= u(gh)q(g,h) or 
(g, h)g = q(g, h) for every q in Q so that (i) holds true. 
If furthermore r, s, t are three elements in G, then 


(rst) (r, st) (s,£) = u(r)u(st) (s,t) = u(r)u(s)u(t) 
—u(rs)(r,s)u(t) = u(rs)u(t) (r,s)! 
= u(rst) (rs, t) (r,8)* 
and this proves the necessity of (ii). 
If conversely (i) and (ii) are satisfied, then 


Eu(ra(r)[ Zu(s)b(s) Su(t)e(s)] 
= Z w(r)a(r) [u(s)b(s)u(t)o(e)] 
= 2 ulr)a(r) [u(a)u(e)o(s)e(2)) 
= u(r)a(r)u(st) (8, t)b(s)*c(t) 
— 2 u(r)u(at)a(r)#(s, t)b(s)'c(t) 
-= = u(rst) (r, af) (8, t)a(r)*b(s)*0(t) 
-> u(rst) (rs, t) (r, 8)'a(r)b(s)*c(t) 
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= F u(rs)u(e) (r, 8)'0(r)*0(2) (8) 

— F u(rs) (r,s) [u(t)a(r)0(s)e(0)] 
= X [ulr)u(s) ]u(e)a(r)0(s)0()] 
= 2 [u(r)u(eya(r)(s)]lue(e)] 
= 3 [u(ra(r)u(s)è(s)][u(2)0(0)] 
= [E u(ra(r) E u(e)d(s)] Zuel) 


and this completes the proof. 

This statement explains why we have to and are going to restrict our- 
selves to the consideration of factor-sets which satisfy the above conditions 
(i) and (ii). | 

As one verifies easily that the element w(1) (1,1) is the unit element 
in (Q, G, (g,h)), we may assume without loss of generality that 


(Gi) (g,1) = (1,h) 1 or u(1) = 1. . 
Finally one verifies that u(g) (g,g?)"1 is the inverse to u(g). 


12. In this section we discuss the general structural properties of crossed 
products 


= (9,6, (g,h)) 


where Q is a field, G a group of automorphisms of Q which satisfies (I) and 
(IT), and where (g,h) is a factor-set of G in Q which satisfies (i) to (iil). 
(12.1) P is simple. 


Proof. Suppose that W is a two sided ideal +40 in P. Then ae 
exists among the elements w == $, u(g)q(w,g) 540 in W at least one such 
ginG 


that the number of g with q(w, g) 0 is as small as possible. Let v be such 
an element, and suppose that u is an automorphism with q(v,u) 40. If g 
is another automorphism in G, and if u + g, then it is a consequence of (I*) 
(in section 11) that uw’ 54g’ and there exists therefore an element f in F so 
that fe ><f*. Clearly fu — vf" = 2 uh) [ (f* — f#)q(v, h)] is an element 
in W; and it is 0, since the number of its coefficients =< 0 is smaller than for v. 
Hence in particular (f¢—f*)q(v, g) ==0 and this implies g(v,g) —0 for 
every g =u. This implies that u(u) itself is an element in W; and hence 
all the u(g) are in W, i.e. P = W. 

(12.2) (@,G, (g,h)) is the direct product of (F,G’,(g,h)) and K. 


This is an obvious consequence of condition (I) in section 11. 
An interesting consequence of this statement and of a well-known property 
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` of crossed- -products of commutative fields by finite groups of automorphisms 
may be stated separately. 


(12. 2*) If G is a finite group of. automorphisms of the field Q so that thé 
identity is the only central-automorphism in G, then (Q,G, (g, h) =1) ia 
full matriz-algebra over the field (Q, G). 


(12.3) An element w in P = (Q,G, (g,h)) satisfies wF == Fw if, and only 
tf, it has the form u(g)q for some g in G and q in Q. 


Proof. That elements of the form w—u(g)q satisfy wF = Fw, is 
obvious. If on the other hand w£ 0 satisfies wF == Fw, then there exists to 
every elementf in Fan elementf* in F so thatfw = wf*. If w == 2 u(g) q(w,8), 


then this implies that feq (w, g) = f*q(w, g) for every g in tA If u and v 
are two different automorphisms in G so that both g(w,u) and q(w, v) are 
different from 0, then this would imply that f° == f° for every f in P; and 
this is impossible by (I*) of section 11. Hence w == u(g)q. > 


(12.4) Q ts the centralizer of F in P. 


Tf w is an element, satisfying wf = fw for every f in F, then it follows 
from (12.8) that w = u(g)q and it follows from (I*) that g—1. 


(12.5) Q is uniquely determined as the greatest subfield of P which is con- 
tained in thé normalizer of F in P. 
Proof. Suppose that U is some subfield of the normalizer of F in P. 


_ Then it follows from (12.3) that every element in U has the form -u(g)q. 


If u(g)q and u(h)p are two elements in U.which are both different from 0, 
then their sum is in U and therefore of the one-term-form, i.e. g = h. For 
the same reason g = g?, ie. g—1 so that U.S Q. | a 
(12.6) Z is the central of P. | 


For elements of the central belong to Q by (12.4), hence to F. They 
belong to K and therefore to Z, since they permute with the elements “(g): 


(12.7) K is the centraliser of (P, C, (g, h)). 
This follows from (18. 4), since ‘the u(g) are in (F, G’, (g, h) ). 


13. It is a consequence of (12.3) that the normalizer of both the fields 
F and Q in P= (Q,6, (g, h)) is—apart from 0—the group, generated in 
adjoining the elements u(g) to Q. Denote the set of all the elements w in 
P which satisfy: wF — Fw—or wQ = Qu—by N; so that the elements 0 
in N form the group, we have described just now. 

Every automorphism of P maps. the central Z == (F,G’) upon itself. 
But there may exist automorphisms: of P which do not map F upon: itself. 
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If however an automorphism of P maps F upon itself, then it maps Q and N 
upon themselves. If conversely an automorphism of P maps N upon itself, 
then it follows from (12.5) that this automorphism maps Q upon itself; and 
if an automorphism maps Q upon itself, then F is mapped upon itself too, 
since F is the central of Q. In this section we shall investigate those auto- 
morphisms of P which map F, Q and N upon themselves. : 

If the automorphism r of P maps F, Q and N upon themselves, then 
r induces an automorphism r* in Q; and r*’—in the usual notation—is the 
automorphism which r and r* induce in F. Since r induces an automorphism 
in the group N* which maps the cross-cut Q* of Q and N* upon itself, it 
follows that r induces an automorphism in the quotient group N*/Q*, and 
since G and NW*/Q* are essentially the same, it follows that r induces an 
automorphism r” in G, and that | 


(a) u(g)"—=u(g")r(g) with r(g) in Qt. 
Applying now r upon condition (A) of section 11, we find that 


u(gr”)grer(g) = g*u(g")r(g) = gue)" = (qu(g))" 
= (u(g) gt)’ == u(g”)r(g) qe" 
or , 
(b) g"r(g) =r(g)gr" for q in Q and g in G 
or what amounts to the same | 


(D) q= r(g) g erg). 
Applying the automoiphism r upon condition (F) of section 11, and in 
‘using conditions (i), (ii) of (11.1), it follows that 


(g, h)” = [u(gh)*u(g)u(h) — r (gh)*u(g"h”) u(gr")r(g)u(h’)r(h) 
= r(ghħ)>u(g" h”) u(g" )u(h)r(g)¥"r(h) 
== r(ghħh)>(g”, h” )r(g)¥"r(h) or 
(c) (g, h)” (gr", h”)> r(gh) *7(g)*"r(h). for g,h in G. 
_ Thus we have seen that every automorphism r of P which maps F, Q 
and Ÿ upon themselves induces an automorphism r* of Q, an automorphism 
r” of G and—by (a)—a Q-valued function r(g) of the elements in G; and 
it is obvious that r is uniquely determined by r*, r” and r(g). 

THEOREM 13.1, Suppose that r* is an automorphism of Q, r” an auto- 
morphism of G, and that r(g) is a Q-valued function of the elements g in G. 
Then there exists an automorphism r of P which induces r* in Q, r” in G and 
satisfies (a) if, and only if, r*, r” and r(g) obey the rules (b), (e). 

Proof. The necessity of these conditions has already been verified.— 
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Thus assume that (b) and (c) are satisfied by r*, r” and r(g). If x is any 
element in P, then there exist uniquely determined elements g(x, g)—all of 
which with a finite number of exceptions are 0—so that 


r= © u(g)q(z, g). 
gmg 

A transformation r of P may be defined by 
(a) or Z'u(g”)r(g)g(z, 8)" 

ring 
and this transformation satisfies clearly (a), induces r* in Q, since u(1) = 1; 
and it will be clear that r induces r” in G, as soon as we have proved that r 
ig an automorphism of P. This transformation is a one-one-correspondence, 
mapping P upon the whole set P, since the equation y" == x possesses one and 
only one solution y in P, namely the element y with the coefficients q(y, g) 


= (g) "g(x, g")r*". That r preserves addition is clear; that it preserves 
multiplication is verified as follows: i 


(ay) — L2u(e)g(s g)u(h)g(y, h)] [2 u(g)u(h)a(2, 8) *a (y, h)]" 
= [ S u(gh) (gh)q(x;8)*g(y,hk)]" 
= Suek )r (eh) (g, h)g (z, g)™*q (y, h)” 
— Sue" “)u (he) (87, he") (gh) (g, h)g (z, 8)" a(y, h)™ 
=S ulg Ju(h”)r(g)*"r(h)g (x, g)" qty, h)" 
= $ u(g”)r(g)u(h”)r(h)q(z,g)™*g) (y, h)" 
=al” \r(gju(h”)q (z g) "™”r(k)g(y, h)” 
= 2 ular )r(g)q (a, g) u(h”)r(h)q(y, h)” = zy 
and this completes the proof. 
Restricting (b) to elements in F only we find 
(b”) = rg’ = gr" 
and in applying (c) on g = h = 1 we find that 
(e) r(1)=1 
and in applying (c) on h = g} we derive from (c’) that 
CO (8,87) (g, g) = r(e) r (8). 


14. In this section we add successively new hypotheses to those used in. 
the preceding sections. To the hypothesis that r*, r” and r(g) satisfy the 
conditions (b), (c) of section 13 we addefirst: 


(1) r* is an element in GC’. 
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We note first that this assumption is certainly satisfied, whenever r* is a 
Z-automorphism. of Q and G satisfies condition (III) of section 11. 

From (1) it follows that there exists an HORS w in G so that 
© w == r. Then it follows from (b”) that 


g = rt gr = wgw = (wl gi)’ 
and this implies by (I*) of section 11 that 
(1°) g” == wig, 
Let now be 8*— r*w 4, Then it follows from (b) that 
g'er(g) = r(g)g" or 
a”) ger (gy = r(g)® qu. 
Now we add another hypothesis. 
(2) Every F-automorphism of Q is an inner automorphism of Q. 


It is known that this hypothesis is a consequence of the finiteness of Q ` 
over F, and that this hypothesis is not always satisfied. 


Since s* == 1, from (2) follows the existence of an element b in Q 
so that 


q” == b1qb for every q in Q. 
Ang this on (1”) we find l 
gt = [br (g) =b] qor (g) 0] 
and it is a consequence of the fact that F is the central of Q that 


br(g) = b= is an element in P. Hence there exists an element f(g) in F 
so that 


(2) (8) — eee f(g)t e 
and it is a consequence of (c) that this F-valued function f i satisfies 
(27) (g, h)” (wgw, w ho) — f(gh) f(g)" "ef (h). 


If the F-valued function f(g) satisfies condition (2”), then the identity- 
automorphism of Q together with the inner automorphism which is induced 
in G by w together with this function. f(g) satisfy the conditions of the 
Theorem 13.1 so that they are induced by an automorphism of P which is a 
Q-automorphism of P and therefore a central-automorphism of P. 

If we now add the final hypothesis that ` : 


(3) F-automorphisms of (F, C, (g, h)) are inner NEEL TN then it 
follows from the existence of an automorphism of P which leaves all the 
elements in Q invariant, induces in G the inner automorphism effected by w, 
and induces f(g) according to (a) of section 13, that there exists an element 
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in (F, G, (g, h)) which induces this automorphism. This element has by 
necessity the form u(w)f for f in F, and now it follows that $ 
_ u(wigw) f(g) = f u(w) u(g)u(w)f 
= fu (w) u (w>) u (w )u(g)u(w)f 
= f+ (10%, w) u (wgw) (107, gw) (g, w)f 
= u (wgw) fiee (w, ww) e (w, gw) (g, w) 
or 


(3) f(g) Pr (g/10) for (g/w) = (w, w) ge) (w, gw) (g. 10) 
l and the function r(g) has by (2°) and (3°) the form 
r(g) = (bef) ee {g/w). 


The most important special case of all these considerations may be stated 
separately. i 


` If the field Q is finite over its central F, if G is a finite group of auto- 

morphisms of Q so that the identity-is the only F-automorphism in G, if (g, h) 
is a factor-set of G in F which satisfies conditions (ii) and (ii) of section 11, 
if r* is an (F, G’)-automorphism of Q, r” an automorphism of G and 1(g) 
a Q-valued function of the elements in G so that 

g'er(g) —=r(g)g for q in Q, g in G, and 

(g, h)" (87, k”) = (gh) (8) "r(R) for g,h in G, 
ihen there exists an element w in G and an element v in Q so that 
w = rY, g” = wgw and 
ràg) = ve (g/w) 


where . 
(g/w) = (w>, w) (107, gw) (g, w). 
If we choose in particular the factor-set (g, h) = 1, (house see: 
If w is an automorphism in G and r(g) a valued function so that 
r(gh) =r (g)= =r (h), 
then there exists an element v in Q so that | 
l r(g) = vel, | 
Finally it ought to be mentioned that the element v induces in Q the same 


automorphism as rw. 
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THE NUMBER OF REPRESENTATIONS FUNCTION FOR BINARY 
QUADRATIC FORMS.* 


By NEWMAN A. HALL! : 


The problem of finding the number of representations of an arbitrary 
integer by a given binary quadratic form has yet to be solved in complete 
generality. In the two centuries that have followed the first general investi- 
gation by J. L. Lagrange * of any part of the problem, the investigations have 
proceeded in two directions. A great number of specific forms have been con- 
sidered individually for which more or less general solutions have been given. 
Again, certain general investigations have reduced the problem to more simple 
and direct questions. The early investigations of Dirichlet * and more recently 

those of Pall + are of this nature. 
| In the discussion to follow, we offer as a contribution to the general 
problem, the general explicit expressions for the number of representations 
function for all forms whose discriminant is such that there is a single class 
in each genus together with a specific example showing the numerical com- 
putation of the number of representations. 

We are concerned with binary quadratic forms designated by [a, b, c], 
of discriminant — A = b? — 4ac, and shall examine the form of the number 
of representation: function 


N[m = ac + bay | cy" 
this being the number of solutions in integers, x and y, of 


m = az? + bay + cy. 


As is customary only forms which are positive definite and whose coefficients 
have no common factors, i.e. are primitive, will be considered. | 
We shall base the investigation on the following theorem of Dirichlet: * 


* Received September 20, 1939. 

1 Presented to the American Mathematical Society September 6, “1938, cf. Bulletin 
of the American Mathematical Sootety, vol. 44 (1938), p. 488. : 

2J, L. Lagrange, “ Recherches d’Arithmetique,” Oeuvres, t. 3, pp. 693-785. 

2G. L. Dirichlet, Zahlentheorte, ed. 4 (1894), p. 229. | 

+G. Pall, Mathematisohe, Zeitschrift, si 36 (1032), p. 321-343. 

5 Dirichlet, loo. cit. 
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THEOBEM 1. Let m be positive and prime to A. The number of repre- 
sentations of m by all the reduced forms of discriminant — A 148 © = (— A/p) 


where w == 2, tf A> 4; w = 4, if A = 4; w= 6, if a= 3, and CR 
Kronecker’s symbol. 


There are quantities associated with a particular form, invariant in that 
they are equal for all integers represented by said form which separate the 
forms of given discriminant into genera which may or may not coincide with 
the several classes. These invariants, the so-called characters, are defined by 


THEOREM 2.° If pu peo, >`, pr are the distinct odd prime factors of A, 
then (n/pi) has the same value for all integers n prime to A, represented by 
a form [a,b,c] of discriminant — A. When A is even, À = — 4D, the same 
is true of 

ô == (— 1), if D==0 or 8 (mod 4) | 
e= (— 1), if D==0 or 2 (mod 8) 
be, if D==0 or 6 (mod 8). 


The set, Ci,C2,- * +,Cx, will stand for the characters belonging to a 
“certain discriminant, excluding ôe if both § and e are characters. The number 
` of these, A, will equal k, k + 1, or k + 2 according to the nature of the dis- 
criminant as indicated above. The notation, C;(n), is to represent the value 
of the character C; for n of the form representing n. 

All forms of ‘a given discriminant whose characters have the game value 
are said to form a genus. Since equivalent forms represent the same numbers, 
all forms in the same class are in the same genus. i 

When there is a single class in each genus we may proceed using these 
characters to give the explicit form for tbe number of representations func- 
tion for integers m prime to 2A. If [a, b, c] represents some integer s, Theorem 
2 states C;(m) == C,(s) as a necessary condition that m be represented at all 
by [a,b,c]. Since we assume a single class in each genus, each reduced form 
` has different values for the characters. Hence by Theorem 1 we obtain 


THEoREx 3. Let [a,b,c] be a form of discriminant — A < — 4 such 
that there is a single class of forms in each genus. If m is an integer prime 
to 2A 


Nm A bay + c°] 
= 57] ri IT CD + Ci(a)Ci(m)] = (—A/p). 


* L. E. Dickson, introduction to the Theory of Numbers, pp. 82, 87. 
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In order to extend this result to the number of representations of numbers 
not prime to the discriminant, there are required three auxiliäry theorems. 


Lemma 1. Let [a,b,c] be a form of discriminant — A. Let p be a prime 
such that eithér p° divides A and p > 2 or p—® and A=0 or 12 (mod 16). 
Then . 2. 5 

N[pm = az* + bzy + cy’, (m, p) = 1] =0 
and . 
N[p?m = 02° + bay + cy*] 
| —N [m =ar + biy + c’y?] 


where [a’,b’,c’] ts a form of discriminant — A/p? whose characters are re- 
spectively equal to the corresponding ones for [a, b, c]. 


Lema 2. Let [a,b,c] be a form of discriminant — A, and let the prime, 
p, divide A but not satisfy the conditions of Lemma 1. Then 


N[pm = az? + bay + cy] 
= N[m dr + bay + c'y°] 
where [a,b,c] is a form of discriminant — A whose charaélers are equal 
to the product of the corresponding characters for [a,b,c] and those for the 
form of discriminant — A representing p. 


Lemmas 1 and 2 are taken directly from theorems stated by G. Pall‘ 
with our added condition that there be a single class to a genus. 

IT [a, b, c] has a discriminant — A = — 3 (mod 8), then since 
A =b? — dac, a,b, and c must be odd; so that if ax? + bey + cy?==0 (mod 2), 
then 2° + ey -+ y°= 0 (mod 2). If z were odd, y could be neither odd nor 
even, thus 2 and y must both be even, s = 2, y = 2, and | 


av? + bay + oy? = 0 (mod 4). 
This proves | 


Lexa 8. Del [a,b,c] be a form of discriminant — A == — 3 (mod 8). 
Then | | 
N [2m = az? + bey + cy°, m odd] 
om N[m ar + bry + cy?], r even 
= 0, r odd. 


The number of representations function. In the theorems below giving 
the explicit form of the number of representations function for all cases where 
there is a single class of forms to g genus C,(s) is to stand for the value of 


TG. Pall, Mathematische Zeitschrift, loc. oit., pp. 331-332, Theorems 4 and 5. 
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the characters for the positive, primitive form [a,b,c] of discriminant — A, 
and we write F(m) = $, (—A/p), where the summation is taken over all 
a/m 


divisors u of m. 


THEOREM 4. If A= pi; pz; ` + p= 3 (mod 8), A> 3, where the pi 
are -distinct odd primes, . i | oe 


N[2*p.%- + pm = az? + bry + cy”, (m, 2A) = 1] ; 


ser = | T(t + i [Os (ps) 140: (s) Cs (m) JF (m) 

The powers of the odd primes, p4, in the number represented are reduced 
according to Lemma 2. Whence by Theorem 3 the statement follows. The 
even power 2A is required by Lemma 3. The factor multiplying F{m) occurs 
in this manner merely to associate the plus or minus one value of the char- 
- acters with the representation or non-representation according to Lemma 2. 

It has been shown previously ê that if A = 7 (mod 8), the only diserimi- . 
nants for which there is a single class to a genus are A=? and À == 15. 
These are included in 


THEOREM 5. If A pipe, Pi = 8 ór 7, po—=5 or 1, respectwely, 
N[2\p.p.%m — ar? + bry + cy’, (m, 24) —1] 


EED (1+ 104( 2.) 6104 (ps) 10s (s)Cs(m) JF (m). 


The powers of the odd primes, p4, in the number represented are reduced 
according to Lemma 2, while the power of two is reduced by use of results 
stated by Dickson ® on forms of discriminant =A and — 15. The theorem 


follows from Theorem 3. 
The only odd discriminants containing the square of a pame as a factor 
which have a single class to a genus are: 


A = 27, 75, 99, 147, 315.2 ` 





These are included in 


THEOREM 6. If A= p;*pops, where oo 5, 3,7, 3; Paps = 3, 3, 11, 3, 35; 
respectively, > : 


8 N. A. Hall, Mathematische Zeitschrift, si 44 (1938), p. 88. 
° Dickson, too. cit., pp. 81, 88. 
10 Hall, loo. ott. 
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N [2p pp, == az? + bry + cy", (m, 2A) = 1] 
a. s : | 
= ser LL 1 + TL [CpI (s)Cu(m)JF Cm), a = 0 


= 0, a = 1 | 
À 8 a k 
— ge LL E + II LG (p) ]004(s) Cc (pit-*m) JP (pttm), a = 2 


where w == 3 when pps = 3 and w= 1 otherwise, and C, is the character ` 
associated with py. 

The power of p, in the number represented is reduced according to 
Lemma 1, while those of pı and ps are reduced according to Lemma 2. The 
statement then follows from Theorem 3. The even power RA is required by 
Lemma 3. 


THEOREM 7. If A= pfp: +> p,s=4 or 8 aia 16), A44, where 
pı == 2, 0 = 2 or 8 and the remaining p; are distinct odd primes, 
N[ pit pots + ++ pm == ‘of + bey + cy?, (m, A) = 1] | 
= sill + i [Cs(py) ]4Cs(s)Ci(m) }F(m). 


The powers of the primes, p; in the number represented are reduced 
according to Lemma 2. Whence by Theorem 3 the statement follows. 

When A= 0 (mod 16), there is more than a single class to a genus unless 
A == 16n or 64n, n= 1, 3, 7, 15, or unless A == 32 (mod 64). The latter 
case.is included in i 


THEOREM 8. If A= pp." + -pr where pı —® and the remaining pe 
are distinct odd primes, 
NU pips - ae = at? + bay + cy’, (m, A) = 1] 
= riko fratrie, a= 


ae KE 
= js FE PT OOG), a 22 


where Cy is the er for forms of discriminant — A not a character for 
forms of discriminant — A/4. f 

The power of p, == 2 in the number represented is reduced according to 
Lemma 1 and those of the odd primes, p according to Lemma 2. The state- 


ment is then a consequence of Theorem 3. 
When A = 12 (mod 16) the only discriminants for which there is a single 


13 Hall, loc. cit. 
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class of forms to a genus are A = 12, 28, 60.17 These together with A = 16n 
and 64n, n == 3, 7, 15, and A= 8 are included in 


THEOoREM 9. If A= fp, pa, where 8 = 0, 2, 4, 6, and pape = 3, T, 15, 
N[2p,%p.%m = az’ + bey + cy?, (m, 2A) = 1] 


= pr I (1 + HL [00(ps) 9C (s) C (m) }F(m), AZ 6 
= +i [Ci (p1) IC (s)Ci(m)}F(m), À = 0 —2 
-5 Eha + Leue IOL Cut»), A= 0—4 


=z TE (1 TY [Ou (m)19G4(s)C(m))F(m), à == 0—6 
= 0, <b, voli 


where 
w =e 3, fipe=3, A even 
w= 0, Pıp: = 3,. À odd 
w= à — 0 -+ 1, pp: =? or 15 
and C,,: > -,Cy are in this case the characters for A == pipes Cra and Cmo, 


ihe additional characters for A = 16p:p. and 64p:p. respectively. 


The power of 2 in the number represented is reduced according to Lemma 
1, and those of p, and p, according to Lemma 2. These reductions together 
with the appropriate uses of Theorems 1, 3, and à provide the statement given. 
The even. discriminants A == 4, 16, 64 are included in 


THEOREM 10. If A==4, 
N[2¢m = ar? + bo (1,2) =1] 


= 4F (m). 
If A= 16, 
N[2¢m = az? + bay + cy”, (m, 2) = 1] 
= vF (m), 
v= 2, a= 0l; w= l, g=]; v= 4, a ZR. 
If A= 64, 


N [2am = az? + bay + cy°, (m, 2) = 1] 
= F[1 + 8 (8) (m) ] [1 + e(s)e(m)]F (m), a 0 
— oF (m) * 
o= 0, ome l, 3; -w = 2, a = 2; A T | 
The power of 2 in the number represented is reduced according to Lemma 1. 
Theorem 3 completes the statement. | 


12 Hall, loc. cit. 
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The only even discriminants containing as a factor the square of an odd 
prime which have a single class to a genus are: 
A — 36, 72, 100, 180, 288.1 


The number of representations function for these even discriminants is 
given by 


THEOREM 11. If A = 2°3?, 0 = 3 or 5, 

N [238m = ax? + bay + cy®, (m, A) = 1] 

“ell [1+ Cx(2)*0r(s)Ce(m)]F(m), B= 0 
= ges [E (1 + Cu(2)*Cu(s)Cc(3*m)]F(8*m), B= 

= 0, =i, or a =f — 4 

where Ca == (n/3). If A= 2p; p= 3 or 5, 

N [20pm — ax? + bay + og, (m, A) = 1] | 
=p ÉI [1+ CR) (8) O(m) JP (m), B—=0 


= 4F (pm), B22 
=0, B—1 


If A = 22- 32-5, | 
N [223857 m — A = bay + cy?, (m, A) = 1] 


-ml a TI [1 + 64(2)*C,(5)"Cu(s)Cx(m) JF (m), B= 0 


— pa LI (1 + Cu(2)900(5) 70 (9) Ci (87m) JE (35m), BE? 
—0, B—1 
where Cy = (n/3). 


The reductions, are again made according to Lemma 1 and 2, and the 
statements follow from Theorem 3. 

The numerical calculations for the number of representations will be 
aided by 


THEOREM 12. When there is a single clags of forms to a genus and A 18 
not divisible by the square of an odd prime, 2+, or 2°, 


F(m) = 3 (— S/n) == TT O(n). 


15 Hall, loc. cit. 
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When p?/A, p an odd prime, , 
h-1 
Ce) = TE Ga) 
where Cy == (n/p). f 


This theorem is directly evident from the definition of the characters 
given in Theorem 2 and from the law of quadratic reciprocity. 


Numerical computations. As indicated above Theorems 4 through 11 
give the explicit form of the number of representations function for all forms 
of discriminant with a single class to a genus. In specific cases the application 
of these results will require the knowledge of the characters for the form and 
the values these characters assume for forms representing various numbers 
and for different genera of the same discriminant. This information can be — 
readily calculated from the definitions given in Theorem 2. The author has 
prepared a table giving this data for all known discriminants having a single 
class to a genus.# This table lists all the reduced forms of given discriminant 
together with the several characters and the values they assume for the num- 
bers represented by each of the reduced forms. | 

As an illustration of the method consider the form, 2x° + 35°, of dis- 
criminant — 280 == — 2°- 5-7. The description of the characters as caleu- 
lated or read from the table referred to above can be presented compactly: oog 


280 = 23:5-7 e(n) (n/5)  (n/7) 
[1, 0, 70] -1 1 1 

` [2, 0, 35], 2 oe | 1 
[5, 0, 14], 5 =] 1 24 

[7, 0,10],7 ro “ei et 


The reduced forms of discriminant — 280 are: [1,’0, 70], [2, 0, 35], [5, 0, 14], 
[7, 0,10]. The prime factors of the discriminant, 2, 5, and 7, are represented 
by the last three of these respectively. There are three characters, e(n), (n/5), 
(n/?), which take on the values listed for numbers represented by the form 
opposite. The number of representations function is given for this case by 
Theorem 7. The function is accordingly: 

N[2%5% 74m == 22° 35y’, (m, 70) = 1] 


— 4[1 —(— 1) te (m) JL —(— 1) (0/5) J + (— 1) 2 (m/7)] Z (—280/x 
We have, furthermore, | 


UN, À. Hall, California Institute of Technology, Thesis (1938), pp. 104-116. 
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e(m) = 1, m=1or? (mod 8) 
== —1, m= 3 or 5 (mod 8) 


(m/5) =. 1, ‘ms=1or4 (mod 3) 
=—1, m=2or38 (modi) 


(m/7) = 1, m=1,2,or4 (mod 7) 
==—1, m=3,5,or6 (mod 7). 


Hence we may separate integers, m, (m, 70), into residue classes, modulo 280, 
with the triplet e(m), (m/5), (m/7), identical in value for all integers in 
the class: 


. (m) (m/5) (m/7) 

1) els + + 1, 9, 39, 71, 79, 81, 121, 151, 

©. 169, 191, 239, 249. | 

2) _ a + 11, 29, 51, 99, 109, 141, 149, 
_ 179, 211, 219, 261, 221. 

3) + — 2° E 23, 57, 113, 127, 177, 183, 198, 
| 207, 283, 247, 263, 137. 

J - 8%, 58, 67, 93, 107, 123, 163, 
197, 258, 267, 277, 43. 


2 = 


5) + + — 31, 41, 111, 89, 129, 159, 199, 
E | 201, 209, 241, 271; 279. 
. 6) — > + — 19, 59, 61, 69, 101, 131, 139, 
171, 181, 229, 251, 269. 
7) + — = 17, 33, 47, 78, 87, 97, 103, 153,. 
: 223, 257, 143, 167. 
8) == Le Zi à 3, 18, 27, 83, 117, 157, 173, 


_ 187, 213, 227, 237, 243. 


According to the parity of &ı, a, a; we have the four cases: 


+a ata, ata — CA az A 
a) _ even even even _! even even even 
odd odd odd 
b) odd odd even odd ` even even 
even  odd odd 
¢) odd even — odd even odd even 


odd even odd 
d) even odd odd e odd odd even 
| i even ‘ even odd 
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Applying these results to the number of representations function as given 
above, it is possible to state further: 


. N[2258 72m = 22? + 35y°, (m, 70) = 1] 
— 23 (—260/n) 


when &,, as, as and m are paired according to the divisions above: a), 4); 
b), 1);'¢), 7); d), 6); otherwise the number of representations is zero. 
According to Theorem- 12, 


F (m) = 2° (a) (4/8) (a/7) = = = 280/x). 


Hence F (m) is equal to the excess of the divisors of m in classes 1), 4), 6), 7). 
over those in classes 2), 3), 5), 8). 
The formula may be illustrated further by verifying the number of repre- 
peninHons of 23902. If 
23902 = 22? + B5y?, 


the empirically obtained solutions are: æ= +11, y= +26; c= +59, 
y = + 22; r= + 101, y = + 10; z= + 109, y = + 2; with all choices of 
sign permissible. Hence the number of representations is 16. To check this 
with our formula, we observe that 23902 == 2 : 17 : 19 - 37, hence | 


N[23902 == 22? + 354°] 
= N[2: 17: 19+ BY = 2r + 35y°] 
= 2 È e(u)(u/5) (x/7). 
u/11961 


Since, referring to the previous notation, 17-19-37 — 11951==191 (mod 280), 
we have case b), 1). Furthermore, for each of the prime factors of 11951, 
e(u) (4/5) (2/7) is seen by reference to 4), 6), and 7) to be + 1. Hence the 
same is true for all factors. In all, 11951 has eight factors: 1; 17; 19; 37; 
17-19; 17-37; 19-37; 17-19-37. Thus, finally 


N [23902 = 22? + 35y°] = 2-8 = 16, 
to agree with the empirical result. 


QUEENS COLLEGE, 
FLUSHING, New YORK. 
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By Howanp CAMPAIGNE. 


1. Introduction. In 1934 F. Marty? and H. S. Wall * introduced in- 
dependently the notion of hypergroup. They both used this term for a system 
which may in particular be a group, but in which the product of two elements 
is in general a set of elements of the system. Both writers discussed partition 
hypergroups in which the elements are sets of elements of a group. The 
question arises as to whether or not every hypergroup can be represented in 
this way as a partition hypergroup obtained from a group. Marty offered 
the conjecture that, the answer is in the affirmative. Wall gave an example 
of a hypergroup which cannot be so represented by means of the special con- 
jugation which he considered. It is shown in the present paper (section 6) 
that it is not possible to represent this hypergroup by.means of any conjugation 
whatsoever among the elements of a group. 

_ The second main result obtained is a characterization of simple groups 
in terms of a partition hypergroup (section 9). It is shown that a group G 
is simple if and only if a certain partition hypergroup contains no proper 
sub-hypergroups except the identity group. This partition hypergroup is 
obtained by means of a conjugation among the elements of G depending on 
its group of inner automorphisms, and the proof depends on the study of the 
lattices of this hypergroup.. | | 

Sections 3, 4, 5 treat of partition hypergroups, the mapping of one hyper- 
group upon another, semi-regular, regular, and commutative hypergroups, 
respectively. In section 7 there are considered examples of conjugations. 

An analogue of the direct product of groups is the subject of section 8. 
Hypergroups which are products of two hypergroups are completely char- 
acterized. Many of the ideas of this section are generalizations of ideas in 
R. Remak’s papers (there cited). 


* Received August 4, 1988; Revised February 21, 1940. 

1 Presented to the Society April 8, 1938. 

3 F. Marty, “Sur une généralisation de la notion de groupe,” Sartryok ur För- 
handlingar vid Attonde Skandinaviska Matematikerkongressen i Stockholm (1934), 
pp. 46-49. | 

2H. 8. Wall, “Hypergroups,” Bulletin of the American. Mathematical Society, 
vol. 41 (1935), p. 36. [Presented at the Énnual meeting of the American Mathematical 
Society, Pittsburgh, December 27-31, 1934.]' 
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2. Definition of a hypergroup. We consider a system H of elements 
4, 6,c,- - - in which a product ub is defined for every pair of elements a, b of 
H. The product CD of two subsets C, D of H is defined as the set of all 
distinct elements of the products cd as c ranges over C and d over D. The 
system H is a hypergroup if it satisfies the following postulates.* 


I. If a and b are elements of H, then the product ab is a non-vacuous ` 

subeet of distinct elements of H. 

II. Ifa, b, c are elements of H then a(bc) = {ab)c. | 

III. There is in H at least one element e, called an identity, such that 
for every element b in H the products eb and be contain b. 

IV. There is at least one identity e in H such that if b is an arbitrary 
element of H there is in H at least one element b>, called an inverse of b 
relative to e, such that the sets b-b and bb contain e. 


It is easily shown that if5 aH, bH then there exist elements x, y in H 
such that baxr, beya. From the definition of a group it follows that if the 
products ab are all single element sets, then H is a group. | 

A subhypergroup K of H is a subset of H in which the postulates I to IV 
‘are satisfied with the law of multiplication of H. If KH and contains at 
least one identity of H, K is a proper subhypergroup of H. E 


3. Definition of conjugation. An equivalence relation, ~, in a hyper- 
group H is called a conjugation if when asli, beH, ceab, c’~c, then there 
exist elements a’, b’ in H such that a ~a, b’ ~b, dab. If a’ — a we shall 
say that a’ is conjugate to a. This relation is symmetric, reflexive, ‘and 
transitive. , 

If y is a conjugation in H, the distinct residue classes {a}, of elements 
conjugate to a form a hypergroup {H},, with respect to the law of multiplica- 
tion which requires that 


{c}ye{a}{B}y 


if and only if there exist elements a’, b’ in H conjugate to a and b, respectively, 
such that cab”. It will be seen that postulates I to IV are satisfied by this 
system. If e is an identity of H then {e}, is obviously an identity of {H},;. 


t This definition is somewhat different both from that of Wall and from that of 
: Marty. Walls definition (“Hypergroups,” American Journal of Mathematics, vol. 59 
(1937), pp. 77-98) differs only in one respect, namely, that he requires the product 
ab to have exactly n elements (not necessarily distinct) where n is a fixed integer 
greater than 0. The definition we have adopted agrees with that of the regular multi- 
graup of Dresher and Ore, “ Theory of multigroups,” American Journal of Mathematics, 
vol. 60 (1938), pp. 703-733. = 
*The symbol a A is read “a is an element of 4”. 
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if b-* is an inverse of b relative to e then {b-*}, is an inverse of {b}, relative 
to {e},. We shall call {H}, ue partition hypergroup of H relative to the 
conjugation y. 

By the remark near the end of section 2, {4}, is a group if and only if 
when a, b, a’, b’ are elements of H such that a~ a’, b~ 0’, cab, d.a’b’, then 
c— d. 

We shall say that a subset J of a een H is appropriate relative 
to a conjugation y in H if J contains with a all the conjugates of a relative 
to y. IfJ is an appropriate subhypergroup of H then y induces a conjugation 
yı in J, and it is easily seen that the partition hypergroup {J},, is a sub- 
hypergroup of {H},. Conversely, if K is a subhypergroup of {H}, then the 
set J of all the elements of Æ contained in the auiey classes of K is appro- 
priate relative to y. 


4. Mapping of one hypergroup upon another. We shall consider a 
mapping of a hypergroup À upon a AR A such that the following 
conditions are satisfied. 


(1) Each element a of A is mapped oe a uniquely determined element 
a of Y, in symbols a > a. 


(2) If af then there is at least one element a of A such that a—a. 
(8) If cab and cc, a — a, b — b then c,ab. 


(4) If af, BA and cab then there exist elements a, b, c in A such that 
a—a, b — b, c—c and cab. 


It follows from (2), (3) that an identity of A is mapped upon an identity 
of N. If a is an inverse of a relative to an identity 6, and a*—a,, a— a, 
e— e, then it follows from (2), (3) that aı == a™ is an inverse of a relative 
to e. an f 

If there exists a mapping of A upon YW satisfying conditions (1) to (4) 
we shall say that À is semi-isomorphic with À, in symbols A = Wf. In particu- 
lar À is isomorphic with W, A Æ Y, if the mapping satisfies (1), (3), and 
(4), and the following condition stronger than (2): — 


(2’) If aA then there is exactly one element a of A such that a — a. 


Isomorphism is reflexive, symmetric, and transitive. Semi-isomorphism 
is reflxive but not symmetric. It is easily seen to be transitive. In fact, if 
Pe), O02, p— 4, q> r, then, it will be seen that the mapping pr 
maps P upon À in such a way that the conditions (1) to (4) are satisfied, 
and therefore P = Q. 


10 


£ 
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THEOREM 4.1. If A and X are finite hypergroups, and A = A, A == A, 
then AxA. 


Proof. Let A,A be of orders u,v respectively. Since A = & it follows © 
that p = v, and since Y = A, psy. Hence p == y, and therefore 4 ~ A since 
the mapping must be one to one. : 

` The following theorem may readily be verified. 


THEOREM 4.2. Let {H}, be a partition hypergroup of H relative to the 
conjugation y. If H, then the mapping a — {a} is a semi-isomorphism, 
so that H = {H},. 


THEOREM 4.3. Let {H},, {H},, be partition hypergroups of H relative 
to conjugations yı, ya such that if a — b relative to`y, then a — b relative to 
ys Then there exists a conjugation ys in {H},, such that {{H}y}n = {Hho 

Proof. Let {a}, ~ {b},, when a~b relative to y. This defines a con- 
jugation ys in (H}y- The mapping {{a}n}n—> (a)n of {{H}m}r upon 
{H},, is seen to be an isomorphism. 

An automorphism of H is an isomorphic mapping of H upon itself, The 
‘set of automorphisms of H forms a group. There is one case in which a 
subgroup P of this group induces a group of automorphisms in a partition 
hypergroup {H}., namely, when the conjugation is préserved under the auto- 
morphisms of P. If an automorphism p of P maps a upon a’ we shall write 
a*®>a’. Suppose then that whenever peP, a*>a’, bib, a~b it follows 
that a’ ~b’. Definea mapping p’ of {H}. upon itself by letting {a}, 2 {a’}y 
when aa. This is clearly a one to one mapping of {H}, upon itself, and 
is easily seen to be an automorphism of {H}y. The set of these induced auto- 
marpinane forms a group Q. It may be shown that P = Q if when Pe, åH, 
a2b then a + b. 


5. Semi-regular, regular, and commutative hypergroups. A hyper- 
group À will be called semt-regular if it contains at least one element s, called 
a scalar, such that if a,A then as and sa are single element sets. The set of 
all scalars is called the nucleus of A. Wall® has shown that for his hyper- 
group the nucleus forms a subgroup of the hypergroup, and its identity is 
the only identity of the hypergroup. The same holds for the hypergroup 
here considered, and the proof given by Wall holds without modification. 


Tagorex 6.1. Let A, M be two hypergroups such that there is a semi- 
isomorphic mapping a->a of A upon A. Let E, B denote the subsets of 
elements of A mapped upon an identity e and.an arbitrary element b, respec- 
tively, of À. Then À is semi-regular if and only if 


€ Wall, loc. cit., in footnote 4, Theorem 4, p. 79: 
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(1) E is a subhypergroup of A, and 
(2) EB CB, BE CB for every b. 


Proof. Supposing that Y is semi-regular, we shall prove that (1) and 
(2) hold. Let a, b be elements of Æ, and cab. Then a— e, b — e and there- 
fore c — e = 6°, that is, cis in F. If aE and a> an inverse of a relative to an 
identity e (necessarily in Æ), then if at—> a, we have: ew, eco, == Q, = €, 
so that a.. This completes the proof of (1). To prove EB C B, let b —> b 
for every element b of B. Then if a.# and cab we must have cab = eb — b 
so that ¢.B. Similarly, BE C B. 

Conversely let (1) and (2) hold. To prove that À is semi-regular we 
shall show that e is a scalar. If ceb then, since A = Y, there exist elements 
c, a, b in A such that cc, 4E, b.B, and cab. Thus ce&B C B or c — b =c 
and therefore b == eb. Similarly, b == be. Since this holds for every b in % 
it follows that e is a scalar, as was to be proved. 

If A =% and À is semi-regular, then 


COROLLARY 5.1. The set N of all elements of À which are mapped upon 
elements of the nucleus of U is d subhypergroup of A. 


A semi-regular hypergroup is called regular if each element has a unique 
inverse with respect to the identity e and if e.ab implies e,ba. 


THEOREM 5.2. If H is a hypergroup such that eab implies that eba 
for every identity e, then a partition hypergroup {H }y is regular if and only tf: 


(1) the identities of H are all contained in a single class {e}y, and the 
set E of elements in this residue class is a subhypergroup of H; 

(2) tf B is the set of elements in any class {b}y, then HBC B and 
BEC B; 


(3) the inverses relative to all identities of the elements in any class {a}y 
are all in one and the same class {a}. 


Proof. By Theorem 4.2 the mapping a-> {a}, is a semi-isomorphism 
of H upon {H}y. Hence by Theorem 5.1 the conditions (1) and (2) are 
necessary for the regularity of {H}y. Condition (3) is also necessary. For 
if {a}y{H}, then when {H}y is regular {at}, must be the unique inverse 
of {a}, and no other class can contain an inverse of an element in the 
class {a},. l 

Conditions (1) and (2) are sufficient for the semi-regularity of {H}y 
by Theorem 5.1. To prove that (3} implies the regularity of {H}, we must 
show that every element {a}, has a unique inverse. If {e}ye{a}y{b}y (so 
that by hypothesis {e}..{a},{b},), then there exist elements a’, b’ conjugate 
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to a and b such that 6,a’b’. Hence b’ is the inverse of a’, and therefore by (3), 
hy = (h = {br | 

A hypergroup is commutative if ab — ba for every pair a, b of its ele- 
ments. It is easy to see that if A is commutative and A=W, then Y is 
commutative. In particular, a partition hypergroup of a commutative hyper- 
group is commutative. 


6. A regular commutative hypergroup which cannot be eii 
as a partition of a group. Such a hypergroup is given by the following table: 


8 b a 
“ele b -a 
b |b a e, b 
a ja e, b a, b. 


Wall” showed that this hypergroup cannot be represented as a partition hyper- 
group of a group relative to the special conjugation which he considered. We 
shall prove that this hypergroup has a property which no partition hypergroup 
of a gròup has, namely: it is not inversive. 

. À hypergroup is inversive 8 if when cab there exists an à identity e, and 
inverses ct, at, b-*, of c, a, b, such that c*-b*a*. Every group is necessarily 
-inversive. We shall prove that every partition hypergroup of a group is in- 
versive. More generally, we: have 


` Taeorkx 6.1. If À = and A ts inversive, then A is inversive. 


Proof. If cab, a, BA, and c — c, a — a, b > b, cab, then by hypothesis 
there exist inverses a+, b-t, c relative to an identity e such, that cba. 
Let ct, b — b, ata, Then «ba. Now e— e, where e is an 
identity of Y, and also at at, b — b>, c>, that is, a, =a’, 
by == b7, and ct, Thus cibtat, so that Y is inversive. | 

In the example above, aa? but a* y (a)*, so that the hypergroup is not 
inversive. We therefore have: 


THEOREM 6.2. There exists a regular commutative hypergroup which is 
not isomorphic with a partition hypergroup of a group. 


7. Examples of conjugations. Other writers ? have discussed a con- 
jugation of which the following is an immediate generalization. 


T Loo. oùt., p. 96. i 

3 This is less restrictive than Dresher and Ore’s reversible in itself. See the 
reference cited in footnote 4, p. 717. 

° Wall, loo. oit., pp. 92-93. Marty, “ Sur les groupes et hypergroupes attachés 
a une fraction rationnelle,” Annales de VHoole Normale Superieure (3), vol. 53 (1936) j 
pp- 83- 123. A generalization is mentioned by Dresher and Ore, p. 720. 


PARTITION HYPERGROUPS. ` 605 


He 1. Let H be semi-regular, and S, T subgroups of its nucleus. 
Let a ~ b if a — sbt where seS, KT. This defines a conjugation in H. Denote 
the partition hypergroup by {H; 9, T}. In particular, if H is a group, S an 
invariant subgroup, and T the identity group, then {H; 8, T} is the quotient 
group H/S. The partition hypergroup {H; S, T} is semi-regular if 8 = T. 


Example 2. Let H be commutative and inversive and contain an identity 
e such that each element has exactly one iriverse with respect to e. Let b~a 
if b—a or ba". This relation is a conjugation in H, and defines a 
partition hypergroup {H}. In this case H = {H} if and only if every ele- 
ment of H is self-inverse with respect to e. If H is an Abelian group then. 
{H} is a regular commutative hypergroup in which the produe of any two 
elements is a set of at most two elements. 


Example 3. We may define a conjugation in an arbitrary hypergroup H 
in terms of any subgroup P of the group of automorphisms of H. Inasmuch 
as the partition hypergroups obtained in this way play an important role in 
a subsequent result, we shall develop here some of their properties. 

The conjugation is defined as follows. Let a~ b if a is mapped on b by 
some automorphism of P. This relation is clearly a conjugation, and so 
defines a partition hypergroup of H which we shall denote by {H}p. By 
Theorem 4.8 we have at once: 


THEOREM 7.1. If Q is a subgroup of a group P of automorphisms of H, 
then there exists a conjugation y in {H}q such that {{H}o}, = {H}r. 


THEOREM 7.2. If H is semi-regular (regular) then {H}p ts semi- 
regular (regular). 


Proof. The identity e of a semi-regular hypergroup is mapped on itself 
by every automorphism, and therefore the class {¢}p contains only e. “Eyi: 
_ dently {e}p is a scalar of {H}p, 80 that {H}p is semi-regular. 

. If H is regular then we must show in addition that every element 
of {H}p has exactly one inverse, and that {6}p{a}p{b}p implies that 
{e}pe{b}p{a}r. If {e}re{a}r{b}p then there exist elements a’, b’ conjugate to ` 
a, b such that ¢,a’b’, and hence the class {b}p contains the inverse of an element 
of {a}e. ‘But if a is mapped on a’.by an automorphism pı, then a is mapped 
on a’ by p. Since =b ~b it then follows that a*~6, that is, 
{b}p == {a*}p. The regularity: of {H}p follows. 


THEOREM 7.3. If G isa group and P a group of. tis automorphisms, 
then {@}p ts a regular hypergroup, and is a.group tf and only tf P is the 
identity group. 
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Proof. The first part is a corollary to theorem 7.2. To prove the last 
part, let us suppose {G}p is a group if {a}pe{G}p then {a}p{a*}p = {e}p. 
Since {e}p contains but one element, we must have a’a7 == e if a’ ~ a, so that 
a’ = a, and hence P contains only the identity automorphism. The converse 
is obviously true. 


8. Product hypergroups.° The product A X B of two hypergroups is 
the set of all ordered couples a X b where a.A, bB, and where multiplication 
between couples is defined by agreeing that a X be(@ı X 01) (a2 X be) if 
Gcyz, b.b,b,. It is easily scen that 4 X B is a hypergroup. A subhypergroup 
H of A X B is called a sub-product of A and B if each element of A (and 
likewise each element of B) is represented in at least one couple a X b in H. 
A sub-product of A and B will be denoted by A X B. 

We shall begin by listing, without proof, £ some of the more obvious 
properties of products and sub-products. 


(1) 4 X B is a group if and only if A and B are groups. 
(2) AXB=ZBXA. 


(3) If A, is a subhypergroup of A, then A, X B is a subhypergroup 
of AX B. 


(4) If K is a subhypergroup of A X B, then there exist subhypergroups 
À, and B, of A and B such that K is a sub-product of A, and B. 


(5) A sub-product A X B is semi-regular only if A and B are semi- 
regular. The nucleus of AX B is a sub-product of subgroups of the nucleii 
of A and B. AX B is semi-regular (regular) if and only if A and B are 
semi-regular (regular), and its nucleus is M X N, where M and N are the 
nucleii of A and B. | 


If B contains an “idempotent” element b such that b* ==}, then it is 
evident that A X B contains a subhypergroup isomorphic with A, namely 
A X b. The converse is not necessarily so, as shown by the following example. 
Let Te be the hypergroup of w elements to, t,,:--,ts1, where for every 
A, p, v We have tyetyut,. Let-Z'co be a set of a countable number of elements 
ty with, multiplication similarly defined. TZ, has no subhypergroups, and 
no idempotent elements. Yet Tæ X Tu= To under. the correspondence 


1° See Robert Remak’s papers, “ Über minimale invariante Untergruppen in der 
Theorie der Endlichen Gruppen,” vol. 162 (1930), pp. 1-16, and “ tber die Darstellung 
der endlichen Gruppen als Untergruppen directer Produckte,” vol. 163 (1930), pp. 1-44 
of the Journal für Mathematik. 
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ty X ty tow’. Note that the product Tu X Ty = Tp has no subhyper- 
groups. The following theorem gives conditions under which the converse 
is true. 


THEOREM 8.1. Let every descending chain of subhypergroups A D A’ 
D 47 - - -of A be finite, and let A contain an identity e such that e is a 
finite set. If AX B contains a subhypergroup isomorphic with A, then B 
must contain an idempotent element. | 


Proof. Let Ko be a subhypergroup of A X B such that Ko ~ A. Then 
by (4) A, B contain subhypergroups A’, B’ such that Ky = A’ X B’, a sub- 
product of A’ and B’. If A’ — A the argument proceeds as in the next para- 
graph. If A’ 4A, then K, D K, =A’, since Ko = A. Therefore by (4), 
A’, B’ contain subhypergroups A”, B” such that K, = A” X B”. If A” 4’, 
then K, D K,= A”. Therefore by (4) A”, B” contain subhypergroups 4’”, 
B” such that K: == 4” X B”. Continuing in this way we get a descending 
chain of subhypergroups A D A’ D A” D- - - which must terminate. There- 
fore there is a » such that A®™Ð == A), Without loss of generality we can 
assume that A’ = A. 

Let a — a X b’ be corresponding elements under the isomorphism 
AmAX B’=K,. If a, is an identity in A then ay’ X by’ is an identity 
in Ko, and a and by’ are identities in A and B’ respectively. Let a1, Qs, * -, 
a,,* * + be the identities of A. Let an, Bn, yn be the numbers of elements 
in the sets a, by’*, and a. Thus ay, Bn, yn are positive integers (or in- 
finite) and an8n = yy There is a smallest yn, let it be yy Since 8, == yr 
we have ay. But a, is also among the integers yẹ since a, is an 
identity in A. Thus æ) = ya, and a, == y), so that B,==1. Since by is an 
identity, by’ 2 — by, and B has an idempotent element. 


COROLLARY 8.1. Let A X B satisfy the descending chain condition, and 
have an identity e such that e is a finite set. Then AX B contains sub- 
hypergroups A, and Bo, isomorphic with A and B respectively, if and only if 
A X B contains an idempotent element, the intersection of Ay and Bo. 


We next consider the question: when can a hypergroup be expressed as 
a product? In order to get an answer to our question we must first consider 
conjugations in product hypergroups, which can always be expressed in terms 
of conjugations in the factor hypergroups, according to the following theorem. 


THEOREM 8.2. If A and B are hypergroups with conjugations among 
their elements, then there ts a conjugation among the elements of À X B such 
that {A} X {B} = {A X B}.. Conversely, if there ts a conjugation among 
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the elements of À X B then there exist conjugations among the elements of A, 
and of B such that {AX B} = {A} X {B}. 


Proof. Let aXb~« Xb if, and only if, a~a’ and b~b’. The | 
mapping {a X b} — {a} X {b} is seen to be an isomorphism. 


COROLLARY 8.2. There erists a Se aie in AXB such that 
{AXB} ZA. Thai is, AX BA. 


Proof. In A let a~ a if a = a’, and in B let b~ b’ for every b and v. 


The following theorem is a direct generalization from standard group 
theory. : 


THEOREM 8.3. Necessary and sufficient conditions that a semi-regular 
hypergroup H be the product of hypergroups A and B are: 


(1) A and B are semi-regular and contained in H; 
(2) tf aA and bB then ab — ba ts a single element; 
(3) AB = H, and a,b, = azb: only if a, = a: and b, = de. 


Proof. To establish the sufficiency of these conditions consider A X B. 
Each element of H is uniquely representable as a product ab. The mapping. 
a X b— ab is seen to be an isomorphism. The necessity is easily seen. 


THEOREM 8.4. A necessary and sufficient condition that an arbitrary 
hypergroup H be isomorphic with the product of two hypergroups is that there 
be two conjugations y, and y. in H with the following properties. For every 
pair x and y in H there exists a unique element c such that c— x relative 
. yi and c— y relative to ye. If LaTi, and YsetfrYx, then CaeC102 Then 

= {4}, X {Hye | 


Proof of necessity. In ‘A X B define y, by a X b — a’ X b when a = a’. 
and y: by a X b~a X b’ when b = b’. - These conjugations on the con- 
ditions above, and {A X B}, = A, {A X B}y, = B. 


Proof of sufficiency. TA (Dn, X X {H}q, Since the classes {x}. 
and {y}, have just one element c in common, the pair {x},, X {y},, can be 
represented uniquely as {c}+, X {c} The mapping {c},, X {c}y —c is an, 
isomorphism of {H}., X {H},, with H. ‘ | 

We conclude with conditions under which the product is commutative 
or inversive. . 


C THEOREM 8.5. A necessary and suffmient condition that AXBbe com- 
‘mutative is that both A and B be commutative. 
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# 


THEOREA 8.6. À necessary and sufficient condition that A X B be in- 
versive is that both A and B be inversive. 


9. The lattices of a hypergroup. The structure of a hypergroup is 
clarified by studying its lattice of subsets. A lattice is defined as a set € of 
elements: A, B, C,- - : such that the following conditions are satisfied. 


( 1) For each pair of elements A and B in € there exist elements A v B 
and A B in G, called respectively the union and intersection of A and B. 


(2) These combinations are commutative, A v B=BvA and AnB 
= Bon A, i 


(3) They are dti as well, Av (Bu S (Av B) v C and 
An(BnC)=(AnB)rC. | 


(4) For each pair A and B we have Av (Ba A) = A= (AvB)AA, 


The intersection C^ D of two subsets C and D of a hypergroup H is the 
set of all elements common to the two. The union ™ Cw D is the set of all 
elements contained in products 2%: Tp, p any integer, where æ is an 
element of either C or D. The closed subsets of a hypergroup A form a 
lattice =? Y. 

If Y and 8 are ETA the set of all pairs of elements of X and 8 form 
a lattice W X B, their direct join. . If A is the lattice of the closed subsets 
of a hypergroup A, and 8 is that of the hypergroup B, what is the relation 
between À X B and Y X B? To answer this we define a plenary subset of 
A X B ag one which is the product H X J of its component sets. lt Hi XA 
and H. X J, are two plenary subsets of A X B then 


(Ai X di) v (Ha X di) = (H, v He) X (Jiv Je) ad 
(HiX di) * (Az X de) = (Hi^ Hs) X (Jia Je). 


The plenary subset H X J is closed under multiplication if and only if both 
H and J are closed. The closed plenary subsets of A X B form a lattice 
isomorphic with A X %8. 

We next consider the questions, what sublattices does the lattice of the 
hypergroup have, and when is the lattice of {H} among them? Theorem 9.1 
contributes to the answer of the first part of the quo and the next two 
theorems to the second part. 


31 Dresher and Ore, pp. 714, 715. 

32 Dresher and Ore, p. 715, Theorem 2. 

+ Garrett Birkhoff, “On , the combifiations of subalgebras,” Proceedings of the 
Cambridge Philosophical Society, vol..29 (1933), pp. 441-464, Theorem 18. 1. à 
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THEOREM 9.1. Let H be a regular inversive hypergroup. The set of 
proper subhypergroups of H forms a lattice. 


Proof. If J and K are proper subhypergroups of H then J v K is closed 
. under multiplication, and contains the identity. If b is in Ju K then b> is 
in Ju K, by the Lemma 1 following. Therefore Ju K is a proper sub- 

hypergroup. 


The common part of J and K, J^ K, is closed under multiplication, and 
contains the identity and the inverse of each of its elements. Therefore J a K 
is a proper subhypergroup. 

The operations of union and intersection are commutative and associative. 
Since the intersection of two proper subhypergroups is the largest contained 
in both, and the union the smallest containing both, J u (K ^J) == J and 
(ZuK)nJ—J. The following lemma then completes the proof. 


Lemma 1. If H is an inversive regular hypergroup, then cas - ` ‘an 
> implies that clap a7 + + at. 
7-1 


Proof by induction. Assume the conclusion valid for n == ? and 7==p—1. 
It then follows for y == p. If cas: ` ` Qu san then there is an element b in 
Q,f2°* * Qp- such that cbay, whence c-tas-tb1, Since baton, ‘ut, 
we have rap ae --at Thus Theorem 9.1 is proved. 


THEOREM 9.2, Let H be a hypergroup with a conjugation among tts 
elements such that {H} ts regular and inversive. The proper appropriate 
subhypergroups of H form a lattice isomorphic with that of the proper sub- 
hypergroups of {H}. 


Proof. Ti J and K are proper and appropriate in H then JaK is a 
proper appropriate subhypergroup, since it is closed under multiplication and 
the conjugation and contains all the identities and all the inverses of all its 
elements, as seen in Lemma 2. All the identities are in Ju K. By Theorem 
9.1 {J} v {K} is a proper subhypergroup of {H}, whence by Lemma 2 there 
is a proper appropriate subhypergroup I of H such that {I} = {J}. {K}. 
By Lemma 3, since {J} v {K} contains {J} and {K}, J contains Ju K. If 
4 is an element in J then {t} is in a product of the type {x.}{z2}- < - {an}, 
where ty is in J or K. This is only possible if there are elements zy’, in J 
if Zve], in K if £vK, such that 42,2. + - - ay’. Thus every element iel is in 
J~ K, that is, Ju K mm I, a proper appropriate subhypergroup. As before, 
Jv (KaJ) =J and (J v K) aJ =J, and the lattice postulates are satisfied. 
The isomorphism follows from the fornfulas; {J v K} = {J} v {K}, {Jo K} 
= {J} ^ {K}, which in turn follow from Lemmas 3 and 4. : 
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Lemma 2. Let H be a hypergroup with a conjugation among tts elements 
such that {H} is regular. For every proper subhypergroup K of {H} there 
exists a proper appropriate subhypergroup J of H such that {J} =K. J con- 
tains all identities of H, and all the inverses of all its elements. 


Proof. Let J be the set of all elements j which map upon the elements 
{3} of K. J is appropriate and closed under multiplication. If e is an 
identity in H then {e} is the identity of {H}, and therefore e is in J. If 
h is an inverse of A with respect to e then {h} is the inverse of {h} with 
respect to {e}. Therefore J contains with k all of its inverses. Thus J is 
proper and appropriate, and {J} = E. 


Leama 3. If H is a hypergroup with a conjugation among its elements, 
and if J and K are appropriate subsets of H, then J D K if and only if 
{J} > {K}. 


Proof. If JDK and {k}-{K}, then kK, and so in J, and therefore 
{k} {J}. Therefore {J} D {K}. If {J} D {K} and kK, then {k}.{K}, 
whence {K}.{J}, and so keJ. Therefore J > K. 


Lemma 4 The union of two appropriate subsets ts appropriate. 


Proof by induction. Let h be an element of the union Jv K of two 
appropriate subsets, Then k is contained in a product 2:72: * - £p, where 4» 
is in either J or K. Let h’~h. If p= 2 then there exist 2,’ ~ Ti, a’ Ze 
such that h’.2,’c.’. Suppose that for p==y— 1 there exist ty ~ £r 
n= 1,2, + -,v— 1, such that hs - a”. Then a similar statement 
holds for »==v. For hezits - æv implies that there is an element 
brie: + - x, such that h.bry. If kh’ ~ h then there exist b’ ~ b, zy ~ x, such 
that h’,b’s,’. By hypothesis there exist tn’ ~~ Tn, n == 1,2,° + :,y—1, such 
that b'at? -Eye Therefore h’.b’x,’ Cz,’ + -ay4/m’. Since J and K 
are appropriate we have Zye) implies £e), and @eK implies a,)/-K. Therefore 
J v is appropriate. The proof of Theorem 9.2 is now complete. 

Let G be a group with a conjugation among its elements. If H is a 
subset of {G} which is closed under multiplication, and if the set D in G 
which maps upon H is finite, then H is a subhypergroup of {@}. For D is 
closed under multiplication, and being finite, is an appropriate subgroup of G. 
Therefore H == {D} is a subhypergroup of {G}. 

If G is finite then the subhypergroups of {G} form a lattice @. For the 
closed subsets of {G} form a lattice,*and by the paragraph above each closed 
subset is a subhypergroup. Since, if D and F are appropriate subgroups of G, 
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{Dv F} = {D} v {F} and {DoF} —{D}{F}, the appropriate subgroups 
of G form a lattice isomorphic with ©, 

Let H be a regular hypergroup with a conjugation among its elements 
such that for every element b we have b+~b. Let J be a subset of {H} 
which is closed under multiplication, and K be the set of elements of H which 
map upon the elements of J. ‘Therefore K is closed under multiplication and 
the conjugation. With b it contains b, and so it contains e. Therefore K 
is an appropriate proper subhypergroup of H. ` 

If J and L are subhypergroups of H then {J v Fes {J} v {L} aa 
{J n L} m= {J}o{L}. Thus we have 


THEOREM 9.3. Let H be a regular hypergroup with a conjugation among 
tts elements such that for every b, b~~ b>. Then the subhypergroups of {H} 
form a lattice isomorphic with that of the appropriate subhypergroups of H. 


We conclude with a condition that a group G be simple. Let S be a 
subgroup of the automorphisms of G. Let b— c in G when b —c under an 
automorphism of S. Now {G}s is a finite regular inversive hypergroup, and 
the set of appropriate subgroups of Gis a lattice isomorphic with that of the 
proper subhypergroups of {G}s. A subgroup is appropriate if and only 
if it is characteristic under 5. If S is the group of inner automorphisms of G 
then the appropriate subgroups of G are the normal subgroups, andthe lattice 
of the proper subhypergroups of Ee) is a B-lattice.** We thus have the 
following theorem: 


TaxoRex 9.4. If S ts the group of inner automorphisms of the group G, 
then G is simple if and only if the partition hypergroup {G}s hus no proper 
subhypergroups except E, the identity group. 


UNIVERSITY oF MINNESOTA. 


1 Birkhoff, loo. cit., Section Il. 


ON THE ALMOST PERIODIC BEHAVIOR OF MULTIPLICATIVE 
NUMBER-THEORETICAL FUNCTIONS.* 


By E. R. van KAMPEN and AUREL WINTNER. 


The purpose of the present paper is to develop criteria for the almost 
periodic behavior (B>) of multiplicative number-theoretical functions. In. 
the particular case of what have been called strongly multiplicative functions, 
such criteria were recently! found for À — 1 and A= 2. However, the only 
representative of the classical number theoretical-functions in the class of 
strongly multiplicative functions is p(n)/n, where is Eulers function. 
Thus, there arises the question as to the possibility of a, corresponding theory 
in the general case. 

It will turn out that such a theory can be developed, although the situa- 
tion then is essentially more involved. In fact, already the question of the 
almost periodicity (B>) of the factor functions, which belong to each of the 
prime numbers, must be discussed. Correspondingly, the preservation of 
almost periodicity (B>) on multiplication of a finite number of such factor 
functions requires especial care. The limit process which leads to the given 
multiplicative function is formally more involved than, though in principle 
uot different from, the corresponding step in the strongly multiplicative case. 

The results to be: obtained may be illustrated by the sum, e(n), of the 
divisors of n. The result in this case will be that o(n)/n is almost periodic 
(B>) for arbitrarily large À and has the Fourier expansion 

oln) 6 © cm(n) 
a 2 


a 2: m? 


where the c’s denote the Ramanujan sums. But Ramanujan? has proved that 
l 6 © m(n). 
o(n) mare me ©. 


so that, if one divides by n, Ramanujan’s trigonometric series turns out to be 
the Fourier series of the function to which it converges. 
That Ramanujan’s results do not imply any almost periodic behavior may 


* Received November 15, 1939. 

2M. Kac, E. R. van Kampen and Awrel Wintner, “Ramanujan sums and almost 
periodic functions,” American Journal of Mathematios, vol. 62 (1940), pp. 107-114. 

78, Ramanujan, Colleoted Papers (Cambridge, 1927), pp. 179-199. 
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be illustrated by the following example: Ramanujan proves that if d(n) 
denotes the number of divisors of n, then 


ae 3 3 En 


Cm(n). 
But this convergent trigonometrical series cannot be the Fourier series (3) 
of the function, d(n), which it represents. In fact, 


= 3 d(m) ~ log n, n— ©, (Dirichlet) 
implies that the mean value of d(n) is +  ; so that d(n) cannot be almost 
periodic (B). 

Incidentally, the results to be obtained are, in contrast to the results of 
Ramanujan, independent of the prime number theorem. 

It should be mentioned that, while Theorems IV and VI below may, 
with straightforward modification of proof and wording, be transferred to the 
case where multiplicative functions are replaced by additive functions, essen- 
tial complications seem to arise in connection with the.corresponding analogue 
to Theorem V below (if As 1). 

By a function f(n) will be meant a sequence in which n runs through 
all positive integers. The average M{f} —M{f(n)} of an f is defined as the 
limit (n> co) of the arithmetical mean of the n numbers f(1),: ::,f(n), 
if this limit exists. And AÆ{f}— M{f(n)} will denote the upper limit 
(S+ ©) of this arithmetical mean, if f= 0. 

By a multiplicative function f(n) is meant á sequence , (1), f(2),° °° 
of numbers for which 


f(mn2) = f(n) f (nz) whenever (ni, ne) —1; hence, f(1) —1 


unless f(n) — 0 for every n (this trivial case will be excluded). 

If there exists a fixed prime number p* such that f(g”) == 1 for every k 
and for every prime number p distinct from p*, the function f(n) will be 
called a prime multiplicative function (belonging to the prime number p*). 
It is clear that if pn denotes the m-th prime number and fn(n) an arbitrary 
prime multiplicative function belonging to pm, then 


(1) f(n) = I fn(n) 


defines a multiplicative function f(m), À but a finite number of the factors 
of the infinite product being 1 for a fixed n. Conversely, every given multi- 
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plicative function f determines a unique sequence {fm} of prime multiplicative. 
functions fm by means of which f is representable in the form (1). In fact, 
(2) fm(%) =f (Pa) if pak|m but pafa; © (f(1) =1)- 


In what follows, g(n) will denote an arbitrary function which satisfies, 
for a fixed prime p, the requirement that 


(3) ` g(r) = g(pt) if pln but p Yn, 


Condition (3) is satisfied by every prime multiplicative function belonging 
to p, but not only by these functions; in fact, (3) is possible also when g(n) 
vanishes for n == 1, without vanishing for every n. 


THEOREM I. The average M{g} of a function (3) exists if and only tf 
the sertes 


(4) : 3 Pig (p*) is convergent, 
in which case . 
(6) MU) = (ps) À PP). 


Remark. It will be clear from the proof that (5) holds, if g = 0, also 
when the series (4) is divergent (in which case M{g} == + œ). 


pe Let a; denote the arithmetical mean of the p* numbers 
g(1),- - -,g(pt), where ¢ is an arbitrary non-negative ae and p the 
prime number belonging to g. Then, by (3), 


(6) i= (1— p) À pig (pH) +p glp’). 
Hence, for every t > 0, 
as—air—=p'(g(pt)—g(pt*)); do = g (1), 
and 80 ` , | 
(D ppt) rat À paa); (02.0). 


Suppose first that M{g} exists. Then, in particular, ` 
a,—> M{g}, and 80 ai — ai —> 0, as t—> œ. 


Consequently, application to (7) of a standard lemma concerning linear 
summation methods shows, that p>g(p*) —> 0 astro. Hence, (4) and 
(5) follow from (6). 

In order to prove the sufficiency of (4), for the existence of M {g}, let 
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n= 3 qpé where 0Eg<p; m= m(n), q= a(n), 
be the p-adic representation of an arbitrary n > 0. Then, by (3) and (6), 


= qipa; 
À it) = a. 


= 
n 


EmMa 


3 qip? 


‘It follows, therefore, from the standard lemma on linear summation Saethóds; 
used before, that in order to prove the existence of M{g}, it is sufficient to . 
assure that the sequence ai, @, &s, ` `: (which merely is a subsequence of the 
sequence defining M {g} ) has a limit 4 + o.° But (6) and the assumption (4) 
clearly imply the existence of lim a; (54 + œ) ; so that the proof is complete. 


ä \ 
Tasorem II. A function g which satisfies (3) (and so, in particular, 
a prime multiplicative function belonging to a prime p) is almost periodic 
> 
(BY) for a given positive (= 1) tf and only if ° 


(8) 3 tl g(P)P <a. 


(This implies, for À = 1, the curious fact that g(n) is almost periodic (B) 
whenever so is | g(n)|). 

It is understood that if à < 1 in Theorem IT (so that there is no Hélder- 
Minkowski inequality and, correspondingly, no natural metric in the B\.space), 
then M{g} need not exist, and so, in particular, g(n) need not have a Fourier 
expansion. 


Proof. If g(n) is almost periodic (BY), 80 is lial n)|; so that ot HT g |} 
exists. Consequently, application of Theorem I to the function | g(n) |> shows, 
that (8) is a necessary condition for the almost periodicity (B>) of g. 

In order to prove the converse, define, in terms of any given function g, 
for every positive integer j a function gf, by placing 


(9) gi(m) =g(n) if 1Z=n<pi, gi(n-+ pl) = gl(n) for every n. 


Then it is clear that (3) remains valid if one replaces g(n) by the non- 
negative function | g(n) —g/(n)|* of n for a fixed j. Hence, the Remark 
which follows Theorem I implies that i 


É co 
Wl 9—9' = Gp) & pl) — 9 (2) DP. 
A Pf : 
8 This result is closely related to a construction due to O. Toeplitz, “ Ein Beispiel . 


zur Theorie der Tastperiodisdhon Funktionen,” Mathematische Annalen, vol. 98 (1827), 
pp. 281-295. 
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On letting here j—-> co, one sees from (9) that, if (8) is satisfied, 
M{|g—gi |} — 0 as joo. 
Since every g/ is, by (9), a periodic function of n, it follows that g is almost 


periodic (B‘).- This completes the proof of Theorem II. 


THrorEM III. A function g(n) which satisfies (3) is almost periodic 
(B) if and only if 


sate 

(10) x p*| g(p*)| < œ, 
&=0 

in which case the Fourier expansion of g(n) ts 


(tf) g(n) ~ Mg} + È cp (n) à LP — oP") 
jot kj p 
where the constant term is 


(12) Mg} = (1—7) À p*9(p*), by (5), 


and the cp (n) denote the Ramanujan sums belonging to those indices m which 
are powers of p: 


(13) Gm (10) aor (ri En) 


== 3 COS 2 À n, where 1=k< m and (k, m) — 1. 
k 
In particular, g(n) is limit-periodic (grenzperiodisch), since the Fourier 
constants belonging to (circular) irrational frequencies all vanish. 

Remark, Assuming that (8) is satisfied for À == 2, one sees from (12) 
and (13) that the Parseval relation belonging to (11) is 


aa 3 (1—1) UGE 


ks p p 
1\ © en S a | $ I) gt)? 
= |[1—--) 3i 1 À (Pp ee à 
>) ro | mares (P p) ie} p” 
an identity which can, of course, be verified directly. 

Proof. Since (10) is the particular case À — 1 of the criterion (8) of 
Theorem II, only the explicit form (11) of the Fourier expansion needs a 
proof. To this end, one can readily verify from the definitions (12), (13) 
and (9), that (11) is certainly true if g is replaced by the periodic function 
g! (where j is arbitrarily fixed) ; inefact, (11) follows for g == g! by straight- 
forward trigonometrical interpolation. Since, by the proof of Theorem II, 
one has M{| g— g! |}—>0 as j->.0, it follows that (11) holds for any g. 


11 
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Since the preceding results concern an arbitrary function which satisfies 
(8), they are applicable to every prime multiplicative function, and so-to any 
of the factors (2) of an arbitrary multiplicative function (1). In what fol- 
lows, there will be established natural analogues to Theorems I-III for the 
case where prime multiplicative functions g(n) = fin(m) are replaced by 
arbitrary multiplicative functions f(n). It seems to be hard to replace 
Theorems IV, V, VI below by theorems which are of the same sharpness as 
the corresponding Theorems I, IT, III above. 

It will be convenient to associate with every multiplicative function f(n) 
another multiplicative function f.(n), which is defined by 





Then, since also f(n). is multiplicative, 


(16) it) = 3 dj (à). 


THEOREM IV. The average M{f} of a multiplicative function f(n) 
exists whenever | | - 


an O ŠIRI, 
in which case | 
(18) Miro. 


Remark, It will be clear from the proof that (18) holds, if f(n) = 0, 
also when the series (17) is divergent (in which case Af{f} == + œ). 


Proof. It is seen from (16) that, for every n = 1, 


sf —3 Lz] kf- (k). 


Hence, | 
3 fh) =n À + OC EI FD 


as n—> œ. Since (17) implies that= Sk | f«(k)| — 0, it follows that 
b kel f 


3 f(k) =n à f(k) + o(n). 
k=1. ki 


This completes the proof of Theorem IV. 


It will be convenient to extend the tlass of an arbitrary multiplicative 
function f in the same way as condition (3) extends the class of prime 
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PE functions. The Sion in question may be defined by the — 
requirement that the given f(n) admits of a factorization (1) in which a 
factor fn need not be multiplicative but merely such that condition (3) is 
satisfied by g = fm and p= pn, Where m = 1,2,- +. ae fm(1). need 


not be 1, it must, of course, be assumed that the product T fm(1) is con- 


vergent and remains convergent if one omits a ‘finite sure of its factors. 
If these conditions are satisfied, f will be called a generalized multi- 
plicative function. For the proof of Theorem V below, those and only those 
generalized multiplicative functions will be needed for which fm(1) — 1 holds 
for every m with the exception of one value; say m = r, for which fr(1) = 0. 
For a generalized, multiplicative function f, let f. denote the generalized 
multiplicative function l 


(19) F2) fem); where fan (1) fn (1), fa (y= En) | 


and every fme is prime multiplicative. It is clear that this definition of f. 
reduces to the definition (15) if the A multiplicative function f is 
multiplicative. i 


Tasoment IV bis. Theorem IV alae foe generalized multiplicative 
functions f also. 


This is readily seen from the proof of Theorem IV and from the 
definitions of the generalized multiplicative functions f, fe 
The following considerations will be based on an auxiliary lemma. 


Lea. The product a 
(20) | f(n) = T fe(m). 


of a finite number of prime multiplicative functions fı,” - * , fm which belong 
to distinct prime numbers p,’ ``, pm is almost periodic (B>) for a fixed 
AZ 1 whenever each of the functions fi,---, fm is almost periodic (B^) for 
this À. 


Proof. For every r(—1,: - -,m), define two prime multiplicative 
functions ur, vr of n by placing ` E ; 
vr(n) = Max (1, |f), ; ia E ECOV 
so that $ e 
| (ME sis S a(n), 
and | 


fi(n) = ur(n)ur(n). 
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Since u,-is a bounded function of n, criterion (8) of Theorem II is satisfied 
by g = ur for every exponent, and so.u, is almost periodic (B=) for every x. _ 
On the other hand, since f- is supposed to be almost periodie (B>) for a fixed 
À, Theorem II assures that (8) ig satisfied by g — fe, and so, in view of the 
definition of vr, by g==v, also; so that v, is almost periodic (B>) by 
Theorem II. Since the product f of the m functions f- is the product of the 
m + m functions ur, vr, where the u, are almost periodic (B*) for arbitrarily 
‘large x, and since A 2 1 by assumption, it follows that the proof of the Lemma: 
will be complete if one shows that the product 


- (21) y(n) = tt vr(n) 


of the m almost periodie (B>) functions v- is almost periodic (BY). And this 
may be shown by induction from m to m + 1, as follows: 

Suppose that v is already known to be almost periodic (B>). Let w be a 
prime multiplicative function which belongs to a prime number p distinct from 
the primes pi, * : *, Pm to which the factors v,;-- : , vm of v belong: Suppose 
finally that w is (as are these factors vr of v) not less than 1 and almost f 
periodic (B>). The induction from m to m + 1 requires to prove that the 
product vw is almost periodic (B>). To this ed, let w? = w (n) denote the 
function which one obtains by applying the definition (9) to g == w, where 
j=1,2,---. Then w/ is periodic, hence almost periodic (B*) for every x, : 
and so the product vw? is almost periodic (B>), since A = 1. Hence, the proof 
- of the almost periodicity (B>) of vw will be complete if one shows that 


WU {d!} +0 as joo, 
where d! — d! (n) denotes the function 
di =— | vw — vw! |>; (G =1,2, >). 


But it is clear from the definitions of v, w and wf, that d/ is for every fixed 7 
a generalized multiplicative function in the sense defined before Theorem 
IV bis. Hence, by Theorem 1V. bis, it is sufficient to show [cf. (17)-(18)] that 


oo 
(22) ae _ à | d.(n)| 
zà eae au ya ‘n=l 
is convergent for a fixed j and that 
| AN{d)} [=] 3 di. (n) |S 5 | B.(n)| 30 as j> o, 


where de(n) is obtained by Anne the notion (19) to f= di. And this 
may be proved as follows: 


à 
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Since v%,°--,%m and w are prime multiplicative and almost periodic 
(B>), Theorem IT assures that condition (8) is satisfied by any of the m + 1 
functions g =, 01," °°, Um; so that, by the definition (19) of the asterisk 
symbol, eet í ` i ` 


(3) 0 È | d(n)| < o, 


if. d° = d’ (n) denotes the multiplicative function 
d? =m (vw) = (v vaw); cf. (21). 


But it is clear from (19) and from the definitions of w and d/, where j > 0, 
that the series (23) is a majorant of (22), also if one replaces by zeros those 
terms of (23) which are not divisible by the (j + 1)-th power of p. Hence, 
it is clear from (23) that the sceries (22) is convergent for every j and has 
‘a value which tends to 0 as j > œ. | 

This completes the proof of the Lemma. 


Remark. It may be mentioned that also the converse of the Lemma is 
true, i.e. that for the almost periodicity (B>) of a finite product of prime 
multiplicative functions belonging to different primes it is not only sufficient 
but also necessary that each of thèse prime multiplicative functions be almost 
periodic (B>), where À = 1 is arbitrarily fixed. This converse of the Lemma 
is not needed in what follows, so that the proof will be omitted. 


THEOREM V. If X21 is fiwed and f is a real non-negative multi- 
plicative function, then f is almost periodic (B) whenever 


(24) 3 [Pa] < o. 


Tt is understood that by f^ is meant the function which belongs to f* in the 
same way as the multiplicative function f. defined by (15) belongs to f, and 
that f == f(n) denotes the A-th power of f = f(n). . 


‘Remark. In view of the Remark which follows Theorem IV, condition 
(24) is necessary as well for the almost periodicity (BY) of f in case fe = 0. 


Proof. It is clear from the definition (19) that if (24) is satisfied by 
the non-negative multiplicative function f, then it is also satisfied by each of 
eits prime multiplicative factors fm = 0. Thus, condition (8) is satisfied by 
g = fa fa + >, and so Theorem II assures the almost, periodicity (B>) of 
any of these prime multiplicative functions. It follows therefore from the 
preceding Lemma that the multiplicative function 
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(25) in (n) = TE fe(n) 


of n is almost periodic (B>) for every m. Hence, in order to complete the 
proof of Theorem V, it would be sufficient to prove that 


(26) M{| f— lun [90 as m o. 


However, it will be convenient to carry out the limit process leading from 
(25) to (1) in another manner, namely by means of a pair of auxiliary func- 
tions u = u(n), v ==v(n) which are defined in n terms of the given function 
f= f(n) as follows: 

Both functions u(n), v(n) are multiplicative, sd 


GOT uP) = Min (f0); oP) Mex (Lf) O 


for every prime p and for every k21. Since f Æ 0 by assumption, it is 
clear from (27) that 


(28) Susi; 

while LT 

(29) f= uv 

for every n. Furthermore, from (27) and (15), 

(30) JulSlpels lols 


By wn and Um will be meant the prime multiplicative functions which belong 
to the m-th prime number p = pw in the s same way as fm in (1) and (2) 
belongs to f; so that 


(31) u(n) = H a(n); v(n) = T on(n), 


, It-is clear from (30) that the assumption (24) remains satisfied if one 
' replaces f by u or by v. Since it was shown before that each of the partial 
products (25) of the infinite product (1) is almost periodic (B>) if f satisfies 
(24), it follows that each of the partial products of either of the infinite 
products (31) is almost periodic (B>). Hence, in order to prove that « and v 
are almost periodic { B>), it-is sufficient to show that 


(32) Mflu—anl}0; H{lv—ynP} 0 asm o, 
where Em = Tm(n), Yn = Ya (n) denote the multiplicative functions a 
nl m ai 
5 r=1 r=1 


But if both functions u,v are known to-be almost periodic (B>), then also 
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their product is almost periodic (B>), since u is bounded, by (28). It follows 
therefore from (29) that the proof of Theorem V will be complete if one 
verifies (32). ` 

In order to prove (32), notice that, by (27), (28) and (31), 


(84) 12422202... Zuz; 1SynSySyn5---S0 


for every n. Furthermore, (30) and (24) assure that (17) is satisfied by” 
any of the functions f = uà, vò, mò, ym*; 80 that, by Theorem IV, the average 
of any of these functions exists. And these averages satisfy, in view of (18), 
the limit relations 


(35) AL {tm} — Hu}; M {yn} > M{u} as m— o. 


But it is clear from (34) and (35) that the proof of (32), hence also the 
proof of Theorem V, will be complete if one verifies the following elementary 
lemma (which has nothing to do with multiplicative functions) : 


Lemma. If there exists a finite average M {f} for the A-th power (A 21) 
of each of the real non-negative functions f(n) = F(n); Fi(n), Po(n), °° 
and if, for every fixed m, 


(36) either Fun) S F(n) for every n or Pn(n) = F(n) for every n, 
then the limit relation 


(87) M{F yy} > AL{fr}, m— ©, 
implies that 
(38). M{\ F— Fn >} > 0, m—> co. 


In order to prove this Lemma, notice that, since A = 1, 
(1—t)§S1—P frosts]. 
Hence, it is clear from (36) that 
| F—F, |S | — FN |. 


Consequently, (38) follows from (37) in view of (36). 

This completes the proof of Theorem V. 

Theorem V may be refined by exhibiting a sequence of almost periodic 
functions (B>) which are explicitly defined in terms of f and tend to f with 
reference to the metric of the space (B>): 


o  THEOREM Vbis. If a real non-negative multiplicative function f satisfies 
(24) for a fixed AÈ 1, then the products (1), (25) are almost periodic (B*) 
and satisfy (26). © 


Proof. It was shown in the Proof of Theorem V that each of the real 
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non-negative functions htm; Em, Ym; u,v; f of n is almost periodic (Bò). 
Furthermore, (25), (27) and (33) imply that hm = tym; so that, by (29), 


f— lhin == (u — tm) + (v — Ym) Tn. 


Hence, if À > 1, it is seen from Minkowski’s inequality that in order to prove 
(26), it is sufficient to show that 


M{|(u—am)v|)}>0; M{|(v—ym)œm |>} > 0, (m— œ), 


where t; 2, %, °°: are, by (28) and (34), uniformly bounded functions of n. 
Consequently, if A > 1, the assertion (26) is, in view of Holder’s inequality 
and of the almost periodicity (B>) of v, implied by 


M{| u — in|} 0; {| v—ym |} 0, (m— œ), 


a pair of relations which obviously imply (26) in the limiting case à = 1 
also. Since this pair of relations is, in view of the uniform boundedness of 
the functions u — Tı, u— 2, - : of n, equivalent to (32), the proof of 
Theorem V bis is complete. 


THeorres VI. A multiplicative function f 20 is almost periodic (B) 
whenever 


oo 
(39) 3 | fe(n)| < ©, 
where f.(n) is the mulliplicative function defined by 
Fe a E i 
pig) LAI) 
The Fourier expansion of f then is 


[se oO | 
(40) f(n) om. 5 ni Cm (a), where hy == x f-(ml), 
mel t=1 


Cm(n) denoting the Ramanujan sum (13). In particular, f is limit-periodic 
grenzperiodisch ). 


Proof. Itis clear from the assumptions of Theorem VI that the assump- 
tions of Theorem IM are satisfied by each of the prime multiplicative functions 
g= fufa}: + which occur in the factorization (1) of f. It follows therefore 
from (11) and (12) that (40) is true if f is replaced by any of the functions 
fifa. Since (13) is known to be a multiplicative function of m for- 
every fixed n, it follows that the series (40) belonging to the finite product 
f = hu = fife: + + fm may be obtained hy a formal multiplication of the m 
Fourier series (40) which belong to f = fi, fe, © * , fm, respectively. But the 
resulting formal product is identical with the Fourier series of the function 
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lim = fifa: * ` fu, as’seen by a repeated application (for À — 1) of the Lemma 
which precedes Theorem V bis. Consequently, (40) is true if the infinite 
product (1) is replaced by the finite product (25), where m is arbitrary. 
Hence, on applying (26) for A == 1, one sees that (40) holds for the infinite 
product (1) also. This completes the proof of Theorem VI. 

. The above results will now be applied to the case of classical multi- 
plicative functions, involving Euler’s $-function and the sum, oa(n), of the 
a-th powers of the divisors of n; so that 

por — 1 


CIE oa(n) = 3 ds, ie, ap) = = 


(where it is understood that a denotes the limit (== k + 1) of ca(p*) as 
a —> +0; so that the function co(n) represents the number of the divisors 
of n). 

For sake of shortness, a function will be called almost periodie (B®) 
if it is almost periodic (B>) for arbitrarily large À. 

(i) The multiplicative function f(n) — oa(n)/n* is almost periodic 
(B®) for every a > 0 and has the Fourier expansion 





(42) LM tata à at) (a0); 


ne mi ? 
while M{oo} = + œ. 
Proof. In order to prove the almost periodicity (B®), it is sufficient to 


show that the criterion (24) of Theorem V is satisfied for every positive À 
But if f(n) =oa(n)/n%, then, by (41), 


(pote — 1) y PES 1) pr (pr —1)) 
43 VE (et) paw! so that f^ = 
by (15). Hence, it is easy to see that f. > 0; so that, on applying to the 
series (24) the Euler factorization, one sees that the criterion of Theorem V 
requires that : 


n § (OI gy <% ras A> 0. 
p , 








But this product of sums may be rewritten as . 
E E SS td ee mu 
pro = (1—pr*) pg pix (=p © 


“and is therefore of the form 
n(2 - 1, @ na p2) +o) 
a r (1+ Ap + O( p24) + O(p*)). 
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And this product is absolutely convergent for every A> 0 and for ay í 
a > 0, since EP < © for every e > 0. 
. This completes the proof of the almost periodicity (B®) of (41) for 


every & > 0. Since application of (48) for à — 1 shows that f-(p*) = po, 


one has, for m—1,2,: >, 
co 4 00 
= f(m) = 3 (mi) (a +1)/ms; 
I=1 l=1 


so that (40) reduces to (42). | 

The calculations involved become shorter in case of Euler’s ¢-function, © 
in which case the result is as follows: 

(ii) Both functions p(n)/n, n/p(n) are almost periodic (B®). Their 
Fourier expansions are 

#0) p? n E) =p 
U —— 44.) —— Zc n 

(441) ~ Fay? X ca(n) pla P? — pi? ( 2) $(n) ™~ (3) Cq(%) qi— pe ? 
where the summation index q runs through the quadratfrei positive integers, ` 

Proof. Since $(p*) == p*— p*", application of the definition (15) to the 
A-th powers of the respective-functions f(n) —¢(n)/n and f(n) =n/(n) 


gives 


(451) P (p) er P, PA) =0 if k> 1, 











(452) (p) = Pt, PR) =0 if k >l. 


. Hence, the criterion (24) of Theorem V requires, for a fixed A, that 


(1+ | Ap(A)|) < ©, ive, that 3| Ap(A)| < œ, 
? p : 


. where 


sn. (Dies (pr D} 
Ap(À) pe and A;(À) Gap? 
respectively. Since it is clear from Sp < œ that ¥|Ap(A)| < œ holds 


for every À > 0 in both cases, the almost periodicity (B®) -follows. Finally, 
application of (44,) and (442) for à =. 1 gives © 


L a >k > 1 and f(p)= TE spy he 0,4 >1, 
respectively; 80 that (44,) and (4) readily follow from (40). 


THE JOHNS HOPKINS UNIVERSITY. 


ON UNIFORMLY ALMOST PERIODIC MULTIPLICATIVE AND 
ADDITIVE FUNCTIONS.* 


By E. R. van KAMPEN. 


In this note conditions are established for certain multiplicative’ or 
additive functions to be uniformly almost periodic? (u. a. p.). 

A subscript p on the symbols H or 3 will denote a product or sum over 
all primes, except that sometimes (explicitly) a finite number of primes will 
be excluded. 

A multiplicative function f is an arithmetical function f= f(n), 
t==m1,2,-° +, which satisfies 


(1) f(n) = f(m)f(t2) if (nn m2) = 1, Fa) = 1]. 
Such a function may be written in the form 

(2) f(n) = f) (we 8), 
where fp(n) is for a fixed prime p defined by 

(3) fi(n) == f(p*) if p*|n and “fn. 


The product in (2) clearly is a finite product for every fixed n. 
An additive function g is an arithmetical function g—g(n), n=1,2,°°°, 
which satisfies 


(4) g(mns) = g(m) + 9(me) if (mm) =L [g(1) = 0). 
Such a function may be written in the form 

(5) ` g(n) = 3pgo(n), (n= 1,2," °°); 
where gP(n) is, for a fixed prime p, defined by 

(6) go(n) = g(p*) it |n and ptn. 


Thus the sum in (5) is a finite sum for every fixed n. 
The main result may be formulated as follows: 


THEOREM 1. An additive function g(n) [real-valued multiplicative func- 
lion f(n)] is u.a. p. if and only if the sum representation (5) [the product 


* Received November 30, 1939. : 

1 Conditions for such functions to helong to a class (BA) of Besicovitch almost 
periodic functions are investigated in: E. R. van Kampen and Aurel Wintner, American 
Journal of Mathematics, vol. 62 (1940), pp. 613-626. 


627 


628 E. R. VAN KAMPEN. 


representation (2)] is uniformly convergent and each summand gp [factor fr] 
TS U. Q. p. 


The sufficiency of the above conditions is, of course, evident from the 
elementary properties of u. a. p. functions, also for complex valued multiplica- 
tive functions. On expressing the conditions in analytical terms, one obtains 
tha following theorems: 


THEOREN 2. An additive function g is u.a. p. tf and only tf the series 


(7) 3, 1. u. bx | g(p*)| is convergent, 
and the limit 
(8) ap = lim g(p*) exists for every prime p, (k= œ). 


It will be clear from the proof that if g(n) is u. a. p., then the unique u. a. p. 
extension of g(n) to non-positive values of n may be obtained by placing: 
g(—n) =g (n) and g(0) = 30». 

Tamorex 3. A multiplicative function f ts u.a. p. tf, and in case of 
real-valued functions only if, the series 


(9) Xp Lu. bx | f(p*) -- 1 | is convergent, 


and thé limit 
(10) Bp == lim f(p*) exists for every p, (k—> ©). 


And one sees easily that the unique u. a. p. extension of f(n) for non-positive 
n may be obtained by placing f(--n) = f(n) and f(0) = Hp 

The proof of Theorem 2 is simpler than the proof of Theorem 3. One 
could, reduce Theorem 2 to a special case of Theorem 3 by considering the 
multiplicative functions exp (@g(n)) and exp (&g(n)), where g(n) is 
additive. The, apparently difficult, complex case will be reduced in Theorem 5 
to the case of additive functions modulo 1. This reduction depends on the 
following theorem on u. a. p. arithmetical functions h(n) of absolute value 1. 


THEOREM 4. If the function h(n) is u.a. p. and satisfies | h(n)| = 1, 
n=1,2,---, then there exist an integer P, real numbers cy and real-valued 
u. a. p. functions pu, u=1,- °°, P, such that 


h(u + uP) = exp 2rt(cyn + wu (nr) ) ; 
(u=1,---:,P; n=1,2,-°°). 


Theorem 4 is a special case of a known theorem concerning generalized 
almost periodic functions on groups.” 
a 2 


3E. R. van Kampen, Journal of the London Mathematical Society, vol. 12 (1937), 
pp. 3-6; the result needed is (2), as modified by the last remark on p. 4. Since this 
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Use will be made of the following lemmas: 


I. If the function (n) satisfies for a ficed prime p the requirement 


(11) | p(n) = $(p*) if pln and gta, . (k=0, 1, 2," S oF 
then p(n) is u.a. p. if and only if 
(12) y = lim (7°) exists, (k> ©). 


Clearly this lemma? is applicable both to the summands gp of g and to 
the factors fp of f. | 

Let ¢; denote, for j = 1, 2,- - : the periodic functions defined by 

s(n) = (n) if 1S NSH; $)(n + p) — gi (n) for every n. 

If (12) is satisfied, then ġ; (n) > (n) uniformly with respect to n as j— œ,- 
so that #(n) is u.a.p. In order to prove the converse, assume that ¢(n) is 
u.a. p., and let y = p(l) denote the translation function of ẹ (n), i. e., let u(1) 
be for a fixed 7 the least upper bound of the function | (n +1) —¢(n)| 
of n(==1,2,---). It is clear from (11) that a(l) is the least upper bound 
of the expression | (pit?) --$(p*)| as j —1,2,: - +, where k = k(l) denotes 
the number of times p occurs in the factorization of 1. Since ¢ is u. a p., the 
lower limit of a(l) as 1—> œ is 0. Thus the lower limit of | (p) —¢(p*)| 
as &—> œ is 0, so that (12) holds. This completes the proof of I. 


II. The additive function g(n) ts bounded if and only if (7) holds, 
and in that case the series (5) representing g(n) is uniformly convergent. 


_ First, if (7) holds, then the absolute value of any g(n) is not more than 
the sum of the series in (7), so that g is bounded. On the other hand, if 
| g(n)| SM holds for every n, then | 3’g(p*)|S M, where the sum X is 
taken over any finite number of distinct primes p and the exponents are 
arbitrary. But this implies that %,|g(p*)| converges uniformly in the expo- 
nents k, and so (7), holds and (5) is uniformly convergent. This completes 
the proof of IT. | 


modification has only been indicated, the following reduction of Theorem 4 to a con- 
jecture of Wintner which was subsequently proved by Bohr (Danske videnskabernes 
Selskab X, vol. 10 (1930)) might be useful. Let P be a translation number of A(n) 
which belongs to the value 1. Then h (n) =h(u +P) is, foru—1,... , P, à ua. p. 
function for which | hy, (m+ 1) —h,(”)| 51. Thus the continuous u.a.p. function 
obtained from h(n) by linear interpolation does not come arbitrarily near to 0. On 
applying to this continuous function the result of Bobr, one obtains the constant o, and 
the function with the properties stated ire Theorem 4. 

3 This lemma is closely related to a result of Toeplitz, Mathematische Annalen, 
vol. 98 (1927), p. 282. 
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II. If the additive function g(n) is u.a. p., then (7) and. (8) are 
satisfied. Since (7) follows from the boundedness of g, by II, it remains to 
prove that’ (8) holds. 

Let po be a fixed prime and let g °(n) denote the summand Of g in (5) 
which corresponds to po. If a(l) and w°(1) denote the translation functions 
of g and g°, it will first be shown that 


(13) - (2) = (2D) holds for 1=1,2,---. 


Let e > 0 be given and let a partial sum g'= g'(n) of the uniformly con- 
vergent series (5) be determined in such a way that | g(n) —g'(n)| <e 
for every n, and] that the:term of (5) which corresponds to the prime po is 
one of.the terms in g’. Then, if »’(Z) denotes the translation function of 9, 
one has p (21) S (21) + 2e. Since p°(21) and a(21) are independent of «, 
- it is clear that (13) will follow if one proves that w°(21) Sy’ (21) for 
Tom 1, 2,° 

To this effect, let ? and n be given and let & be determined in such a way 
that neither n nor n -+ 21 is divisible by po’. Then, from-(6), 


9° (m+ rpok) = g°(n) and g°(n + 21 + rp) — g°(n + 21), 
= Las") 


The number r may be determined in with a way that nt rpc and 
n+ Ql + rp are not divisible by any of the primes (except po) which 
correspond to summands occurring in f.* For this r one has 


g (m+ tpok) = 9° (n+ rp) and g'(n + 21+ rpo 2) — p{n + 21 + rp), 


since every summand of g (with the exception of g ey is 0 at the values of n 
in question. Hence : 


PEE TE — a(n + rh] = | 9" (n 42) — ge (n)|, 


“so that p°(21) = x (24) by the definition of a translation function. This 
completes the proof of (13). ; 

Since g is u. a. p., one has lim inf a (21) = 0 aslo. “Thus, from (13), 
also lim inf a°(21) =0 as 1—> œ. But it is clear from the proof of I that 
the last relation iiplies the existence of the limit (8) for p = po Since po 
was an arbitrary prime number, the proof of III is complete. 

The proof of Theorem 2 is now evident. ` In fact, if (7) and (8) hold 
for the. additive function g, then g is, by II the sum of the uniformly con- 
eae 1 

‘In fact, r is a solution of a system of lifear congrüences to distinct prime moduls. 


Note that at least one of the numbers n and n $ 21 + 1 is even, 80 that the restriction 
to even translations 2} is necessary. 
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vergent series (5), thé terms of which are u. a. p., by I. Thus g is a u. a. p. 
fanetion. On the other hand, if the additive function g is.u. a. p., then (7) 
and (8) hold, by III. Thus the proof of Theorem 2-is complete. And it is 


clear from I, TI and Theorem, 2, that, the part of Theorem 1 which concerns 
additive functions is correct. 


IV. A multiplicative function f(n) satisfies (9) tf and 7. tf f(n) is 
bounded and the product (2) representing f(n) is uniformly convergent. 

That (9) implies the boundedness and the uniform convergence of (2) 
is obvious. But, if (2) is uniformly conveïgent, then there exists for every, 
«> 0 a prime number g.such that |1—Il’f(p*)| <e if the product I’ is 
taken over any finite number of primes p larger than q, and the & are arbitrary 
exponents, If in-addition f is bounded, then the product U,f(p*) (taken 
over all primes) converges absolutely-uniformly with respect ta the exponents ` 
k. Thus 3, | f(p*) —1 | converges uniformly with respect to the exponents k. 
And this, obviously, implies (9), so that the proof of IV is complete. 


V. If f(n) ts au. a. p. function and if (9) holds, i. e., tf the product (2) 
ts is uniformly convergent, then (10) holds also. f 


The proof of this statement will be omitted, since it may be obtained from 
the proof of ITI by unessential modifications. 


VI. If f(n) is a non-negative, multiplicative u.a. p. function, then f 
satisfies condition (9). Ho . 

Let M be an upper bound of f, so that 0 £ f MM for every n. Then one 
has 0ZITf(#) = M, where the product is taken over any finite number of 
distinct primes and the exponents k are arbitrary. Thus X, (l. u. baf (p£) —1) . 
is convergent, and (9) will follow if it is proved that 3%,(1— gr. 1. baf(p*)) ` 
also is convergent (note that f(p°) — 1). , 

Let the integer L be such that every group of L consecutive integers 
contains a translation number of f (n) belonging to 4. If 3, (1 —gr.l. baf(p*)) 


is ce so that IL, gr.1.b. ae) — 0, one can ually construct L numbers 
Re" fx such that 


1 A a sg 
fu) < 337; (m, n) =1 if ij; (j=; D). 


Since no two numbers n; have a common factor, there exists an integer N 
such that | 


N + i= n (mod fe’), for i= 1,- -, L. 


+ Then n; and (N + i)/n; — n’; are relative prime integers, so that 
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EN + à) ETW) EFM < h, Gi. --,L) 


This contradicts f(1) = 1, since there must exist at least one à such that 
0 < i = Land | f(N + à) —f(1)| < 4. Thus 3,(1 — gr. L ba(p*)) is con- 
vergent and so (9) holds for a non-negative, u. a. p., multiplicative function f. 
This completes the proof 5 of VI. 

It is clear from I and IV that the part of Theorem 1 which concerns 
multiplicative functions is an immediate consequence of Theorem 3. And, 
if a multiplicative function f satisfies (9) and (10), then the product (2) 
which represents f is uniformly convergent, by IV, and the factors of this 
product are u. a. p. functions, by I. Thus, in this case f is u. a. p., and the 
proof of the sufficiency of the conditions in Theorem 3 is complete. Now 
suppose that the real non-negative multiplicative function f is u. a. p. Then 
.f satisfies (9), by VI, and hence f satisfies (10), by V. Thus the proof of 
the remaining part of Theorem 3 is complete in case the u. a. p. multiplicative 
function f is supposed to be non-negative. 

Jt is clear from V, that the proof of Theorem 3 will be complete if the 
following analogue of VI for real-valued multiplicative functions is proved: 


VII. If ftsareal-valued multiplicative u. a. p. function, then f satisfies (9). 
First, by VI, the function | f | satisfies (9), so that 
(14) Xp l.u. b.x | | f(g*)|—1| is convergent. 


Let p1, °°", pm be a finite number of distinct primes, including all those primes 

for which gr. 1. bæ | f(p*)| = 0. Then one has, as a consequence of (14), 
Mygrlbalf(p)|>¢ %<e<], 

where the product is taken over all primes except pı,’ © `, pm. 

Next, let A(n) be the multiplicative function which is defined by A(n) = 0 
or A(n) =f(n)/| f(n)| according as » is or is not divisible by at least one of 
the primes pı,’ '*, pm. Then A(n) is au. a. p. function. For if x(n) denotes 
the (periodic) function which is 1 or 0 according as n is or is not divisible by 
at least one of the primes pı, °°, pm, then x(n) = (1—x(n))|f(n)| + «(n) 
is a real-valued u.a.p. function with the positive lower bound c. And so 
A(n) = (x(n) —#«(n))f(m) also is a uw. a. p. function. 

Because f(n) is real valued, the u.a. p. function A(n) can only assume 
the three values 0, 1 and — 1, so that A(n) is periodic. Since A(n) is multi- 
plicative also, one has A(p*) == 1 for every & if the prime p is not a divisor 
of the primitive period of A(n). Hence it is clear from (14) and the definition 
of A(n) that (9) holds for f(n). This cempletes the proof of VIT, hence also 
the proof of Theorem 3. 


3 This proof was communicated to me by P. Erdës. 
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It is easily seen that the proof of VII may be extended to the case where 
A(n) ia restricted to assume a finite number of values. Thus one obtains: 


VIII. If the multiplicative function f(n) is u.a. p. and tf in addition 
there exists an integer r such that f(p*)" ts non-negative for every k unless p 
ig one of a finite number of primes, then f satisfies conditions (9) and (10). 


In. fact, if this finite number of primes is included among the primes 
Pis °°" Pm, of the proof of VII, then the function A(n) of that proof can only 
assume as values either 0 or exp (2atk/r), k—1,::-,#. Thus A(z) will 
again be a periodic function, so that the proof may be completed as the proof 
of VII. 

The reduction of the general multiplicative case to the case of additive 
functions modulo 1 will now be formulated and proved. A function y(n) will 
be called additive modulo 1 if 


4 (nine) =y (m) + Y(n) (mod 1) if (m,n) = 1 


and n, and n are not divisible by one of a finite number of primes pi,°--, pm, 
while y(n) = 0 for all n which are divisible by one of these primes. Let {c} 
denote the distance from c to the nearest integer. 


IX. The statement that conditions (9) and (10) are necessary for any 
multiplicative function to be u.a. p., is s equivalent to the statement that the 
condition 


(15) 3, Lu ba (Y (#)} < + 00 
ts necessary for a function y(n) to be u.a. p., if y(n) is addilive modulo 1. 


For a given u. a.p. multiplicative function f(n), let the u. a.p. multi- 
plicative function A(n) and the periodic function x(n) be defined as in the 
proof of VII. ‘Then A(n) =A(n) + x(n) is u.a.p. and satisfies | h(n)|—1, 
so that Theorem 4 is applicable to h(n). It may be assumed that the integer 
P of Theorem 4 is divisible by each of the primes p.,---, pm of the proof of 
VII. For certain values of the integer u of Theorem 4, one will have 
h(u+nP) = x(u +nP) —1, for every n. For the remaining values of u, 
including in particular u = 1, one has x(u + nP) = 0 and 


(16) A(u + nP) = exp Rri (Cun + ÿx(n)), (n=1,2, >), 


where cy is a real constant and yw a real-valued u. a. p. function. It will be 
shown that one may choose Cuy == 0 for every u for which (16) holds. Since 
en is congruent modulo 1 to a u. a. p. function of n (either for all n or on any 
. arithmetical sequence of values of nŸ if and only if o is rational, it will be 

sufficient to show that cy is rational for every u for which (16) holds. Let 
such a u be fixed. . 
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There exists an arithmetical sequence of values of n such that 
(1+P,u+nP)—1 hence A(1+P)A(u+nP) =A((14+ P)(u+nP)). 
Thus, since (16) holds for this u and for u == 1: 

cı + yi (1) + Cun + p(n) = (n+u+nP)en + Yu(n + u+ nP) (mod 1) 
OT o, Pn=pu(n + u + nP) —Yu(n) — ya (1) — c: + ue, (mod 1) 


for every n belonging to an arithmetical sequence. Since the right side is 
_u. a. p. on this sequence, so is the left side. Thus cx is rational, and one may 
suppose that cy = 0 for every u for which (16) holds. The resulting functions 
du(n) may be combined into one ‘function ẹ by placing y(u + nP) = Yu(n). 
On applying the definition of A(n), one sees that | 

(17) f(n)/f(n)l—exp2ri(n) — (m, Pips ` * Ba) = I, 

where y(n) is a real-valued u. a. p. function, which may be defined. to be 0 
if n is divisible by one of the primes pı,’ ``, pm. Since f(n) is multiplicative, 
y(n) is additive modulo 1. Thus if (15) were a necessary condition for a 
function y(n) of this type to be u a. p., then (15) would hold for y(n), and - 
it would be clear from (17), (14) and V that (9) and (10) were necessary 
conditions for any multiplicative f to be u.a. p. This proves one part of the 
assertion IX. 

Now let y(n) be u. a. p.'and additive modulo 1. Then the function f{n) 

which is defined by 


f(n) = exp riy (n) if (n, Pape: ` fm) =I 

f(n) = 0 if (n, pips: * > fm) > 1 
is u.a. p. and multiplicative. Thus if (9) were a necessary condition for a . 
‘multiplicative function to be u. a.p., then this f would satisfy (9), so that 
y(n) would satisfy (15). This completes the proof of IX. . 

As an example to Theorem 3, consider the function f(n) = oa (n) /n", a > 0, 
where a(n) denotes the sum of the a-th powers of the divisors of n. It will | 
be shown that og(n)/n* ts uniformly almost periodic® if and only tf a > 1. 
In fact, one has for this f — f(n): 


° poke — 1. ae sd -ak 
No =e HS 


so that Lu. b.s | f(p*) — 1 | = (p®— 1)7, and (6) holds or does not hold 
according as «œ> 1 ora& 1. Since f(p*)— (p*—1)*+1 as k—0 and 
since f(n) > 0 for every n, the above statement follows from Theorem 1. 
THE Jonns HOPKINS UNIVERSITY. * j 
° The function o,(n)/na is almost periodic (BA) for every À if a > 0. This was 
shown loo. cit, p. 625. Cf. Ramanujan, Oollected Works, Cambridge (1927), p. 184, 
formula (6.1). 
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By Paur Ernôs and Aure WINTNER.' 


1. By an additive function f = f(n) is.meant a sequence f(1), $ (2); Rig 
defined for every positive integer n in such a way that 


. (1) f(t ne) — f(n) + f(n) whenever (nı, no) == 1; (f(1) — 0). 
Thus, | i 


| œ 
(2) f(n) = 37 (n) — lim fi(n), 
k=l k 00 : 
where fe = f(n) denotes, for fixed k, the additive function 


(3) | f(n) = à 0 (mn), 


and f® =f") (n) is the additive ‘function which is defined in terms of the 
k-th prime number, pr, as follows: 


0, if n 3 0 (mod pr), 
f (pr), if patin and ptn, 


(pi = 2, P2='3, pa = 5, > +). Conversely, if {{f(#!)}} is any given double 
sequence of numbers, then (4), (3), (2) define f®, fx, f, respectively, as 
additive functions of #. In fact, all but a finite number of the terms of the 
infinite series (2) is zero for every fixed n. 

The function f(n) is called multiplicative if in condition (1) the sum 
f(n) + f(n2) becomes replaced by the product f (n1) f (ne). Conditions which 
are either necessary or sufficient for the almost periodicity (B?) of a multi- 
plicative function f(n) are implied by the results of a recent paper.! However, 
none of the results found loc. cit.1 supplies a criterion which is necessary and 
at the same time sufficient for the almost periodicity (B?) of a multiplicative 
function (not even if f(n) is supposed to be real-valued). This situation is 
not surprising, since if a real-valued multiplicative fufiction f(n) changes 
its sign with the uniformity of statistical randomness (as does the Mobius 
function f — x), then the question as to a generalized almost periodic behavior 


CO F9 (n) —] 


* Received December 4, 1939. 

1E. R. van Kampen and Aurel Wintner, “On ‘the almost periodic behavior of 
multiplicative number-theoretical functions,” American Journal of Alathematios, vol. 62 
(1940), pp. 613-626. 
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of f(n) can involve problems of the same order of delicacy as do the relevant 
generalisations of the prime-number theorem, if not of the Riemann hypothesis. 
[While the prime-number theorem is equivalent to the statement that the 
n-average of p(n) exp (iàn) exists for À = 0, Davenport’s results (Quarterly 
Journal of Mathematics, vol. 8 (1937), pp. 318-320), which were obtained 
by an application of the deep methods of Vinogradoff, imply that this average 
exists and vanishes for every real A. In other words, all Fourier coefficients 
of a(n) exist and vanish. Hence, p(n) cannot be almost periodic (B). For 
if it were, the n-average of 


[A@) SOF 04< 5 5 [eal alain Ce 


- ought to vanish. But this average is known to be 6x-*  0.] 
The object of the present paper is to show that the problem admits of a | 
definitive solution in the case of additive, instead of multiplicative, functions. 
In fact, the question of almost periodicity (B?) may then completely be 
answered by the following theorem : 


An additive function f= fin) is almost penom (B*) if and only if 


both series 
o 32, (ay) Fie 


are convergent. 


. This fact seems to be an arithmetical counterpart of a similar result con- 
cerning the case of linearly independent frequencies. (cf. loc. cit.*, pp. 79-80). 
But we were unable to find the common source of these two parallel theorems. 

It is understood that À denotes summation over all prime numbers, which 


are thought of as piesi. according to magnitude (the series (i) need not 
be ‘absolutely PSE 


2. If f denotes the real, and f” the imaginary, part of f, the function. | 
f(n) =f (n) + if” (n) is additive if and only if so are both functions f(n), 
f(n). Similarly, ftn) is almost periodic (B?) if and only if so are f(n) 
and f’(n). Finally, it is clear from | f |? = (F)? + (f”)? that both series 
(i), (ii) are convergent if and only if so are the 2 + 2 series which one 
obtains by writing f and f” for f in (i), (ii). 

Consequently, it is sufficient to prove the italicized theorem for the case 
of real-valued additive functions. Thé posibility of this reduction is essential 
for the method to be applied. In fact, use will be made of a criterion which 
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recently * was proved to be necessary and sufficient for those real-valued addi- 
tive functions which possess an asymptotic distribution function. Now, a 
generalization of this criterion for complex-valued functions is not known and 
seems to lead to essential difficulties. 

The criterion in question states? that a real-valued additive function f(n) 
has an asymptotic distribution if and only if both series 


ECP). f (p)? 
(I) 3 o (II) — 


are convergent, where y* == f*(n) is defined, for y — f(n), by placing 
(5) y* =y or y 1 according as |y| <1 or |y|=1. 


It follows that the convergence of both series (I), (II) is necessary for 
every (real-valued, additive) f which is almost periodic (B?). In fact, it is 
known ° that almost periodicity, in relative measure and so, in particular, 
almost periodicity in relative mean of any positive order (== 2 in the present 
case) is always sufficient for the existence of an asymptotic distribution 
function. 


2 bis. Suppose, in particular, that f(p) = O(1) as p— œ. Then, since 
een 
(6) À de < ©, 
the series (ii) of $ 1 is convergent if and ‘only if so is the series 
fp)? 
P 


hence, one readily sees from (5) that the convergence of the series (i), (ii) 
which occur in the criterion of § 1 is equivalent to the convergence of the 
respective series (I), (II) which occur in the criterion of § 2. 


(7) 





3 


aM 


3. For arbitrary additive functions f, the italicized statement of § 1 will 
be refined by exhibiting, in case of almost periodicity B°), a sequence of 
functions which are explicitly defined in terms of f, tend to f with reference 


2 Paul Erdös and Aurel Wintuer, “ Additive arithmetical functions and statistical 
independence,” American Journal of Mathemattos, vol. 61 (1939), pp. 713-721. 

> Borge Jessen and Aurel Wintner, “ Distribution functions and the Riemann zeta 
function,” Transactions of the American® Mathematical Society, vol. 38 (1935), pp. 48-88, 
more particularly Theorem 24 (and Theorem 25). 
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to the metric of the space (B*), and are almost periodic (B*). In fact, it 
turns out that f cannot be almost periodic (B?) unless it is almost periodic, 
= (B°) in virtue of its expansion (2). In other words, if f is almost periodic 
. (B*), then, on the one hand, each of the functions f®, f®,- : - is almost 
periodic (B?), and, on, the other hand, : 


(8) se ae (fe 31), 


where L(g} — lim à = 3 (1 de 


oo h I=1 


3 bis. Due to this fact, it will be possible to calculate the Fourier series 
of f in terms of the Ramanujan sums 


(9) cm(n) = 37; exp (2ri Ln), where 1=7j<=# and (j,m) —1. 

In fact, the explicit form of the Fourier expansion of an arbitrary additive, 
` almost periodic (B?) function f(n) turns out to be 

(10) f(n) ~ do + RZ treni (1), 

where 1=m1,2,8, + -, k =— 1, 2, 3, <- and 


(11) Go—M{f}, an= 3 Epes!) = wo 


Since (9) consists of ¢(m) terms (¢ == Euler’s function), and since p(p!) 
== pl— pi, the Parseval relation belonging to (10) is 


(18) MEL FI} = ['ao [FB (p — pa) au fr 


4. Itis easy to show that if f is such as to make the series (ii) of 81 
convergent, then each of the functions fx is almost periodic (B*). 

To this end, use will be made of the following fact, proved loc. cit. 
(Theorem II): If a function g = g(n) of the positive integer n is such that, 
for’ some fixed prime number p, one has ` 


(13) g(n) = g(p') whenever p'|n, and ptn, 
then g is almost se (B?) if and only if 
(14) : GE < 
| à p' 
It is clear from (4) that condition (13) is satisfied by g=f™ and 
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p= px, where k is arbitrarily fixed. Furthermore, if f is such as to make 
the series (ii) convergent, then, for every fixed k, 

co 4) (2 
(15) 3 1 foe!) F <o; 

l1 ? 
so that, since f(px') =f (p!) in view of (4), condition (14) also is 
satisfied by g =f and p= pr. Consequently, f® is almost periodic (B°). 
Since k is arbitrary, and since the almost periodic, (B?) functions form a linear 
space, the almost periodicity (B?) of fe now follows from (3). 


4bis. It was shown loc. citt (Theorem III) that if a function g(n) 
satisfies (13) for some fixed prime p and is almost periodic (B), then its 
Fourier expansion is 

co apeu 4-1 
g(n) ~ M{g} +3 aicy'(n), where a, = à se") p a i 
i 


It follows therefore from §4 that if f is such as to make the series (ii) 
convergent, then, for every k, 

; f oo FU) (7.4) — FD [pti 
f® (n) —M{f®} + 3 duc (n), where au à PU, 


Hence, (10) with (11) will follow from (4) as soon as it is proved that, on 
the one hand, the convergence of the series (ii) is a necessary condition for 
the almost periodicity (B*) of f, and that, on the other hand, f must satisfy 
(8) whenever it is almost periodic (B°). 


Proof of the sufficiency of the conditions. 


From here on till the end of §9, the assumption will be that f(n) is a 
real additive function for which both series (i), (ii) of § 1 are convergent. 
The final result ($9) will be that f(n) must then be almost periodic (B?). 


5. In terms of the given f(n), define an F(n) as follows: F(n) is that 
additive function for which the double sequence {{F'(px')}} is given by 


"), if | f(p)| 213 
16 F(pt) = j ip ee 
oie (P = 1 Hp) — Fo), if [FP < 2, 
where p == p, and k—=1,2,3,---. i 

It is easy to see that the convergence of the series (ii) implies that 
: 
(17) : Sal co 
ELP e? 


In fact, it is clear from (16) that the series (17) is majorized by A + B +0, 
where | f i 
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ni t 
saslFOl Rte) ga O 
ap P » Pp. rap P 
and so it is sufficient to prove the convergence of these three series. But 
application of (16) to 1==1 shows that F(p) —0 unless | F(p)| 21, in 


which case F'(p) — f(p) ; so that the series A reduces to 


im 3 UT 


He P 


z 


ma 


=2 


and is therefore convergent in virtue of the assumption that the series (ii) 
converges. Tt is clear from the same assumption and from (6), that also the 
series B is convergent. Finally, the series C may be written in the form 


o—3 3 Ul gs x HA, 


zioei P? ziron P 


rf 


But the convergence of the first of these two double series is assured by (6), 
while the second is, in view of 
1 


œ 
34 < 


Le pt ? (p= 2,3,5,---), 


majorized by 


WI, 


EL P 


Since the value of the latter series was seen to be A < co, the proof of (17) 
is now complete. 


Similarly, 
foe) ty }2 : 
(18) gy LPO co. 
. rip P 


In fact, since (a— b)? S2(a? + L°) for arbitrary real a,b, one sees from 
(16) that the series (18) is majorized by A’ + B’-+ C’ where 


as pais lt piety tl, 
P P 2 p P =a p P 
And the proof for the convergence of these three series requires but a repeti- 
tion (with obvious simplifications) of the above proof for the convergence of 
the three series A, B, C. | 
Notice that only the convergence of the second of the series (i), (ii) 
was used thus far. The same remark will hold for $6. ° 


6. It will now be shown that if F&(n) denotes the additive function 
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which belongs to the additive function F- (n) in the same way as (3) béléngs 
to f(n), then 
(19) . UAS ADEN, as k> o, 


where #{| g |P} — lim sup à OP. 


To this end, notice first that, by the definition (16) of the additive 
function F(n), 


3 | F(m) —Fa(m)? 
Æ. S33 3, 3 | F(p') F(q)| y 3 = — z | F(p!)|°, 


Phase n and k are arbitrarily fixed, and the sainatlon indices p,q run 
| tarouga those primes which exceed k. On writing this inequality in the form 


| [F@)| PUUA 
oa RER 


keeping k fixed but letting n—> œ, one sees that 


(19 bis) ` E| P — Fe |) S ArH ey 
where 
ty} 1) ]2 
-Ss (EEN, pass FER 
kip>k Pp. k1 p> p? 


But these sums «, ¢x are identical with the k-th remainders of the convergent 
series (17), (18), respectively, and tend therefore to zero as k— œ. Hence, 
(19) is implied by (19 bis). 


7. If Gk = Gy(n) denotes the additive function which belongs to the 
additive function , 


(20) | - G@=f—F 
in the same way as fr, Fx belong to f, F respectively, then obviously 
(21) - Gr = fr — Pre. ` | 


Thus, it is clear from (16) that, for any fixed k, the elements of the double 
sequence {{G@(p') — Gi(p')}} of the additive function G(n) — G(n) of n 
are independent ofl, i. e., that ` 


(22) G(p)— Gp) = G(p*) — Gp) = a (p) — (n°) — 
for every prime p. It ia also seen from (16) and (20) that 
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(23) ` | @(p)|S1 

for every prime p. i 
Since the series (i) of § 1 is supposed to be convergent, it is clear from 

(20): and (17) that also 


(24) 3 &(p) is convergent.. 
: P 





Similarly, since the series (ii) of § 1 is supposed to be convergent, it is clear 
from (20), from the Schwarz inequality 


a PF (I s (af) (a FE, 
? P Ap P 2. P 
and from (18), that ‘ 


(25) x Suey Files: 


8. Due to (22), it is now easy to transcribe the O-estimates applied 
loc. cit? (p. 716) into o-estimates, which are to the effect that 


(26) -B| @— G7} 30 as k> o. 


Ta fact, (26) may be proved as follows: 
Tf n and k are arbitrarily fixed, one readily verifies from (22) and. from 
the definitions of the real additive' functions G, Qr, that 


ilam ampa y [2] 6) + 3 ey Gp)’, 


where [x] denotes the integral part of x, the prime of =a’ means that pq, 
and the: summation, indices p,q run through those primes which exceed k 
(however, the sums on the right are finite sums for every fixed n, since 


n n] i 
— | —0 and 2] == 0 whenever pq >n and p >, 
Lee >» and p 


respectively). Consequently, 


(26 bis) | + à | G(m) —G(m) |? 


1 


F La T) ss Pe a) 


ALORS er, 


8 bis. As to the inner sum in the second of the four terms on the right 
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of (26 bis), one sees from (24) that if k is fixed and e® denotes the maximum 
of the function 
GQ) | 


= 
kSqEn/p q 





of p and n on the range å S pin; n= 1,2,- , then ® — 0 as k— ©: 
while the absolute value of the whole second term on the right of (26 bis) has, 
for every n, the majorant 


2 3 | G(p)| CHIH x 1, 
stspsSn P K mapan p 

by (23). Finally, . 

2 s | a(p)G(@)|S4 3 2 by (29). 

1 pan TNn psn” 
Thus, on keeping k fixed but letting n—> œ, one sees from (26 bis) that 
. 1 4 
lim sup — & | G(m) — Gi (m) |? 

n00 N m=1 


£ lim su Í ( 3 amy pew 3 1 + 1 x1 }+ s G(p)* 
p Le 
n0 k<psnt P nispsn P P mEn p>k P 


But p and q are prime numbers; so that 


x L < Const, and À = 1—0 
niSpSn P N on 
as n— œ. Hence, 


7 53l G(p)\? a Gp)" 
HM {| G— Ghr |?} Slim sup x + const. P + 3X 
. n>% kecpeat >k P 
On letting here k—> oo, and using the fact « —>0 as k— co, one sees Le 
(24) and (25) that fis proof of (26) is complete. 


9. It is now easy to conclude that f(n) is almost periodic (B°) and. 
satisfies (8). 

In fact, since it was proved in $ 4 that fẹ is dus periodic (RB?) in 
virtue of the convergence of the series (ii), it is sufficient*to show that 


H{|f— ff} 30 a k> 00. 


But the truth of this relation is implied by (19) and (26), since it is clear 
from (20) and (21) that | 


H{|f—f PPS Ar —-R ER + E| G—G P}. 


d 
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Proof of the necessity of the conditions. 


What remains to be proved is that the sufficient condition EE by 
the convergence of the two series (i), (ii) of $1 is necessary as well. Thus, 
from now on till the end of the paper, the assumption will be that the f(n) 
is any given real, additive function which is almost periodic (B°). 


10. , Since f(n) has an asymptotic distribution function, the two series 
(I), (I) of $2 are convergent. And, in view of (5), the convergence of 
(IT) implies that 


(27) Les 
role P 
In terms of the given f(n), define an additive function D(n) by placing . 
(28) 0 Def 


where H = H (n) denotes that additive function for which the double sequence 
{{H(p°)}} is given by ; 


F(p’), 11 
(29) H(p') = < f(p), if 1=1 and | f(p)| 21, : 
Co, if f=-1 and: | f(p)| <1, 


` (p = pr and k—=1,-2,3,-- +). Thus, 


0, if l1, 
D(p!) -{0 if 1—1 and | f(p)| 21, 
(p), if b= 1 and | f(p)| <1, 


and so it is clear from (27) that one obtains two convergent series by writing 
D for fin (i)-(ii), $1. Since the first half of the italicized statement of § 1 
was already proved (§ 5-$ 9), it follows that D(n) is almost periodic (B°). 
Since f(n) is almost periodic (B?) by assumption, one sees from (28) that 
H (n) is almost periodic (B°). 

In particular, H (n) has a square-average 


(30) : M(H} < + o. 
11. In what Sollows, r will denote any of those prime numbers for which 


the absolute value of the given additive function i is 3 not less than 1. Clearly, 
(27) may be written in the form 


(31) I (i=) >. 
Wp cr) [zt r 
0 
Since also the density of the quadratfret integers is a positive number 
(== 62-7), a standard application of the sieve of Erathostenes shows that (31) 
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may be interpreted as follows: If n, j are positive integers and p is a prime, 
let N = N (n, p, j) denote the number of those integers between 1 and n which 
are of the form pfs, where s is quadratfrei, is not a multiple of p, and not a 
multiple of any of the primes r (defined by | f(r)[ 1). Then there exists a 
constant 8 > 0 which is independent of n, p, j} and is such that 


N = N (n, p,j) > Bap. 


Hence, it is clear from the definition (29) of the additive function H (n), 
that 


3 H(m)> 3 FE H(p, 
mel pion P 

where the summation indices p(== 2, 8, 5,-::) and 1(—1,2,---) run through 
those of their combinations for which p! <n. Thus, on writing this inequality 


in the form 


aa N < const, = = à H(m)?, (const. = 81 < 0), 
phon 
and letting »— «, one sees from (30) that 
2 
(32) 3 ae (ys 
E1 p 
where p runs through all primes. 


12. In view of (29), the content of (32) is that, on the one hand, 


(33) Ra < o, 

and, on the other hand, g 

(34) | x IC; 
mæ P 

while (34) implies that 

(35) s LOI, 
role P 


Finally, as pointed out at the beginning of §10 (cf. §2), the series 
(I), (11) of §2 are convergent. This means, in view of (5), that 





Irmis P 
and that also 
(37) = F(p) is convergent. 


rois P 
Now, the convergence of the series (i) and (ii) of $1 is clear from 
(87), (35) and (36), (34), (33), réspectively. 


INSTITUTE FOR ADVANCED STUDY, 
THE JOHNS HOPKINS UNIVERSITY. 


STATISTICAL INDEPENDENCE AND STATISTICAL 
EQUILIBRIUM.* 


By PHILIP Hartman and AUREL WINTNER. 


Consider a conservative dynamical system which has a finite number of 
degrees of freedom and a Hamiltonian function possessing everywhere con- 
tinuous partial derivatives of the second order. Suppose that some fixed value 
of the energy constant A determines a closed, bounded energy surface Q == Q (h) 
in the phase-space; and that this Q does not'contain too many or too high 
critical points (e.g., that no point of © is an equilibrium solution of the 
system). If P is any point of Q, the isoenergetic differential equations deter- 
mine on Q a unique phase-path P; for which Po = P, and which exists for 
—o <ti<+o. The resulting isoenergetic flow on 2 may also be described 
by placing P: = rP, where 1, — co < t< + œ, is for any fixed t a topo- 
logical transformation of Q into itself, and the function +P of (t,P) is 
continuous on the product space of Q and the t-axis. If one projects the 
euclidean Lebesgue measure of the phase-space on the energy surface Q in 
the usual way,! and denotes by (E) the resulting Lebesgue measure of an 
arbitrary Borel subset # of Q, then p(r:E) —»(#) for every # and t, since 
the isoenergetic differential equations which define +; satisfy the condition of 
Liouville.? 

Since obviously 0 < »(Q) < co, it may be assumed that p(Q) == 1. Thus, 
Birkhoff’s ergodic theorem is applicable ° to the flow r: on ©, and states that 
the path P; has an asymptotic distribution function unless the initial condition 
P = P, is chosen on a set of w-measure 0. It is understood that by the 
asymptotic distribution function pp of a path P: is meant an absolutely 
additive set function dp == dp(H), defined for all Borel subsets # of Q and 


* Received February 14, 1940. 

+Cf. e.g, T. Levi-Civita, Journal of Mathematics and Physics (M.I.T.), vol. 13 
(1034), pp. 22-28. ° 

2 For n = 2, cf. the explicit equations of G. D. Birkhoff, Transactions of the Ameri- 
can Mathematical Society, vol. 18 (1917), pp. 211-212. 

-3 G. D. Birkhoff, Proceedings of the National Academy, vol. 17 (1931), pp. 656-660. 
The necessity of excluding possible discontinuity sets (cf. footnote‘) of the asymptotic 
distribution function belonging to a general P was pointed out by A. Wintner, ibid., . 
vol. 18 (1932), pp. 248-251; cf. P. Hartmap and A. Wintner, American Journal of 
athematics, vol. 61 (1939), pp. 977-0984. 

Cf. also A. Wintner, Nature, vol. 145 (1940), pp. 225-226. 
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having the property that if E is any continuity set‘ of dp, then the ¢-set 
defined by P, C E is relatively measurable, and has ¢p(£) as its relative 
measure.’ In other words, ¢p(#) is the probability that the path Ps, 
-— œ <t<-+ æ, which is determined by P = Po, be in the portion E of Q. 
Since Q is compact, the total probability ¢p(Q) is 1. 

Any given Borel set # is a continuity set of ẹp (FE) for almost all P. The 
proof of this fact will be omitted, since it readily follows ‘from an estimate 
which occurs in Birkhoff’s proof ® of the ergodic theorem.*® 

The fact just mentioned, when combined with Lebesgue’s term-by-term ` 
integration of uniformly bounded sequences, obviously implies that 


(1) ONS D= f or(E) dow 


for every Borel subset E of Q. 

Consider the product space 2 X Q consisting’ of all pairs (P, Q) of points 
of Q. Obviously, products E X F of Borel sets of Q are Borel sets of O X ©. 
Tf on Q X Q a Lebesgue measure vis defined by placing v(# X F)=uw(E)u(F) 
for Borel sets E X F, Birkhoff’s ergodic theorem is obviously applicable to 
the product flow r: X rt, with v as the invariant measure on QX Q. Let pq’ 
denote the asymptotic distribution function of the path (P:, Qt) = (rP, tQ), 
where it is understood that a set of initial points (P,Q) of y-measure 0 must 
in general be excluded. 

In what follows, use will be made of the fact that if gx(P) denotes the 
characteristic function of a Borel set K of Q, then 





(2) dim y fO S gu(Pedden)( f ge(Qs)dan)dt— f #r0(E X P) deo 
4 B 


AXB 


* By a continuity set of a distribution function is meant any Borel set H which 
has the property that the distribution function attains the same value for the two Borel 
sets which represent the exterior and the closure of H. It is known that the Borel sets 
which are aot continuity sets of a fixed distribution function are exceptional in the 
same sense as the discontinuity points of a fixed monotone function of a single variable. 

5 A measurable set 7 of points of the t-axis is said to be relatively measurable 
if the Lebesgue t-measure of the common part of T and a fmite interval ust 5v, 
when divided by the length v— u of this interval, tends to a limit as v—wu%™; 
in which case this limit is called the relative measure of T. 

* As to this estimate, cf. N. Wiener, Duke Mathematical Journal, vol. 5 (1938), 
pp. 1-18 (cf. p. 2). 

7 It should be emphasized that this product space is meant in the usual topological 
sense and is not, as it somehow became customary in ergodic theory, the symmetric prod- 
uct space. In other words, the points (£, Q) and (Q, P) of Q XQ will not be identified 
in the present paper. l 
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holds for arbitrary: Borel subsets A, B, E, F of Q. In fact, if E and F are 
fixed, the remark which precedes (1) shows that Æ X F is a continuity set of 
épq for almost all points (P,Q) of QX Q. On the other hand, the ergodic. 
theorem, when applied in its usual form to ‘the fixed point function 
f=f(P, Q) = gr(P)gr(Q) on 2 XQ, states that the limit 

im =f gu(Pe) gn (Qe) at 

vu VU — : 
exists for almost all points (P,Q) of QX Q. Since the definition of the 
asymptotic distribution function pro implies that the latter limit has the 
value $ra (E x F) whenever E X F is a continuity set of ¢pq, it follows that, 
if E and F are ffixad, 








lim 
vu V— 


holds for almost all points (P,Q) of QX Q. Hence, (2) follows by Lebesgue’s 
theorem on term-by-term integration of uniformly bounded sequences. 

Two solution paths P+, Q: on Q are said to be statistically independent 
if the three asymptotic distribution functions pa, br, dg exist and satisfy the 
product condition 
(3) $ro(B X F) — $r (E)¢a (P) 


for all Borel sets E X F, E, F of Q X 0,9,0 which are continuity sets of 
pa, bp, po, respectively. | 

It turns out that the incompressible flows r: on Q which possess this 
property of the statistical independence of almost all pairs of paths on © are 
interrelated with the incompressible flows rt on Q which possess there a 
property of statistical equilibrium. From the physical point of view of sta- 
tistical mechanics, this 'somewhat hidden interrelation between statistical 
independence and statistical equilibrium might perhaps have been expected. 
But we were unable to find any reference in the literature to the interrelation 
of these two physical concepts. On the other hand, the mathematical literature 
contains all of the tools necessary for this identification. In fact, Birkhoff’s 
ergodic theorem, when stated as above in terms of asymptotic distribu- 
tion functions,’ insftres that the notion of statistical independence may be 
meant in its mathematical formulation, used loc. cit.7; while it is known that 
the notion of statistical equilibrium may be anche mathematically as 
follows : ° 


==" ga(P:) gr (Qs) dt = ra (E X F) 


8 Cf. P. Hartman, E. R. van Kampen and À. Way American Journal of Mathe- 
matics, vol. 61 (1939), pp. 477-486. 
° Cf. E, Hopf, Proceedings of the National Aoademy, vol. 18 (1932), pp. 333-340. 
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Suppose that the flow r: has the property that if there are given any 
Borel subset Æ of Q and any “density of probability ” as an integrable func- : 
tion f =f(P) of P on Q, then the probability carried by the set into which 
E is shifted by the flow 7; tends to a limit as to. If this condition is 
satisfied, i. e., if any given initial probability distribution is transformed in such 
a way as to become practically independent of 4 for large t, with reference to 
any Borel set E, then the flow +; is said to tend to statistical equilibrium. 
Since it may be shown * that, instead of arbitrary integrable functions f, it is 
sufficient to consider characteristic functions gr(P) of arbitrary Borel sets F, 
the condition for a flow 7: to tend to'statistical equilibrium consists of the 
existence of the limit? ` 


(4) lim p(B: F), (E: = mE), 


tro 


for any pair Æ, F of Borel subsets of Q. In fact, condition (4), where A-B 
denotes the common part of A and B, is precisely the previous definition, 
since obviously 


(5) [or Pdu = eE: P). 
-e 


It is clear from (4) that if the limit (3) exists, its value is 
(6) lim a(Be F) = f ér(Pdrne f $e(8) den, 
00 , , 

E F 


where‘ gp is the asymptotic distribution function of P:. If it is only required 
that »(H:- F) should become practically independent of ¢ on the average, in 
the sense that, instead of the existence of the limit (6), one merely has? 





(7) lim — 


Í * [a(Es: F) — J ¢p(F) dep ]*dt = 0 

cu00 TU Ju 

for any pair Æ, F of'Borel subsets of Q, then the flow v: is said to tend to 
statistical equilibrium on the average. While it is clear that statistical equi- 
librium is sufficient for statistical equilibrium on the average, the converse is 
not true, at least** if the flow rs is not required to be one determined by an 
isoenergetic dynamical system. Incidentally, the content of the requirement 
(7) remains unchanged if one replaces the square [ ]* by |[ ]|; in fact, 
the integrand [ ]? is a bounded function of t, since 0S pS 1. 


19 Cf. G. D. Birkhoff, loo. oit.*; N. Wiener, loc. cit.1. 
11 An example to this effect was given by B. O. Koopman and J. v. Neumann, 
+ Proceedinga of the National Academy, vol. 18 (1932}, pp- 255-263. 
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The interrelation between statistical independence and statistical equi- 

- librium, as announced above, may now be formultaed as follows: In order 
that a flow be such as to make almost all patrs of paths statistically inde- 
pendent, tt is sufficient (but, at least in case of a non-dynamicäl flow, not 
necessary) that it tend to statistical equilibrium; in fact, almost all pairs of 
paths are statistically independent tf and only if the flow tends to statistical 
equilibrium on the average. 

That a flow r: which makes almost all pairs of paths statistically inde- 
pendent is necessarily a flow tending to statistical equilibrium on the average, 
‘is implied by the second half of Theorem 5 of E. Hopf.’ In fact, one can 

. easily prove that his Theorem 56 is to the following effect: There is tendency 
toward statistical equilibrium on the average if and only if the ‘condition (3), : 
instead of being satisfied for all pairs (E, F), is satisfied for symmetric pairs 
(£,H#) only (it being understood that a zero set of pairs of points (P, Q) 
is always excluded). Apparently, it is this symmetry restriction % which has 
thus far disguised the interrelation between statistical independence and 
statistical equilibrium (either strict or average). In fact, as will be shown 
in the Appendix, two measurable functions of ¢ need not be statistically 
independent if the condition corresponding to (3) is required for symmetric 
pairs (F, F) = (FE, E) only. 

Nevertheless, it will now be shown that dogi all pairs of paths are sta- 
tistical independent in the case of a flow which tends to statistical equilibrium 
on the average. 

To this end, suppose that the flow r: satisfies the average condition (7) 
for arbitrary Borel sets F, F, and write ‘A, B for E, F; so that | 





(bis) lim 


L [tt 13 J $e (B)dan]’dt — 0. 

v-u-+00 V — 

Since both functions »(A:: B), (Es: F) of t lie between 0 and 1, it readily 
follows from (7) and (7 oe that 78 


13 Cf, footnote 7. 
18 This depends ‘on the following obvious remark: If w(t), y(#) are bounded 
_ measurable functions for which there exist constants a, 8 such that 


dt +0, adi > 0, 








then 





as v-— uœ. It is sufficient to prove this in case w(t) and y(t) represent the same 
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lim 


PHO) 


K 
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= T [a(4e B)a(Bi-F)—( | dr(B) dou) ( f pot) dou) ]?dt à 0, 
z A E 


or, if B and E are interchanged, 


es 


v-u00 Ù — ry 





T 


On the othe 
AXB 
But comparis 


| AXB 


pral E X Fdpqv=— lim 


S? Ae B)a(Be-F)—( f $0(B) dew) ( f balP)don) Tdi — 0. 
š A B 


hand, the identities (2) and (5) imply that 





— u 


frac . E)a(Bi- P) dt. 


v 


on of the last two relations gives 


dro (E X F)degv— ( È ¢r(E)dvn)( [` bo(P)don). 
fee ain 


Hence, by Kubini’s theorem, 


pe(E X F)dpev = 


f $e (E)ba(F) dear, 
AXB 


AXB 


since jee X F) was defined as the product measure u(Æ)u(F). Since 


the factors A, 


it follows f 


o 
TOR 


If, inste 


B of the integration domain A X B are arbitrary Borel sets of Q, 
m the separability of Q, that the condition (8) of statistical 
is satisfied by almost all points (P, Q) of OX Q. 


de 
> 


* * * * * x “ 


ad of two pathe Pi, Qr, one considers n paths Pi, Qn: © *, Re, 


their statistical independence is defined by the requirement € 


(3 bis) gpg 
which reduce 


. REX FX: +> xX G) = pr(E)pa (F); ` + on (@), 


s for n= 2 to (8). It is known®.that n= 3 given functions — 


function, and de successively choose the latter function to be a(t), y(t), a(t) + y(t). 


But if »(t) is 
E 


(It is seen th 
functions @, y.) 


bounded, then f 
(t}°— a} = [o (t) + al"lo(t) — a]? S const. [o (t) — a]’. 
t it is sufficient to assume the boundedness of only one of the two 
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need not be statistically independent if any of the three pairs which may be 

selected from them consists of two statistically independent functions. This 

might be one of the reasons why, on the basis of mere time averages of the 

solutions of the (isoenergetic) differential equations of classical dynamics, no 

mathematical theory has been developed thus far for physical facts of the type 

of the Maxwell-Boltzmann distribution, or of the H-theorem. For these facts 

are asymptotic statements of the same type as is the validity of the normal 

`` distribution law in theory of errors; so that the number n of independent 
realizations of one and same model must be chosen arbitrarily large, since no 
statement can be made for a fixed n (in particular, for n = 2). 

But it turns out that, due to the fact that the product spaces considered 
are not the symmetric product spaces,’ it is not difficult to pass from n = 2° 
to any n, While this sounds surprising in view of the examples just men- 
tioned,® all that actually happens is that n-uples of paths, possibly exceptional 
from the point of view of independence, are contained in zero sets which may 
vary with n. In other words, if the flow makes the two paths Pi, Qi sta- 
tistically independent for almost all choices of (P,Q) on OX AQ, then tt also 
makes the n paths Pr, Qu: °°, Re statistically independent for almost all 
choices (P, Q, >°, R) on 9X QX---+ XQ, where n is arbitrary and it is 
understood that the sets excluded are zero sets with reference to the product 
measures (of ») on OX Q and A XQ X: -X |, respectively. 

In fact, it is clear that the calculation following (7 bis) may be carried 
out so as to show that the assumption (7) implies the statistical independence 
of almost all #-uples of solution paths, not only for n =? but for arbitrary n. 

_ Hence, the italicized statement follows from the fact that the requirement (7) 
of ultimate statistical equilibrium on the average was seen to be equivalent to ` 
the requirement of the statistical independence of almost all pairs of paths. 

It follows that the flow satisfies the requirement (7) of ultimate sta- 
tistical equilibrium on the average tf and only tf the paths in almost all n-uples 
of paths are statistically independent. This fact is, from the physical point 
of view, more important than the equivalent criterion in which n is restricted . 
to n==2. In fact, it now becomes admissible to consider a product space 
XQ X-+++ XQ of arbitrarily many factors, thus introducing the flow on 
Q in n independent copies, and then make n— 0. But this ts precisely the 
relevant mathematical. GRR of the theory of limit distributions in 
statistical mechanics. 

If the incompressible flow 7: on Q, instead of satisfying any statistical 

` assumption, is such as to make the asymptotic distribution function ¢p of 
the path P: independent of the initial condition P (for almost all P), then 
the flow r: is necessarily metrically transitive, since (1) then reduces to 
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op = p for almost all P. In particular, the class of those flows which satisfy 

the requirement (6) of ultimate statistical equilibrium and are at the same 

time such as to make dp independent of P for almost all P, is'identical with 

the class of the flows to which the an some respect misleading) name 
“ mixture” was’ given, 

Hedlund # has recently proved that if Q is a mene Riemannian 
manifold of constant negative curvature, of finite connectivity and of finite 
area, then the geodesic flow on Q is a mixture. It follows, therefore, from 
the last italicized theorem, that the geodesic flow on any such Q makes the — 
paths of almost all n-uples of geodesics statistically independent of each other. 
Notice that in this example one has, besides statistical independence, asymp- 
totic equidistribution of almost all paths; so that no example of an isoenergetic 
dynamical system seems to be known in which almost all pairs of paths are 
statistically independent but which is not metrically transitive. 


APPENDIX. 


It is known ® that two real measurable functions x(t), y(t), —acic+o, 
are statistically independent if and only if the Fourier average 





(I) - A(u,v) = lim 


8-100 3 


Z f exp i{ue(t) + oy Jat 


exists uniformly in every fixed bounded domain of a real (u a and 
satisfies the functional equation 


(II) - A(t, v) =A(u, 0)A(0, v). 


On the other hand, if instead’ of statistical independence, which corresponds 
to (3), one requires that the condition corresponding to (3) be satiafied for 
symmetric pairs (F, F) = (E, E) only, then an obvious adaptation of the 
considerations ‘applied loc. cit.® shows that (II) must be replaced by the 
weaker condition . 


(III) A(u,v) + A(v,u) = A (u, 0) A(0, v) + A (v, 0) (0,1), 


[which is again necessary and sufficient, provided that°(I) exists uniformly 
in every fixed bounded domain of the (u,v)-plane, i.e., provided that the 
vector (a(t), y(¢)) has an asymptotic distribution function]. But it is easy 
to construct a pair (e(t), y (t)) which satisfies (III) without satisfying (IT). 
Actually, the pair will be chosen periodic in t; so that (I) reduces to 


u G, A, Hedlund, Annals of Mathematics, vol. 40 (1939), pp. 370-383. 
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1 
(IV) A(u,v) = f° exp i{uc(?) + oy(t) }dt, 
if the period is.1. 
First, define a function L of two real variables” u, v by placing 


9L (u, v) = | + etium) + ett (win) + giu + Deiiv + Qeiiury) | 


Then an easy calculation shows that the functional equation (III) is, and 
‘that (IZ) is not, satisfied by A == L. | 

On the other hand, the function L(u,v) is a trigonometric polynomial 
in which the coefficients of the exponentials are positive and have the sum 1. 
This means that L (u,v) is the Fourier-Stieltjes transform of a 2-dimensional 
purely discontinuous distribution function (with a finite number of jumps). 
Hence, it is clear that one can choose on the interval 0 = {= 1 two siy 
functions s(t), y(t) for which A — L satisfies (IV). 

Incidentally, the trigonometric polynomial L(u,v) is seen to satisfy 
the symmetry relation L(u, 0) —L(0,u). This means that the two functions -Ţ 
z(t), y(t) have the same distribution function. 


QUEENS Corzos, 
Tue Jonxs HOPKINS UNIVERSITY. 


ON AN ASYMPTOTIC FORMULA FOR THE FOURIER TRANS- 
FORMS OF DISTRIBUTIONS ON CERTAIN CURVES.* 


By E. K. HAVILAND, 


The smoothness of infinite convolutions of the type occurring in the 
theory of the Riemann zeta-function has been treated by an estimate of 
Fourier-Stieltjes transforms of the distributions on. convex curves.. An earlier 
method * of obtaining such an estimate consisted in an extension of the usual 
estimate of the Bessel functions Jn, making use of a lemma of van der Corput 
- and an assumption that the spectra are sufficiently smooth convex curves, 
The resulting estimate has then been refined? in such a way as to yield an 
asymptotic formula also. In the case where merely an appraisal is desired, 
the foregoing method has been superseded by a simpler and more general one,’ 
quite elementary in nature, which is free of the restrictions of dimensionality, 
analyticity and convexity imposed by the earlier method. This latter method 
does not, however, admit of obtaining an asymptotic formula, and it is the 
purpose of the present paper to obtain such a formula without the restriction, 
of convexity and with fewer restrictions on the smoothness of the curves. The . 
increased generality is obtained largely by following a method of Hartman‘ . 
for obtaining an asymptotic formula for exponential integrals. 

Let z = 2(¢), y= y(¢), where 0S ¢ <n, be a parametric repre- 
sentation of a Jordan curve, 8, to be described more precisely below, in the 
(z,y)-plane. Let o—o(#) be an absolutely additive set function defined, 
for every Borel set, F, of the (z, y)-plane, by setting o(F) equal to 1/2m times 
_ the linear measure of those œ for which (2(¢), y(¢)) is contained in FS, 
if F is any open set in the plane. In particular, it is seen that S is the 


* Received November 22, 1939. 

3 Cf. A. Wintner, “ Upon a statistical method in the theory of diophantine approxi- 
mations,” American Journal of Mathematics, vol. 55 (1933), pp. 309-881; B. Jessen 
and A. Wintner, “ Distribution functions and the Riemann zeta function,” Transactions 
of the American Mathematical Society, vol. 38 (1935), pp. 48-88. 

3Cf. E. K. Haviland and A. Wintner, “ On the Fourier transforms of distributions 
on convex curves,” Duke Mathematical Journal, vol. 2 (1986), pp. 712-721. 

2 Cf. A. Wintner, “ On the smoothness of infinite convolutions of the type occurring 
in the theory of the Riemann reta-function,” American Journal of Mathematics, vol. 61 
(1939), pp. 231-236. 

t Cf. Philip Hartman, “ An asymptotte formula for exponential integrals,” American. 
Journal of Mathematics, vol. 62 (1940), pp. 115-121. 
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spectrum’ of e. From the definition of Lebesgue and of Radon integrals, 
it follows that ; 


(D A(u oe) = f [exp [i (ue + oy) devo (BD 
=> [exp [i(ue(9) + 29(4)) Jde. 


On setting u == r cos y and v = r sin y, one obtains 





(2) AmA(reoyrainyse) =E f” exp G y)ldė, 
where . 
' (2a) EE T E T) 


It will be assumed that 


‘(i) æ(#) and y(p) possess second derivatives of bounded variation; 


(ii) W(ẹ;y) has, for any fixed y, ‘exactly n zeros on the curve S and 
` these zeros are all simple. Furthermore the zeros of h” (¢ġ;y) are all simple® 
und n in number. Here and in what follows n is a fixed positive integer and 
a prime denotes partial differentiation with respect to ¢. As a consequence 
of (i), h’(¢) =x” ($) cos y + y”(b) sin y is continuous on the torus T: 
(OS p < 27;0 Sy < Rr). As a consequence of (ii), the zeros of h” sepa- 
rate those of hk’. Thus the convex curves previously treated? are included as 
a proper subclass of the curves 8 now considered. 


Under the foregoing assumptions, it will be shown that 


(3) A= CALE {A (ha (4) 5 prie exp [i(rh (gaa (y) ; 4) + n/4 + x/4)] 
rites (Gas); y) J> exp [i(rh(due(g) 5 Y) —7/#)1} + o(r3), 
where the o-term holds uniformly for all y, and 


pus == bars (Y), pars = par (Y). (k=1,: ` *,n/R); i 


represent thd zeros of h’ on 3, the former corresponding to maxima of h and 
the latter to minima. It will be observed that the o (14) of the present paper 
replaces the O (7%) of the previous treatment, so that we now v get precisely 
an asymptotic formula, without a remainder term. | 

The proof of (3) proceeds as follows. First, the minimum distance 
between a zero of A’ and a zero of h” has, for reasons of continuity, a positive 


5 For the definition of the spectrum, cf. A. Wintner, “ On the addition of independent 
distributions,” American Journal of Hathemafies, vol. 56 (1934), pp. 8-16. | 

¢ (ii) might be generalized, under suitable assumptions, to the case where the 
second derivative has multiple zeros. . 
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lower bound Ë independent of y. Let px = paly), (k= 1,:::,n), be the 
zeros of k”(p;#) and let them be so situated that : 
Pi < pe Lpa < fas < pan-s < Gane < dena < gon < pi F Dr. 
Finally, let 2n numbers # be so chosen, as indicated more precisely below, that 
pi < M < ba me < pa << Pan < Nen < Gi + 2r. 
As hR” ($; y) is continuous on the torus T, it is clear that, if we write 
Rb 4) =h (dus) + (bes os 4), (k= 1,2, > -,2n), 
then (¢s;¢;y) will possess the same property, so that to a preassigned 
« > 0 there corresponds a § =$ (e), independent of ¢ and of y, such that 
|o(dusdsy)| <e | 


for all $ such that | $ — px | < 8. 

Moreover, it is clear from (ii) that there exists a positive constant a such 
that | A” (pı (4); y)| > a for every y. Then one may choose yı == y: (y) 80 that 
it lies between ¢, + £/3 and ¢: + 26/3 for all y, where ¢ == min (£,8(a/4)) 
and is therefore independent of y, and a similar choice will be made for the 
remaining #8. 


From (2), 
DRE i te | 
= J, + Je + Ja + FERA ae 


say. 

These integrals fall essentially into two classes: those, such as Ji, in 
which the integration range poësesses an end point pæ, (&=—1,2,:::,n), 
and those, such as Jz, with an integration range containing a point. Phe: 
(k= 1, 2,-- -,n), in its interior. In order to treat J}, set ford. S$ Sm, 
where gi = ¢i(y) and m =m (Y), 
(5) | Pom h(p1; Y) —h(h;y) 
for every fixed y, corresponding to the fact that ¢: is a simple zero of h by 
assumption (ii). On taking the positive square root, | 
(8) | t= | hlg; y) — hlg; y) fi, 
so that as } increases steadily from ¢1 to m, the variable ¢ increases steadily 
from zero to a quantity m(#) == | A(d:(¥) 34) —R(m(y) 54), which has : 
a positive lower bound £ independent of y in virtue of (ii). Hence ¢ is in 
[:(#);, m(y)] a monotone, continous function of ¢. 

Moreover, if a dot represents partial differentiation with respect to t, 


658 | E. K. HAVILAND. 


NP $ = — 2t/h'(b(t¥) 54), if 0 <iE (y), 
80 


1 i a (4) . 
(8) Jı =— exp [irh (u(y) 5 ¥)] f exp [i]t (H(t, ¥) sv) de. 
The integral in (8) is of the form | 


(9) LT FC exp [ire], 
where 
(10) FE) = f(E y) =U (A(t, Y) 50). 


It is known‘ that the integral (9) can, for every fixed y be evaluated asymp- 
totically under the assumption that f(t) — f(t; y) is of bounded variation in 
[0, a (4)]. That the function. (10) possesses this property may be seen as 
follows: : 

Applying Taylors Theorem with the integral form of the remainder, 
one obtains 


D RGP) HH) = RH) + HIE) 
+ f ($ — pı — 8) K” (+ 8) ds, 


where d = ġı (Y4). Since A’ (pı) —0, (6) becomes, after a change of inte- l 
gration variable in (11), . | 


a) t—(— [Too a. 
Similarly, we have | 
: | $ 
(13) W (39) =K ($) = [Code 
$i $ 


Since by hypothesis A” ($; 4) =k” (¢) is, for all fixed y, a continuous func- 
tion of $ and since A” (¢1) +40, we may write, as above, 


W’ (p) =K” (p1) + 0() 5 [olg)] <e if |#— gi] <8(@). 
` Then (12) becomes 


t= (EW (6) — f° a)l) 


and (13) becomes | 
WCG) = (bb) K (h) + f° os) as 
Substituting these values into (10), we obtain 
EH Cu) 5 — (9-4) fF" ($—s) 08) a5} 
a nn re DS 


(14) FE) = FEY) = | j 
W"($s(¥) H (dd) f olad 
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(45) = {— phi” — La J/h” + Lio), 
where . | ; ; 
(16) Lip (6— 0)? f| (b—s)P0(s) ds. 


Now the Lup, (p = 0, 1), are, for fixed y, continuous ae of œ for 
(4) < $ & m(y); and ' 


(17) [La |S (p— 4)" A wien 


so that the L;, are, for fixed y, continuous functions of gin pi(#) S¢Sm(y) 
also if we define Li, <0 for pp. Furthermore, as pointed out above, 
ô == (€) in (17) is independent of # and of ; on T. By virtue of the 
choice of y; and of the existence of the quantity” a, it follows that for all 
- , (h SSE m), and for all y, 
(18) l [Rs + Lio | > 4/2 and |—4$hi” — Ly, | > a/4. 
From (15), (16), (17) and (18), it is clear that f is a continuons function 
of p in hi € bi m and since ¢ is, for fixed y, a continuous function of.t in 
OS tS a (y), it is seen that f(t; y) possesses the same property. Then as 
de ps it is clear that à tends to a definite limit as t— + 0, which 
implies that ¢ exists and (7) holds at t=0 also. 

We now proceed to show that f(t) —f(t;4), as defined by (10), is a 
function of bounded variation in # for OS¢*Sa,(w). In the first place, 
if a function f(#) is of bounded variation in ¢ and œ in turn, is a monotone 
continuous function of t, say in [0, a], then f(#(¢)) is a function of bounded 
variation in ¢ in an appropriate interval. Consequently, by virtue of the 
remark following equation (6), we need prove only that f, as a function of +, 
is of bounded variation. This we do by using (15) and the following familiar 

‘properties of functions of bounded variation: 


(a) The product of two functions of bounded. variation in an interval [a, b] 
is a function of bounded variation there. 


(8) Ifafunction F(z) is of bounded variation in [a,b] andif F(z) > y >0 
there, then (F(z) )4 and (F(z))~ are of bounded variation there. 


(y) The product of two positive monotone non-decreasing (non-increasing) 
functions is again a positive monotone non-decreasing (non-increasing) 
function. 


(8) a(z—b) + c is monotone increasing (decreasing) ifa>0(a< 0). 
(e) The mean value over a finite igterval of a function of bounded variation 


1 Introduced prior to equation (4). 


660 J _ B K. HAVILAND. 


(monotone function) is again a function of bounded variation (mono- 
tone function) .® i 


We now apply the foregoing results to the function F'(¢) defined by the 
right-hand member of (14), considering first the second term in the numerator, 
which may be written 

s— & ] ds 
p— o 


By - hypothesis, a = w m A where ;,@2 are monotone non- 
‘decreasing. Since «1, w2 are both bounded, there exists a positive constant, C, 
such that w:(s) + C, w:(8) + C are positive for all s in [du, mle Then the 
left-hand member of (19) may be written in the form 








(19) (@—4)" Le) SEF ae (892 f o) [1— 
gi 


(20) (bd) (ou(s) + O)as 
(a) (D Si (ols) +0) (2 — das 
— (#— 497 fT Cl) + Cas 


+ (b=) (b= A) f” (o2(8) +0) (84) ds 

= M, — (¢— $,)*M.— Ms + (¢— oi) TM. 
M; and Ms, are monotone by virtue of (e). The integrand in M: is the 
product of two functions of $ non-negative and monotone non-decreasing in 


o [pum]. Now if two functions F,(x), F.(«) are non-negative and monotone 
non-decreasing in an interval [a, b], then not only are 


M, (2) = (e—a) [°F (nds and Malz) = (e—a) f F(s) Pa(s)ds 


monotone non-decreasing functions of v there, which is a consequence of (y) 
and (e), but, in addition, as may be proved by using the First Mean Value 
Theorem, W::(x) =x(s)WM (v), where x(x) is again monotone non- 
decreasing. If we identify Fi (x) with o,(¢) + C and F(z) with $ — di, 
then M, corresponds to Ÿ,, and we have 


Ma ($) =x(6)Da(6) —x (6) (6— 81): f CRT ee (Ce ATCO 


“Le. if F(æ) is of bounded variation in [a, b] and aSiSb, then #(¢) 
~$ i £ 
= (£— a)" F(æ)dæ is of bounded variation in £. ‘The proof in the case of a 


function of “bounded varintion is readily Sbtained by decomposing F(#) into its 
monotone components. 
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Consequently, (¢— ¢1)*M2(¢) == 4x(¢) and is a monotone (non-decreasing) 
function of ¢, and in the same way the monotone character of (p— ¢:)7Mi(¢) __ 
may be established. Since the sum of a finite number of functions of bounded ` 
variation is a function of bounded variation, it follows that the second term 
in the radicand in the numerator of (14) is of bounded variation in ¢. 

That the second term in the denominator of (14).is of bounded variation 
in $ follows immediately from (e). From (18) and (8), it follows that the 
entire numerator of (14) is a function of bounded variation in ¢. Finally, 
by virtue of (18), we may apply (a) and (8) and infer that (14), as a func- 
tion of $, is of bounded variation in [bum] = [h (Y), m(4)]. Hence, as 
has been pointed out, f(t) = f(t; y) is, for fixed y in [0, 2r], a function of 
bounded variation in t in (OStSa,(y)). 

We now write | 


Fy) =f (t) =F 9) + L(t) —f(+0)]. 
In view of (15), this may be written 
(21) f(t) = —(— 2h”) 4 
+ C0) Bhi Ls} da” + Lao]/(— 2h”) (Ia + Lao) 
= f(+ 0) + 8 (64). | 
In the first place, we observe that | f(0)| 2 ae: uniformly for all y. 
Secondly, ®(¢;y) may. be rewritten in the form i 
: ahi’ "Lui — Rhi” ’ Lio — L’ 10 i 
CD SUW =T Lao) CO F M Las) 
From (18) it then follows that the absolute value of the denominator in (22) 
is not less than (2a)? (a?/4) = a54/2°/* > 0 uniformly with respect to ¢, 
while in the numerator A,” is uniformly bounded with respect to y and 


| Ljip |—> 0 as t—> +0, uniformly with respect to y, as appears from Le 
Consequently, 


(28) lance if OSG —H <il; ie, f0Si<(, 


where $, 8, are independent of y. 

We now prove, following the method of Hartman, that 

It f(t) = f(y) —F(-+ 0) + O(E;y), where ®(t;y) is of bounded 
variation in (0S tS a (y)) and @(-+ 0) —0, then 


(24) f° 1 exp [art lat e (+ OPH (A + 0609), 


where the o-term holds uniformly for all y.- 
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Proof: 
5 ap . aly) , 
(5) + SO IE) exp [ie] f F 0) exp [irt] 
| YS (ts) exp [—~irf*]dt — Ia + Db. 
Now ¢ | 
Ta f(+ 0) i exp [—irtt]ae—f(+0) fs exp [—irt"] dt. 
— Fi (+ 0)rrt exp [—in/4] —f(+ 0) (ae Cine]. 


But | f(+0)|[ = (—Rh/)ÈZ (Ra) for all y. Furthermore, 


+0 a. 
(26) | f exp [— irt"]dt | < Const./r. 
a(#) 


For G(r) = Í rs exp [— iy]dy exists and is O (13) in virtue of the Second 


Mean Value Theorem applied to a finite interval. On setting ri? == y, the 
integral i in (26) becomes, up to a constant factor, 


rO (r[a(4)]?°) = 0 (7%), since (y) > B > 0 for all y, 


where £ is the constant defined, above following equation (6). Consequently, 
by (21) and (26), | | 
(27) La —4(— hr) (=)t oxp [— tn/4] + O(r*), 


where the O-term is independent of y, in the sense that in absolute value it 
is not greater than const./r, where the constant is independent of y. 
It therefore remains to consider Ib, where ®(t; y) is of bounded variation 
in (OS tSa,(y)) and 6(-+0;y) — 0 uniformly with respect to y in the 
sense that 


| B(t;y)| <e forall 0St <8,8—8(c) independent of y. 
We next define the non-increasing function m(r) by 
(28) m(r) =. u. b. | (t; y)} for 0< tS rt; 0S ys de, 


so that m(r) +0 as r—> -+ œ. Since we are interested only in very large 
values of r, we may always suppose 0 < r1 = <p < &(ÿ) for ally. Let A(r) 
be a non-decreasing function of r which becomes infinite with r so slowly that 


(29) mL (Ar) TA (De 0, (r> +). 
In particular, we may let A(r) — min (1/4, (m[r/4])4). Now 
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pay) y atty) : 
(80) S7" (#34) exp [irt] = 3 LU 8 (Bs y) tt exp [— treat. 
` 0 
Consider the last integral from 0 to b, where 0 < bSa,?(y). 


< m(b4) f Bat — mo) rs 








F ‘ 
(81) | (etes [réa 
o 
If we place b = 171 (A (r))*, the last expression on the right of (31) becomes 
Rm[r8/A(r) JA(r) 14 = o (rà) 
‘by virtue of (29). Hence 
b : . 
(32) f D(A; y)thexp [— éré]dt = o(r8) = ¢(r)rà, 
0 
where | {(r)| < «ïf r= R(e), R independent of bt since m(-) is by defini- 
tion independent of y and A depends only on r and on m(:). 
3(#) 
In order to appraise f b(t; y) exp = irt|dt, we apply the 
; 

Second Mean Value Theorem to the monotone function ##, obtaining 


(38) ba f B(A; y) exp [iri]dt + [oxy [Ve 9) exp [irtis 


where it is understood. that the Second Mean Value Theorem is applied sepa- 
rately to the real and the imaginary parts of the integral, the notation being 


Lena feat fre @ <a < arly) sd h< ar. 


Now &(#;y) is of bounded variation, inasmuch as ®(t; y) is, so it may be 
supposed without loss of generality that ®(#;y) is a bounded monotone 
function, whereupon the Second Mean Value may be applied to each of the 
integrals in (33). From (17) and the continuity of k”(#$;w#), hence of 
w(b;w), on the torus and consequently for 0Sta(¥) or OS rSa,7(y), 
where + is the ¢ of the right-hand member of (30), and for 0 & y < 2r, 
it follows that Lio and Lu are bounded in 0Ær<a?(#) uniformly with 
respect to y. Therefore, from (22) and the remark immediately following it, 
one infers the existence of a constant K such that 


| ®(#;y)| <K forall t,0StSa,*(y), and all y in (0S yp < x). 
Finally, 0 < b < a? (4), so that [a,(¥)]* S b+ and from (33) it follows that 


(34) [fee 3¥)t4 exp Lee = 16Kbàr: = 16K[à (r); — 0o(rà), 
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where the o-term is uniform with respect to y in the same sense as in (32). 
From (25), (27), (30), (82) and (34), it then follows that - 


f B N E E EE [—ix/4] +o(ri), . 


corresponding to (24). 
Substituting into (8), we obtain 


(85) Jı = 4 (Bar) TR (oi (9) 59) P exp [i(rh (di (4) sy) — 2/4) ] + o(ré), 


the o-term being uniform with respect’ to y. 

To calculate the integral Jz, we observe that A’(¢;y) is negative for 
n(Y) SoS n(Y), 80 that h(m (Y); y) —h(b;w) is in this interval steadily 
increasing from zero, and if we set 


tem | hm (Ws) — h(g; y) b, 


t increases from O to a2(W) = | h(m (y); y) —h(m(y) ; W)l as ġ increases 
from’ yı (4) to (y). By the introduction of ¢ as integration variable in J2, 


| 1 ; * at) : j 
Jam oxp [irh] S exp [ire] (A(t y) iydi. 
CACR | 
This last integral is of the form Í f(t; y) exp [— wt]dt, where 
0 


(36) f= f(t; 4) = t/h’(o(t 4); Y). 


Just as ņı(y) has already been so chosen that £/3 < m(#) —¢i(¥) < 26/3, 
one may so choose 72(#) that £/3 < $s(w) —m(y) < 24/3, where ¢ (defined 
just above equation (4)) is independent of 4y. Then from continuity con- 
siderations it follows that X (¢(t, Y) ;y) >y > 0 for allt in (0OStSa(y) 
S (2u)4) and for all y, x being the maximum of h(¢;¥) on the torus. 
Since k”(#;w) is continuous and of bounded variation in ¢, h’(¢;~) enjoys 
the same properties. Moreover, œ is a continuous monotone non-decreasing 
function of t, so that h’(¢(t,~);¥), as a function of tin (0S tS a(y¥)) 
is of bounded variation in ¢. Consequently, by (8) and («), f(t) is, for fixed 
y, a function of bounded variation in t. Moreover, f(0;#)—0/h/(m(y) :#)=0 
for all y, so that, in J2, f(t; y) plays the rôle of f(t; y) —f(0;%) = ®(t;y) 
in Jı, and, inasmuch as 


(37) [f(y] Stn (iy, 


it follows on the one hand that |f¢;¥){<e for all ¢ such that 
OS t < 8 = ôa (€), where ô» (e) is independent of #, while, on the other hand, 
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- there exists a constant K, such that | F; y)| < Kı forall tin (0StSa(y)) 


and all y. By the same reasoning as that used in the calculation of J,, it then 
follows that 


(38) a A i 


where the o-term is independent of y. 

To each zero of h’ of the form Das, (k = 1,---+, 2/2), there correspond 
two integrals, of which one, like J,, has pus as lower integration limit, while 
the other, like Jn, has pax-s(— ax-s + 2) as upper integration limit. The 
contribution of order r% from each of these may realy be shown to be the 
same, Viz. 


(89) £(2ar) AL — A” (pus(y) so) 13 exp [i(rh (pus (y) 58) —2/4)]. 


Similarly, by a slight modification of the foregoing reasoning, it may be proved ` 
that to each zero of h’ of the form bg, (k =1, 2,- , n/2), there correspond 
two integrals, from each of which the contribution of order r+ is 


(40) E (drr) A” (aaa (y) 5) J exp [i(rh (bana) ; Y) +7/4)]. 


| Finally, just as in the case of J2, it may be shown that for each of the integrals 
Jam, (k= 1,+++,), over an interval containing a zero of h”, 


(41) | Jak: = 0 (172), : 


where the o-term is independent of y. 
From (39), (40) and (41), ¥ we then obtain (3), q. e. d. 


Tue LINCOLN UNIVERSITY, 
CHESTER COUNTY, PENNSYLVANIA. 
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: ON TAUBERIAN THEOREMS FOR DOUBLE SERIES.* : 
By RALPH PALMER AGNEW. 


1. Introduction. Let 
n 1 nn . y 
En = DU; F on = > Sk (n =I, 2,-- +) 
k=1 N yl i 
denote the sequerice of. partial sums and the C, transform of a real series 


Su. A classic Tauberian theorem states that if on —s and the unilateral 
Tauberian condition nur < K is satisfied, then Sn — s. 


Let 
"mn M | 7 
mn = D, Uje Omn = Dyk (m, n= 1,2,- >) 
3,k=1 MR j kal . 


denote the sequence of partial sums and the C, transform of a real double 
series Sum, K. Knopp? has recently proved several Tauberian. theorems of 
-which his third is the following: 


If omn—>s and (m? + 1?)timn < K, then smn s. 


The “natural” question whether this theorem holds when the Tauberian 
condition (m? + n*) umn < K is replaced by the weaker condition mnumn < K 
~ was taised and left unanswered by Knopp. | | 

. In $2 we give examples which show that the unilateral condition 
Mit.» < E will not serve; that the stronger O-condition mn | umn | < E 
will not serve; and in fact that the still stronger set of o-conditions 


lim min | tn | = 0 | (m=1,2,- +) 
Gyr. 3 lim mn | tm, | == 0 | (n— 1,2, °°) 


on mn | Um,n | = 0 
| M, n>% f 
will not serve. The sequences d, and en of §2 are specialized to obtain further 
_results of this charaëter. 
In $8, we show that the situation is the same for many other methods 


+ Received December 7,. 1939. 

1 Presented to the American Mathematical Society, February 24, 1940. 

*K, Knopp, “ Limitierungs-Umkehrsätze für Doppelfolgen,” Afathematisohe Zeit- 
sohrift, vol. 45 (1939), pp. 573-589, p. 581. Adjustment from Knopp’s subscripts 
0,1,2,... to our subscripts 1,2,3,. . . is easily made. 
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of summability, including ii Cesàro methods of all Te orders and the 
Abel power series method. 
In § 4, we show that the stronger hypothesis that all of the limits 
lim ome; lim ome; lim om» 
, R->0O mw m, n0 

exist, the first for each m == 1, 2,- - + and the second for each n == 1,2,- -, , 
together with a Tauberian condition such as (1), implies neither convergence 
nor convergence by rows of Sums. 

It therefore appears that the double sequence mnum,n does jk play, in 
Tauberian theory for double series, a rôle analogous to the rôle of the simple 
sequence nus in Tauberian theory for simple series. 

In connection with the examples of §2, it is illuminating (but not 
essential) to recognize the fact that if Su» has bounded partial sums smn 
and Smn—>s, then omn—s; and that, irrespective of whether Sum,» has 
bounded partial sums, if Sun —>s and omn—>o, then s==0. General theory 
and references to literature covering these points may be found in two papers 
in this Journal.’ It follows that if omn—>s and it is not true that smn — S$, 
then lim 8m,» cannot exist. 


2. Some examples. Let d, be a bounded sequence of real non-negative 
numbers such that Zd, == 0. Let en be a sequence of positive numbers for 
which 0 < er = 1. Choose D such that 


0Z ds < D | (n—1,2,38,- >), 
and let no = D. 
We define by TA a sequence 
No <L vr L M L ve L na L n Ln L° | 


of indices and the terms Ums of a series Suma. For the first step in the induc- 
tion, take k= 1. Define tox, for n == 1, 2, 3,- > + by the formulas 


Uti. = ‘drda LS <nSn 
(2) = — edn VE L N< Ng 
= — dn n= x 


== . 0 ‘otherwise 
where e’; is the lesser of ex 1 and ex; x is so chosen that 


(3) D<de(dausr + dus +° © F dn) < 2D; 


- ° l . 
SR. P. Agnew, “ On summability of double sequences,” American Journal of Mathe- 
matics, vol. 54 (1932), pp. 848- tob; “On summability of multiple sequences,” ibid., 
vol. 56 (1934), pp. 62-68. ; 
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ny is so chosen that 


(4) ee (dng +t a ee 0 dy,) = Er (Grass +: ee + dos + Oda) 


is 20 when 0—1 and q= ny—1 but is <0 when 6=1 and q= ng; and 
finally & is chosen such that the difference (4) is 0 when g == ns and 8 = 6%. 
Observe that OS 0s < 1. Let ux be defined for n == 1,2, 3,:- by the 

formulas f : 


(5) . | Un == — Uzki, E (n == 1, 2, 3,°° Sy 


Successive steps in the induction are obtained by giving k the values 2, 3, 4,-- - 
in turn. 

The terms of the series Sum which we have just defined may be displayed 
im the form i | è = 


gs- +totdo4+---+04+0+---4+04-:- 
yest yf Of fO+04---+04--- 
O+---+0+24+---4+27+0+---+0+4+::-- 

(6) Of OH yb y HOH +0; 
O+---+0+0+---4+04+e+---4a4+--- 
0+: A00 fe Dee gua ea ype? ot 
+: 


in which the value of each wm, which may differ from 0 is represented by an 
zorbyay. The definition of & implies that the sum of the æs in each row 
is 0, and (5)-implies that each y is the negative of the x above it. These 
considerations imply that the sequence m,n, of partial sums of the series 
Em, may be displayed in the form 


Bey BOs 20 OSS te Oss ox 
0,-:-,0,0,-:-,0,0,-::,0,+ 
0,---,0,2z, +,2,0,°°°,0,°°° 

(7) 0,- + +,0,0,---,0;0,---,0,: 
- 0,7 , 9, 0, 02 Eee pre 
<ú; -,0,0, -,0,0,:--,0,--: 


in which the value of each Sm,„ which may differ from 0 is represented by a z. 
The definitions of vx, mx, and Um,n imply that’ 
(8) 0S Sman <D © (m, n = 1,2,- -), 


(9) . D < sam < 2D (k—1,2,-:-), 
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(10) Sn = 0 . (k, n=1,2 >). 


Hence , 
(41): lim inf sun = 0; PS lim sup Saa S 2D 
M, n0 fh AWO 


and therefore lim Sm,n does not exist. 
The fact that 0 = sm, = 2D, and that at most n of the terms s,x in 


the sum 
1 M,A 
Omn = — À 8 83,k 
. MA j,i 


are different from 0, implies that 0 = om.» S < 2D/m a and hence that Tmn —> 0. 
Our definitions imply that, for each X and n, 


| torn | = | tMen-an | S px; 
and since e's is the lesser of esx. and ex, this implies that 
(12) | | tmx | A (mn, >). 
For the particular sequences i 
(18) da = 1/n log TERTI en = 1/n2% 
the series Sumy satisfies the Taubcrian condition 
(14) : mn | umn | SS 1/8" log. (n + 1) 


while om,n—> 0 and the sequence Sma is bounded and lim Sm,n fails to exist. 
For the sequences 


(15) dy = en = 1/n [log (n + 2) ] [log log (a+ 16)] 
we obtain the symmetric inadequate Tauberian condition 
(16) mnlog(m + 2)log(n + 2)| umn | S 1/log log(m + 16)log log (n + 16). 
Each one of (14) and (16) demonstrates inadequacy of the o-conditions (1).*. 
3. Other methods of summability. ‘Let anx and dur, N, k = 1,2,8,4, 
denote matrices of regular simple-sequence transformations 
e ; 00 D 
(17) on? == > An ESk 5 oy? —=— > durs ; 
kel A=1 


and let the matrix an, satisfy the additional condition 


t An example of a divergent series ‘Su, which is summable O., and which satisfies 
the condition mn | than | < K and the condition MNU pp 70 as m, 9>, has just been 
published by W. Meyer-Kinig, “Zur Wrage der Umkehrung des C- and -1-Verfahrens | 
bei Doppelfolgen,” Mathematische: Zeitschrift,. vol. 46 (1940), pp. 157-160.—Added in 
the proof. 
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(18) lim Lub. | anz|—=0. 


A0 - k=1,2,... 


This condition is of course not satisfied when an» is the identity matrix 
8n,x, but it is satisfied for many other regular matrices. In ARE, (18) 
is satisfied when ax is the matrix 


o nif n—i \,, /n—k+1 1<ix< 
GE) (EH) thse 
=e | k>n 


ef a Cesèro transformation C whose order r is a real or complex number 


with a positive real part 1”; for 


jan) < ltl rs) =) Li) 
ki = a \n+r—1/\n+r—2 n+ —k 
when 1 Sk Sn s0 that ifr 1, 


|a? |S] r|/n (k—1,2,° L SF 
andif0<r<1 


jeg] S |r] (r)a) Tat) E ee 


| It is well known that C, is regular when r’ > 0. 


Let 4 © B denote the double san method of sine defined by 
19 (a, D) mx a . 
(19) ga = Gm, 30 n,b8 jx 


Let Zum,n and Sm,n be as conatructed in §2. Then | smn|< 2D so that 
the series in (19) converges absolutely ; hence 


œ% oo g 
oo = E bne D Om, 485.4 
á k=l 3=1 


For each k there is at most one 7, say Bx, for which s; x 54 0. Hence 


e Fe sites 
-so that | 
(22) ` CHEP | dnx | [ T u. a. b. | &n,s |] [2D] 
_ and therefore Don 
(23) lim pad 0, 
M RIO 


` Thus Yum,» is summable À © B to 0, and the examples of 82 apply to 4 © B 


as well as to C1. In particular the examples apply to the Cesaro transformation 
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. Cr © C, if the real parts of r and s are positive. In case merak COCs. 
becomes the special double sequence transformation 0; previously. considered. 
It can be shown in the same way that if 


(24) o0) Sata; t) malia 


are regular sequence-to-function transformations and - 
(25). lim Lu b. | ax(t)| = 0, 


tto k=1, 2 


then each series Yumn of §2 is summable to 0 by the double sequence-to- 
function transformation | 


(26) oP (tu) = È OLOLA 
This applies to the Abel power series method for which 


(27) a(t) = ba(t) = #7 (1 — t), 


the variable ¢ approaching 1 over the real interval 0 Æ t< 1 or over the 
complex sets of Stolz and Pringsheim. 


4. Convergence by rows. A double series is called convergent by rows 
to Sr if 


SS una—se 

m=i n=l 
or, what amounts to the same thing, if lim, sm,n exists for each m = 1,2,--- 
and | 
(28) lim lim Sm, = spr. 


MO n-r00 


` The series constructed in § 2 converge by rows to 0; hence the examples do 
not preclude the possibility that om n„n—>s and the Tauberian condition 
| Milt,» < K-may imply (28) or at least the weaker condition l 
(29) i lim lim inf sm, = lim lim sup sun == $. 
MO n> m>% ACO 
This question and others are settled by the following exansple. | 
Let da, en, and D be given as in §2; and for each k = 1,2, : - let ds 
be the least of the four numbers ex 5, », €4b-1, and es. For each k=—1,2,-.-, 
choose rx such that 


| D < dudit da H: + dm) < 2D 
and let A 
Uak-s,n = — Uson == — Uk in = Un (n = 1,2,°° à 
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where 
Usha = € ,dn lisnsm, 
= 0 n> Ry 


The sequence Sm, of partial sums, and the transforms by various methods of. 
* summability, of this series are more complicated than those for the series of 
§ 2. However it is possible to show that Sun satisfies the Tauberian condition | 


y 


| tmn | SS emda; 


that — 2D Z Snn = < 2D; that if own is as before the C1 ra of Ytim,n, 
then - 
(30) ’ PETR lim Om,» lim Oman . 


n= m->00 m, #00 


all exist, the first for each m = 1, 2,- +- and the second for each n = 1, 2,- 
that Stun,» fails to converge; and. nally that each row of the series Stim ; 
converges but that the series of values of the rows does not converge. 

This example is of interest because existence of the first limits in (30) 
and the. Tauberian condition mnwmn < K imply (by iterated use of the 
Tauberian theorem for simple series given in $ 1) cénvergence of each row of 
Yum. The example shows that existence of all of the limits in (30) and 
stronger Tauberian conditions | tm,» |S emdn do not imply convergence of the 
series of values of the rows. 


5. Conclusion. It is sometimes desirable to have, in addition to a 
proof of a result, a plausible argument which indicates roughly why. the 
result may possibly hold. “The question here is “why” omn—>s and 
MNUm, n L K can fail to imply Sm,n —> S a8 on —> S and NUn < K imply sas. 
The “ answer ” seems to be that the condition mnumn < K does not prevent 
an effective dilution of a double sequence Sm,n by insertion of zeros in the two 
‘dimensional pattern, while nu, < K does prevent an effective dilution of a 
simple sequence s, by insertion. of zeros in the linear pattern. i 
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ANALYTIC FUNCTIONS AND MULTIPLE FOURIER INTEGRALS.* 
By W. T. MARTIS. 


Introduction. In the first part of this note we consider the class E of 
entire functions f(2:,- + *,Z„) which satisfies relations of the form 


(1) L a | f(a. + tyr, ++, En + iyn) | d1 de, < Aela. -+y 
-c0 -00 . ; 


for all finite values of 4:,° - `, Y» where a and A are positive constants. It is 
easily shown that this class of functions is identical with the class of functions 
having Fourier transforms $(th;° * `, Up)a It... +atn) which vanish outside 
a certain finite region. Next if we denote by K the common part of all convex 
bodies (in the u-space) in whose exteriors œ vanishes: identically and by s(A) 
its supporting function, 


(2) 8(A) mæ ae {A +: + Anta}, Ais 7 ©; An real, 


then we show that s(A) is equal to a growth- -function h(a) of f defined as 
follows + 


(3) ho) = Him Eog f% af |f (21 + iip, = En + Dap) |? da: *- 


From these considerations it follows that the class E is identical with the figs 
considered by Plancherel and Polya’ of entire functions of integrable square 
over the real space yı ==" > `= yn = 0 and that the growth-function (À) 
defined in (3) is egual to the growth-function 


(4)  he(A) = max Tim sup = log | f(e, + ps: : * , an + Hep) 


eee A 
defined by them. 
In the second section we prove results of a similar nature for the class of 
functions f analytic in the “octant” Im{%} > 0, k= 1,- >, n, and satis-, 
fving relations of the form. (1) for all positive values of Yrs Une 


* Received October 12, 1939. $ 
2 The idea of considering a growth-function of the sort defined here arose in a con- 
versation which the author had with Professor S. Bochner. - 
2M. Plancherel and G. Pólya, “ Fqpctions entières et integrales de Fourier multi- 
ples,” Commentarii Math. Helvetici, vol. 9 (1936-37), pp. 224-248; vol. 10 (1937-38), 
pp. 110-163. 
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1. The s P of entire functions. We consider the class P of func- 
tions f representable in the form 


CRC ETC SE i g (uatan: Had, 


where o(u) == h (u © >, un) is of integrable pir over — 4 < t <a a, 
k=1;---,n, and do, is the volume element du,- - - dus, and we show that” 
this class. is identical with the class E of entire functions satisfying relations 
of the form (1).: First, by the Schwarz inequality, i 


| 1 f g (u) etna: --tuntadoy | 
= =f: f | TE Ls us + Hate) doy 


and thus the function f denied in (5) is an entire. function. Next by 
Plancherel’s theorem 


Í =f FFC + iyn: y Ln + iyn) |° dos — f :.f Ne 


< e2a (alt: +l) f ‘ $ f | $ |? dow | 
-a 


and thus a relation of the form (1) holds. Conversely, if f belongs to the 
class E, then for each (Y1, ``, Yn) it has a Fourier transform yip (u). 
By a theorem due to Bochner, since the left-hand side of (1) is bounded 
for (y) in any bounded region, it follows that w(w) has the form 
pts", Un) overt: + tan, Thus by Plancherel’s theorem 


(6) f(a, . ` . > Zn) Ey” sfc f (u) Oa ee Watap- itha .. tone, 


We next show that if f has the representation (6) and if (1) holds then 
¢==0 outside the “cube” C[—a<u <a, k=æ1,-::,n]. For suppose 
$50 in some region R which lies outside C.. There is no loss in generality 
in assuming that À is of the form ar< us < Br, k =m 1,: -:,n, where 
a < 4 < B1. Then for Y°: Yn positive Plancherel’s theorem yields _ 


(7) f Je aarti) doem S E, feet 


a f | o [rest tin) doy 
=> ellato. Hanya) J Ei | h l'duy. 


293. Bochner, “ Bounded analytic functiofs in several variables and multiple 
Laplace integrals,” American Journal of HARAS, vol. 59 (1937), pp. 732-738, 
- esp. 733-734. 
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‘ 


As 4,2 œ, for Yas" '*,Yn fixed and ‘positive, this contradicts (1) since 
a < a, and f, | o |’ dan > 0. Thus we have a contradiction and hence ġ = 0 


outside C. Hence the two classes P and E are Haia 

Next let f belong to the class P (=E) and let us denote w K the 
intersection of all convex bodies in the u-space in whose exteriors ¢=0, 
and let s(A) be the supporting-function of K defined as in (2). Then f has 
the representation 


(8) reed -GV Í. b(t) eben dai 


and 
` co J i ; s 
(9) SE J FGA dap + +, Sa + Anp) Pde ex 
=f lé |202 rut ves Antia) 0 


< ene f | $ [?doy. 
Thus ; zi 


1... oo : A 
(10) 5 lim sup 20g f ef | f(a: + tip, +++, En + tAnp) [dus S (A). 


In order to see that the actual-limit in (10) exists and is equal to 3(A) 
let us consider a fixed direction (A°). Then there is an extreme point * (u°) 
of K such that s(A°) == A u° -+ < -H Anun’. Moreover for à > 0 there 
clearly exists a neighborhood N — N(3) of (u°) such that | 


(11) 8(A°) —8 SA + + + Au, Æs(X°) +8, for (u) NE. 
Hence i 


co 
COR MR AeH STE UE | 
-Í | g [Perat -seada ( | p [eut e Aun Ado 
K A NK 7 


> e2la(X°)-51p f | à |?dow. 
Tx NK 


«By an extreme point of a convex body K is meant a boundary point which is not ` 
an inner point of any line segment of K. For each direction (A) there is an extreme 
point which lies on the supporting plane in that direction, i.e. on At, +++ -+A,u, 
=s (à). An extreme point also possesses the property that if any neighborhood N 
of it is omitted from K, then the convex extension of K — NK is a proper subset of K. 
For these properties see T. Bonnensen and W. Fenchel, “ Theorie der Konvexen Körper,” 
Ergebnisse der Math. und ihrer Grenzgebiete, Berlin (1934), esp. pp. 15, 16 or G. Pôlye, 
“Untersuchungen über Lücken und Singularititen von Potenzreihen,” Mathematische 
Zeitschrift, vol. 29 (1929), pp. 549-640, = pp. 573- 578. 
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Now f 5 | @ [dos > 0 since otherwise ¢ would be identically zero in NK 
N. 
and thus ¢ would be identically zero in the exterior of the convex body K* 


`. which.is the convex extension of K —NK. But this is impossible since K* 


is a proper subset of K (see ‘) and this contradicts the definition of K. 
Hence (12) yields | 


(19) Prim int biog f fest ate at Oat) aE 


From (10) and (13), since: (A°) is an arbitrary direction and 8 is an arbitrary. 
positive number, it follows that the limit in (10) exists and that it is equal 
to s(A). 

We have proved the following theorem. 


“Tororem 1. Let f(a,:-:,2) be an entire function satisfying (1). 
Then the limit in (3) ewists and is equal to (A), where 3(A) is the supporting 
function defined by (2) of the convex body K, where K is the intersection 
of all convex bodies in whose exteriors the Fourier transform of f is identi- 
cally zero. 


Plancherel and Pólya (loc. cit.?) have considered the class P of functions 
and have shown that the growth function Ap(A) defined by them as in (4) 
ig equal to the function s(A). Thus we have 


COROLLARY. If feP then 


(14) Stim A tog ff [fei + dus + + stn + yp) doe 
p> P -© , 


== max lim sup L log | f(a + up,” - ©, Gn np) |. 
po P 


ps.s lh 


In connection with Theorem 1, let us remark that Plancherel and Pólya 
(loc. cit. °, p. 146) have shown that 


x ; 
S S FC + tas + san + iya) Eds 
i < ezo lult -Hlin f- f | fes": + Tn) [dos : 
| Fu . 
where c is the cardinal increase of f, that is, c is the greatest value of hp(A) 


. for (A) ranging over all the sets for which one às is + 1 and all others are 0. 
ney also obtain .an di result for the class L?. 
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2.. Functions analytic in the ‘‘octant”? Q = E[In{%} > 0, k = 1, 

sn]. Let f(2,-:-,2) be analytic in Q and let it satisfy a relation of 

the form (1) for all positive values of 4, :,4. We shall obtain a result 
for this case analogous to that obtained in the previous section. Define 


(16). g (zu . *,2n) == etats er tad F(z, - . FAN 


Then by (1) (for y:,' - *,Yn positive) 


(16) MRC ES stat iy) Pde 
St se SILCEA ` `, En + tyn) | doa 
< À. 


Now Bergmann and Martin ë have shown that a function. g analytic in Q and 
satisfying a relation of the form (16) for (y:,: : :,Y») positive has a Fourier 
transform y(u)eñ%*:.-#"#s which vanishes outside the octant q = Eux <0, . 
k= 1,- +,n] and which has the property that y(u) «L°. Thus g has a 
representation of the form . 


(17) g(2y* `, 2a) ~(2)"f oS (petite amd, (2) eQ. 
Using (15), we see that 


(18) (as <, Zn) = E To f y(u) gads +6 (nta) Em deny 
~ (>) fE an f (u) eittir. -tustadou, (2) 7 Q, 
where ‘ ` i 


(19) ACUPE a *; Un) = y(t — 4, an ue). 


Thus the class of all functions f analytic in Q and satisfying Taies 
of the form (1) for all positive values of 4 ths" * +, Yn, is contained in the class 
of functions defined by 


n/2 a 
(20) f(a cr, Zn) _ (+) f ee f p(w) enter - - +~tintnday, (z) € Q, 
where ġe L? over — 0 <u Sa,.k = À. --,n, and vanishes identically 


elsewhere. That these two classes are identical follows at once. We omit 
the details. 


58. Bergmann and W.T. Martin, “ 6n a modified moment problem in two variables,” 
to appear in the Duke Mathematical Journal. Bee esp. Theorem 1. 





te Next let Sykbe a cli TR of radius p, center th ==: > -=m Up = 0, 
. let. KE . be the intersection of all convex bodies Ç in Sp such that == 0 
in SEC. Then clearly Fe Ske and the point set K’ ‘consisting of all 


| points in in ay Ey, p= 1,2,- > -, has the property that = 0 oùtside K’. Thus 






i 


(1) les: only ° f (ujete -tudou (2) eQ, 
and for positive A1,° * :, An we have. 


, . & 
(22) f SS [f(t + ps © En + np) | dos | 
GS ji l p | 22 Cut vee atts) dong = e28 (A)p Í. | p dos, 
K’ K' 


where 
(23) s(A) = Len b. {its + i Anta} À," +, An positive. 


Hence : 
(24) 5 lim sup *Tog ve. flit ilap =s oa + dap) doe E CA) 
for Jes Vs. Again we can show that the actual limit exists and that 


it is equal to s’(A). For this purpose let us apply Plancherel’s theorem to 
(21). Then : 


(25) 1 JF + Maps” jan + Dap) dog 
| =f, | g Peur uunogo, (p—=1,2,° °°). 


Now by Theorem 1 we have 


(26) i lim Lice f | ÿ |203 asst - + Hat) deny — Sp(A) 
2 p>% P a Kp 
and thus | 


de x & f | is 
(a7) tliminttlog f -2 f [f(s + dap = -sn + up) don = (A), 
o>% P -c cs 
; | (2 = 1,2; °°), 
‘for àp’ ’*, An positive. This implies that the left-hand side of (27) is 
greater than. or equal to s’(A) for positive Vs. For let (A°) be a positive 


*It is clear that K’ C H[— œ <u <a, &=1,. . -, n] and hence that 


lu. b. yy H. Anta} S (A te- -Hpo for A,,- . -Ap positive, 
(u) eK’. . 
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direction. Then in view of the definition (23) of s’(A) ‘theré ï is à sequence (we) 
of points such that (u?) e Kv, (where vp — œ as p— œ) and ES thot: 
(28) MUP + + + + An Un?  8”(A°) as p> ©. Eu 
The relation (28) together with the fact that Au, + - - + An Un? = 8, (1°) 
gives 
lim inf sy, (A°) = s’(A°), 
pw 


and hence the left-hand side of (27) (for (A) = (à°)) is greater than or 
equal to s’(A°). Since (A°) is an arbitrary positive direction this result 
together with (24) yields the following result. 

THEOREM 2. If f is analytic in Q and if (1) holds for positive y'>, Yn 
then : 


5 tim Dog [Te f 1P + taps à sa + Hap) dec = (a) 


poo 


for positive Ns, where s’ (à) is defined as in equation (23). 
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PROJECTIVE ANALOGUES OF THE CONGRUENCE OF 
NORMALS.* 


By Pup O. BELL. 


1. Introduction. The projective normal at a point of a non-ruled sur- 
face 9 in ordinary space was defined by Fubini as the cusp-axis of y with 
respect to the extremal curves of his integral invariant 


J (a’bv’) du. 


It is well known that the pseudo-normal which Green proposed as a projective 
' analogue of the normal coincides with the projective-normal. 

Green and Fubini discovered, quite independently, certain analogies which 
exist between this line and the normal. Green noted that the projective 
normal, like the normal, is intrinsically connected with the surface, and that 
the curves which correspond to the developables of the projective normal con- 
gruence resemble the lines of curvature by also forming a conjugate net. 
Fubini’s considerations reveal that both normal and projective normal may be 
defined as cusp-axes of certain integral invariants. Fubini’s definition lacks 
geometric significance without a geometric interpretation for his integral in- 
variant. The author [1, p. 403]+ has recently provided such an interpretation. 

Green designated a congruence whose developables correspond to a con- 
jugate net a conjugate congruence. Grove [2] has proved analytically the 
existence of a class of covariant conjugate congruences. a general one of which 
he calls an R-conjugate congruence. He does not however characterize geo- 
metrically any one. of these congruences. It is the purpose of this paper to 
present a method for the geometric determination of a general R-conjugate 
congruence and to show that it is also characterized by the other important 
property of the. projective normal by being similarly determined with respect 
-to the extremals of an integral invariant. A method will also be given for. 
- the geometric interpretation of these extremals. Finally certain special a 
conjugate congruences will be introduced. 

Let the surface § be referred to its asymptotic net as parametric, with the 
_ fundamental differential equations in Wilezynski’s canonical form 


(1.1) 2 Yuu + bys + fy=0, ` Yor + 2a’yu + gy = 0. 


* Received November 10, 1939. . 
1 Numbers in brackets refer to the bibliography at the end of the paper. 
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Using the notation introduced in the celebrated memoir by Green BT let us 
consider the parametric vector equations 


(1. 2) y = y(u, v), p= Ys — BY, o = Yo — ay, T= Yuv — Yu — Bye + ABY 


where æ and £ are arbitrary analytic functions of u and v. Equations (1.2) 
define the general homogeneous coôrdinates of four points which we denote 
simply by y, p, e and r when no confusion can arise. The line l joining the 
points p, o,'according to Green’s classification, is an arbitrary line of the first 
kind and generates a congruence T of the first kind as y moves over ©. The 
reciprocal ? of the line J with respect to 9 at y is an arbitrary line of the 
second kind and generates a congruence I” of the second kind as y moves over 
S. If the functions a, 8 are chosen suitably the points y, p, « and 7 become 
covariant points and the congruences T and I” become covariant congruences. 


i 


2. Conjugate congruences. Consider any two covariant points œo, and 
w Which are collinear with y but do not lie in the tangent plane to S at y. 
The general coördinates for w, and w: are given by o; = r + 14y, (i= 1,2), 
where r is defined by (1,2) and r, and 7r, are functions of u,v. The tangent 
lines at w, and œ: to the curves described by these points as y moves along a 
curve Cy, defined by dv —A(u, v) du = 0, intersect the tangent plane to S at y 
in the points which we denote by Wi and W.™. Expressions for the coördi- 
nates of W,™, (i= 1,2) are linear combinations of œ; and (w)u + A(wi)v 
which do not contain Yus. The terms of (ws)u + A(os)» which involve yu, are 
equal to — (8 + aA) Yur. Hence, the expressions for the general codrdinates 
of W,% and W. are given by 


(2.1) 7,0) — (01 )a FA l)o + (B+ os (1,2). 
Making use of the forms for o, and w: and the equations (2.1), we have 
(2.2) Wa — Wi == R[(yu— By) + A(yo— ay) ], 

where 


B—=— (8+ [log Bly), %—=— (a+ [log B]e), R=m—n. 


Let ¢, denote the tangent to Cy at y. Let », denote the point of intersection 
of 4 and the line joining W, and W,%. The right hand member of (2.2) 
is clearly the expression’ for the general codrdinates of y. We shall call the 
point n the v-point of ta, corresponding to the points œ, and w 

. Since the right hand member of (2. 2) is a linear combination of y, — py 
and y,— ay, the point m, for any value of A, lies on a straight line T which 
joins p and o given by 


15 


682 | : PHILIP 0. BELL. 
P—yu— Ëy o= yo — By, 
where B and @ are defined above. Hence, we have the theorem | 


“THEOREM (2.1). As the direction À is varied, while u and.v are held 
constant, the v-point of ty, corresponding to the points o, and wz describes a 
straight line I. 


The point u of intersection of the line 7 with the reciprocal of the line 
joining w, and wz has general codrdinates of the form 


p= (20+ [log B]e) (ye + [log R]ay/2)—(28 + [log Ælu) (ys + [log Rl ey/2). 


Let ty denote the tangent to S at y which passes through the point p. In view 


. of the forms of the functions a, 8, we have 


THEOREM (2.2). The harmonic conjugate of the tangent ty with respect 
to the line T and the reciprocal of the line joining w, and w, is the R-harmonic 
line, which joins the points p and o given by p= y, + (log R)uy/2 and 
omy, + (log 2) oy/2. The reciprocal of this line ts the R-conjugate line. 


To complete the characterization of the B-conjugate line for a given 
function R= R(u, v) it is, of course, necessary to have geometric definitions 


‘of covariant points w, and wz whose general cobrdinates 1 are related by the 


equation, w = w, + kRy, k = const. 

The integral f (Hv’)4du, where R(u,v) is associated with covariant 
points w, wa in the manner described in the preceding paragraph, is an in- 
tegral invariant which is projectively and intrinsically related to the arc of a 
curve along which it is calculated. The extremals of this integral are defined 
by the curvilinear equation ` 
(2. 3) v” = (log R) uy’ — (log RB) yv’*. 

It is well known that if Wilezynski’s canonical form (1.1) is used, the cusp- 
axis of y with respect to a two parameter family of hypergeodesics defined by 


= A+ BY + Cv? + Dv’, 


passes Het y and the point z given by z = Yuo— Yu — By», Where « = 0/2, 


, P, == — B/2. Hence, we have 


 Txrorex (2.3). The R-conjugate line is ‘the cusp-axis of y with respect. 
lo the extremal curves of the integral invariant f (Rv’)#du. 


To add to the geometric significanc® of the above theorem the extremal 
curves of the integral f (Rv’)*du will be geometrically characterized. 
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THEOREM (2.4). The tangent ty associated geometrically with covariant . 
points w, and w: which lie on the cusp-axis of y with respect to a pencil m of 
conjugate nets and the tangent tx of the curve Cy of the fundamental net Ny . 
at y are conjugate tangents tf, and only tf; the curve Cy ts an extremal of the 
integral invariant f (Rv’)*du, where kRy = o,— 01, k = const. 


According to the hypothesis we must have | 


(2. 4) ` A= (28 + [log R]u)/ (2a + [log R]e ), 

where B— = — (log d)u/2 and a = (log A)./2. Hence, on dearing of fractions 
we obtain 

(2. 5) o Aw + Av = (log E)uà — (log B)eà?, 


which, on substituting v’ for À and v” for Ay + AA, becomes equation (2.3). 
The operations Are reversible and therefore the condition is necessary and © 
sufficient. g 


3. Special conjugate congruences. The projective normal is the special 
case of the R-conjugate line for which R = ka’b, k = arbitrary const. To 
complete its geometric characterization it is necessary to locate two points w, 
and œ whose general codrdinates are related by the equation w — o, = ka’by, 
k= const. Two such points are the intersections (distinct from y) of an 
arbitrary line V of the second kind with the ee of Wilezynski and the 
quadric of Lie. | 

From the standpoint of analytic simplicity the projective normal is the 
best available projective substitute for the normal. From a geometric point 
of view, however, it is quite conceivable that there may be other #-conjugate 
lines equally suitable as a projective substitute for the normal. An Æ-conjugate 
line of this character will be introduced in connection with a new pencil of 
quadric surfaces. 

Let lẹ denote a general line of the first canonical pencil. The line lx 
intersects the u and v-tangents to § at y in | the pone p, o defined in 2), 
where 
ED a ab) a/e — (iog db)a/3» 

7 a = (log ba’),/k — (log-a’b),/2. 


The lines lz, ls, la and lœ are the first directrix of Wilczynski, the reciprocal 
of the axis of Cech, the first canonical edge of Green, and the reciprocal of 
the projective normal, respectively. As y moves over S the points p o of & 
generate transversal .surfaces. Sp and So of the congruence described by ly. 
The v-tangent at p to Sp intersects V+, the reciprocal of l, in the point which 
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` we denote by m whose coôrdinates are given by ys = r — fry, where a, 8 are 
= functions associated with 4. Likewise the u-tangent at o to So intersects 

Vz in the point which we denote by & whose coördinates are given by’ 
és == T — ayy? Let tx denote the harmonic conjugate of y with respect to the 
points m and &. The general codrdinates of & may be easily found to be 
given by k = r + (k — 3) (log a’b) wy/2k, where the functions g, 8 in the 
expression for 7 are given by (3.1). It is well known that just one quadric 
of Darboux at y passes through a given point not in the tangent plane to 9 
at y. The equation of the unique quadric of Darboux which passes through 
the point éx, k == const., is easily found to be 


(3.8) ` Talg — dits + (k — 3) (log a’b) ut? /2k = 0, 


This quadric is, therefore, a general member of a pencil of quadrics whose 
members are in one to one correspondence with the lines of the first canonical 
pencil. The quadrics of this pencil will therefore be called canonical quadrics. 
The special case of (3.2) for k==3 is clearly the canonical quadric of 
Wilczynski. Stouffer [4], without introducing the general quadric (3.2), has 
given the above characterization for the quadric of Wilczynski. 

The intersection of a general line I’ of the second kind with the quadric 
(3. 2) is a point, which we denote by wx, whose general coordinates are given 
by o mr + (k— 3) (log a’b) ay/2k, where the functions «, B in the ex- 
pression for r are arbitrary. The form for the codrdinates of wp shows that 
oj — wy = C(log a’b) wey where cs (j — 3) /27 — (k — 3) /2k, j, k = const.’s. 
Hence, the following theorem is an immediate consequence. 


THEOREM (3.1). If the fundamental points w and ws are chosen as the 
intersection of a line l of the second kind with two quadrics from the pencil 
of canonical quadrics, the associated R-conjugate line is independent of the 
choice of V and is independent of the selection of the two quadrics of the pencil. 
For this line the associated functions a, B are given by a == — (log R),/2, 
8 = — (log B)u/2, where R == (log a’b) ac. 


4. Fe-conjneste, congruences associated with one-parameter families 
of curves. 


The transformation : 
(4.1) : ` y—c/(R)# 


® 
*The points 7, and & are special cases of the points 7, and 7,, respectively, which 
were introduced by Green [3, p. 95]. 
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transforms the. covariant points (R)*(yu + Byy/2k), (R)*(Yo + Ryy/2B) 
‘and 


(BR) (ue + Royu/2R + Ruyo/2R + [RuBe/4R? + (log B) wo/2]y} 


into ty, £v and Tuv respectively. The points xy, x are the intersections of the 
R-harmonic line with the asymptotic u and v-tangents to J at x, and the point 
uv lies on the R-conjugate line and is characterized like a point &, but with ls 
replaced by the R-harmonic line... The effect of transformation (4.1) on 
system (1.1) is to produce the following canonical form 


(4. 2) { Tuy == PL + butu + Be, 


: Loy == qT + you + (ZA 
wherein, 


0 = log R, B =— 2b, y—— W 
P= — f + bho + buu/2 — 0/4 and q= — g + Ou 00/2 — 6.7/4. 


If R= a’b, the form (4.2) is Fubini’s canonical form. 

The intersection of the R-harmonic line with the tangent at z to the curve 
Cy defined by dv—Adu—0 is the point Tu -+ Az. The-tangent plane at 
Tu + Az, to the ruled surface described by the R-harmonic line as « moves 
along C\ intersects the R-conjugate line in a point P\ whose general coördi- 
nates are found to be given by 


(4.3) Py — uv + (p + ga?) 2/2. 


The T-curves of the R-harmonic congruence form a conjugate net Ny, 
whose curvilinear differential equation is 


(4. 4) . dv? —d,'du? = 0, where “Ai = (p/q)*. 


The points P-a, Pa associated with the curves of Na, which pass through the 
point æ are given by 


Py, = Gav— (pq), Pi eA (pq) #2. 


We recall that a conjugate line may be determined in association with an 
arbitrary line I’ by choosing fundamental points w1, œz on V and following the 
method outlined in §2. The conjugate line thus determined with respect to 
the fundamental points P-a, Pa, which lie on an arbitrary chosen R-conjugate 
line is especially interesting because of its remarkable analytic, as well as 
geometric, simplicity. By making use of equations (4.2) in carrying out the 
analysis for this determination we obtain the following 


THEOREM (4.1). The R-conjugate line (È = [pq]*) determined with 
respect to the points Poy = Cur — (pq) #z, Py, = Tao + (pg) 8x, of an arbi- 
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trarily chosen R-conjugate line, as fundamental, passes through the points x 


and 2 = Duo — ax, — bar, where a, b are defined by 


a= [log(B/(pq)*)]o/2,- b = [log(R/(pg)#)lu/2. 


` This line is the cusp-azis of the point z with respect. to the extr emal curves 
of the integral invariant : 
f (pq) #v du, 
“ Of course, other R-conjugate congruences may be associated with a given 
one by selecting points P.a which are associated with other significant curves 
of 8. The investigation of some of these may prove interesting. | 
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` CONVERGENCE THEOREMS FOR FUNCTIONS OF TWO 
. COMPLEX VARIABLES.* 


By Wiciax F. WHITMORE. 


1. Introduction. The theory of harmonic measure has proved a very 
valuable tool in the theory of functions of one complex variable. The possi- 
bility of these applications is due on the one hand to the fact that the real or 
imaginary part of an a.f.1c.v. (analytic function of one complex. variable) 
ig a harmonic function and on the other to the fact that the Dirichlet problem 
can be solved uniquely in terms of harmonic functions, thus assuring the 
existence of the harmonic measure. In attempting to carry over these ideas 
to functions of two complex variables, one ‘is confronted by the fact that it is 
not possible to preacribe arbitrary boundary values for a biharmonic function 
(real or imaginary part of an a.f.2c.v.) on the entire three dimensional 
boundary of a four dimensional domain. In order to preserve at least a por- ` 
tion of the properties of the one variable case, Bergmann (B,) has introduced 
the concept of domains with distinguished boundary surface. The three 
dimensional boundary of such a domain contains a closed two dimensional 
manifold—the distinguished surface (ausgezeichnete Randfläche, surface re- 
marquable)—vwhich has properties for the theory of a. f. 2 c. v. analogous to 
those of the bonnen for the one variable case, in that a regular a. f. 2c. v. 


* Received September 21, 1939. 
1 The method of approach used here has been chiefly developed by Stefan à Bergmann 
in a long series of papers, of which I have had occasion to cite five in particular: 
(B,) “Ueber die ausgezeichneten Randflichen in der Theorie der Funktionen von 
zwei komplexen Veränderlichen,” Mathematische Annalen, vol. 104 (1931), 
pp. 611-636. 
(B,) “Zwei Sütze aus dem Ideenkreis des Schwarzschen Lemma bei den Funk- 
‘tionen von zwei komplexen Nene Mathematisohe Annalen, vol. 109 
(1934), pp. 324-348. 
(B,) “ Ueber eine Integraldarstellung von Funktionen zweier komplexer Veränder- 
lichen,” Afathematioheskti Sbornik, vol. 1 (43) (ney series), pp. 851-861. 
(B,) “Ueber eine in gewissen Bereichen mit Maximumfliche gültige Integral- 
darstellung der Funktionen zweier Variabler,’ AMathematisohe Zeitschrift, 
vol. 39, pp. 606-608. 
(B,) Ueber eine Abschätzung von méromorpton Funktionen zweier komplexer 
Veränderlichen in Bereichen mit ausgezeichneter Randfiäche,” Travaux de 
PInst. Math. Tbilissi, vol. 1, pp. 187-204. 
The theory of harmonic measure for ofe variable is given in Nevanlinna: “ Eindeutige 
Analytische Funktionen,” here cited as (N). : 
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attains its maximum on this surface, a biharmonic function is determined by 
its values there, etc. An example of a domain with distinguished surface is 
given by any domain bounded by a finite number of analytic hypersurfaces 
(three dimensional manifolds defined by analytic relations between the two 
complex variables), the distinguished surface being formed by the intersections 
of these hypersurfaces—e. g., the bicylinder | z,| <1, | 2 | < 1 with a boun- 
dary composed of the two analytic hypersurfaces 2, — et == 0, < 1 
and 2: — es: == 0, | zı | < 1 has the distinguished surface | 2: | =1, | z: | —1. 

Although a biharmonic function is uniquely determined by its values on 
the distinguished surface of a domain, it is in general not possible to find a 
biharmonic function defined in the domain which assumes arbitrarily pre- 
scribed values on this surface. Hence a biharmonic measure cannot be used 
to generalize the notion of harmonic measure, for such a measure may not 
exist. ‘Bergmann (B;) has met this further complexity by introducing the 
notion of functions of extended class. This class possesses properties necessary 
for the extension of harmonic measure; in particular, the property. that to 
every bounded, piecewise: continuous function given on the distinguished sur- 
face of a domain there corresponds a unique function of the extended class 
defined in the domain, and also that the operator defining the class is linear. 
The class depends, in general, on the domain. For a domain where the range 
of each complex variable is independent of the other (product domain, also 
called cylinder domain), the extended class is known to be the class of doubly 
harmonic functions (B.), so that for such domains the notion of harmonic 
measure can be replaced by that of doubly harmonic measure. For functions 
of 1c. v., Lindelöf has proved a theorem to the following effect (N, p. 44): 

If an analytic function defined and bounded in the upper half-plane 
converges to a limit for z tending to infinity along the negative real axis, then 
it converges to the same value uniformly in each angle-space m > argz >> 0. 

Or stated for the unit circle: 

If an analytic function defined and bounded in the unit circle converges 
to a limit for one-sided approach along the boundary to a given boundary 
point, then it converges to the same value uniformly along any path in the 
interior which ends at the given point and makes a positive angle with the 
circumference at the Point. 

With the aid of the theory of doubly harmonic measure we shall show 
that analogous results can be established on the convergence of bounded 
functions of 2c. v. defined in certain domains with distinguished surface. 








Ze 
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2. Notation and definitions. We consider functions of the two com- 
plex variables 24 == 2% + ty, (k = 1,2). A doubly harmonic function of the 
four real variables 21, Y1, T2, Y2 is defined by the equations 


(1) 





. 2y 

Ja oae e . (k=1,2). 

A biharmonic function is the real or imaginary part of an a.f. 2c. v. and 
satisfies in addition to equations (1) the equations 


ou, Oy fu Fu _, 

00,80,  0y:0ye ? OryOy,  Oraby, | ? 

as can be verified by application of the Cauchy-Riemann equations. The 
symbol - indicates the intersection of two point sets; the symbol X indicates 
their topological product. ÆET:::7] denotes the set of points satisfying the 
relations enclosed in the brackets. A four or two dimensional domain will . 
be indicated by a capital letter and the corresponding three or one dimensional 
boundary. by the corresponding small letter. An upper index 7 attached to 
the symbol for a set gives its dimensionality (0 < j < 4); e. g, 3° is a two 
dimensional set. Let @? (k= 1,2) be a domain in the #-plane, bounded 
by a finite number of Jordan arcs gr! (@4° may be multiply connected). The 
product domain # == ©? X Gs? is a four dimensional domain in the (2: za)- 
space. The two dimensional surface F= gi X g is the distinguished 
surface of Y. As noted in the introduction, the Dirichlet problem of deter- 
mining a function defined in 9 which assumes prescribed bounded and piece- 
wise continuous values on {$° can be solved uniquely in terms of doubly 
harmonic functions; in the case of a bicylinder, an explicit form for the 
desired function is given by an iterated Poisson integral (B.).- Hence, if 3* 
is a subset of %° having positive two dimensional measure, there exists a unique 
doubly harmonic function defined in # which assumes the value 1 on 3 and 
the value 0 on ° — %°. This function will be denoted. by w(2,, 223 X, A) 
and is defined to be the doubly harmonic measure of X? with respect to A 
taken at the point {21, 22}. 








(2) 


8. Convergence in Bicylinders. Using the notion of functions of 
extended class, Bergmann has established for domaifis with distinguished 
surface a generalization of a theorem given by Ostrowski for the one variable | 
case (Bz, p. 344). It will be stated here in the restricted case of product 
domains with the aid of doubly harmonic measure.? 


+ 


3 Note that in Bergmann’s statem@nt a summation sign is omitted. 
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THEOREMA 1. Let f(2,2:) be an a. f-2c.v. defined and' regular in a 
product domain À and continuous on the boundary aè of A. Let the distin- 
Š m \ 
guished surface ° of A be composed of m disjunct pieces, F = X X; and 
: kzi 


let o (21, 22; Se*, A) be the doubly harmonic measure of X. If there exist m 
` constants My (k—=1,°--,m) such that f(s %2)| = Me for {2,22} e€ Xx, then 
one has in A the inequality: 


(3) dog | f (4 #)| s > (log Mx)o (25 #25 Se? M). 


For the case m == 2 this theorem becomes a generalization of the so-called 
“two-constant ” theorem (N, p. 41): 


THEOREM 2. Let f(2:,22) and satisfy the hypotheses of Theorem 1. If 
| f (21, #2) |S M on FF and | f(z, 22)| Sm (m < M) on a subset X of P, 
then : | 
(4) . . log | f (21,22) | < plog m + (1— x) log M 
l at all points of the set {Zis 22} ut as ZA) >p, 1p > 01. 


With the aid of Th. 2, the first of the desired convergence theorems can 
be established. | 


THEOREM 3. Given f(%,%) an a. f. Èc. v. defined and regular in the 
closed quarter-space yı = 0, y: = 0 (topological product of two upper half- 
planes), with |f(2,2:)| <1 on the distinguished surface yı = yı — 0. If 
| f(y %2)| <6 (O<e <1) for {2 z2} €E[y, = ya =0, 8 (2, + a) + r? <0] 
where 8.and a are arbitrary positive constants, then |f (21 2:)| < # for 


(5) {21, #2} e E[arg (8(% + a) + 2°) > ur, 15 p> 0]. 
Proof. Apply Theorem 2 with m = «e, M —1. The function 


dy: + Rae 
(z +a) + 247 — yr 


is a doubly harmonio (in fact, biharmonic) function which is 1 on 
B= Ely: = gi 0, 8(a, + a) + a < 0} 


and 0 on the remainder of &° and hence is the doubly harmonic measure of $ 37. 
Two remarks can be made concerning this result. First, the proof can 
obviously be extended without change to the case where z:° and 22° are replaced 
by 24°” and a," respectively. Second, in the limiting’ case where 8 is allowed 
to approach infinity, the parabola 8(z,-+ a) + z4? = 0 becomes the line 


= = arg (Ca + a) + za?) =? arctan 
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Tı — — 4, and the theorem reduces to the ordinary one variable result, the 
convergence being supposed to depend only on the variable zi. .- f 
Put in another form, Theorem 3 says that if a bounded function con- 
verges to zero for approach to infinity in the real plane in such fashion that 
to every e there corresponds a & and an a so that | f(2:,#)| < e for every 
{z:, Za} belonging to the set X° of the theorem, then | f (21, 2)| < e (p a fixed 
positive quantity less than 1) throughout the four dimensional domain (5), 
depending only on the parameters & and a. Thus, if lim e= 0, convergence 


g0 ” 
to zero uniformly in 3? implies convergence to zero in the domain (5) also. 
The pair of linear transformations : 


(6) p me) (a tr), 
(IF) (2,1)? 1 — 3 
map the quarter-space Imf, 20, Imf,=0 on the bicylinder | z | <r, 
|2e¢|<<1. The first transformation takes the points. (æ,—a,0) in tke 
ĉ-plane into the points (r,ret#(e), —r) in the z,-plane, so that the segment 
(— ©,— a) goes into the are (r,ret#(a)) ; here (a) is any suitable function 
` ofa for which ¢(a) —> 0 as a—> œ. The second transformation takes (0,1, 0) 
in the ¢.-plane into (—1,—4,1) in the z-plane. Since such a mapping of 
the quarter-space on a bi-cylinder leaves the doubly harmonic measure in- 
variant (the Poisson integral is invariant under linear transformations), it 
can be applied to the domain employed in Theorem 3 to give a convergence 
theorem for a bicylinder. The first transformation takes Re({,-+ a) into 


2a[ (1 + cos $) | zı — 1 |? — 2yr sing] 


[t+ sa art 
and the second takes Re be into rea -so that Theorem 3 becomes: 


THEOREM 3a. Given f(a, %) an a.f.2c.v. defined and regular in the 
closed bicylinder | z, | £E r, | za | 5 1, with | f(u,%)| = 1 on the distinguished 


surface |z, | =r, | zz | 1. If | f(a, 22)| <e (0 ee 
(7) (4, 4a} E 2a8[ (1 + cos $) | z —r |? — 2yr sing] 


[if o ar 





AY 


+R <lal=r lait], 


where 8 and a are arbitrary positive constants, then | f (215 23)| < e for 


ayy La cE[ arg ets) ) ET 1]; | 
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where o(a) (0 < p <x/2) ts any suitable function of a which tends to zero 
as a tends to infinity. 


4, The Q-domain and its properties. An M-domain is a-four dimen- 
sional domain defined by | 


(9) M= Efn = th(za A), |z| <1, 0St<1, 0OSAS2n, 
h (22, 0) = h (22, r) J. 


Its distinguished surface is E[2, = h (Z2, À), 22 | = 1]. The function A (zs, A) 
is subject to the following conditions : 


(a) (zs, À) is an analytic function of z2; 


(b) h (2a À) is a continuous function of À whose derivative with respect to À 
exists and is finite ; 


(c) each curve zı = h (22°, A) (22° = const., 0 <S A S 2r) has a positive radius 
of curvature at all points, the limit inferior of all radii of curvature being 
positive, and is such that any sufficiently small arc at lies entirely within a 
circle whose circumference cuts the curve only at the endpoints of at, whose 
radius is not less than the distance between these end-points, and whose center 
lies in the interior of the curve.* 


It is possible to extend to Mt-domains a theorem on analytic continuation 
needed in what follows; the result was first established by Hartogs* in the 
_ case of product domains. 


Lemma 1. Given an M-domain defined by (9), assume that Dt contains 
a product domain 8? X E[| | < 1] (S* simply connected). Let f(z, 22) 
be a function satisfying the following conditions: 


(a) (2122) is an a.f. 2c. v. in the interior of the product domain; and if 
2,° is any given interior point of &, then f (21°, 22) ts a continuous function 
of 2, on the circumference | ze | = 1; 

(b) tf |t| = 1, then f(a, la) is an analytic function of z, for zı = th (te, À) 
(OSt<1, 0SAS 2x) and continuous on the boundary 2, = h (tz, À); 








(c) f(R(E, A), te) is continuous on the distinguished surface of Dt; i.e., 


8 Hypothesis (c) can also be phrased in terms of conditions as to the boundedness, 
of the first and second derivatives of 4(#,°,) with respect to A, in a similar manner 
to that used in a recent paper by Bergmann and Marcinkiewicz, Fundamenta Mathe- 
matioae, vol. 33 (1939), pp. 75-94; in particulgr, Lemma 3, p. .80. ; 

+A statement of Hartogs’ theorem will be found in Osgood, Lehrbuoh der Funk- 
tionentheorie, vol. II, part 1, p. 199. : 
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continuous in the variables À and ta (| te | ==1). Then f(z, 22) can be con- 
tinued analytically throughout M. 


Proof. For each interior point of the product domain, the Cauchy in- 
tegral formula for 1 c. v. applied to the variable z, gives 


f(a 22) — 5 eet dt. 
ue ? 
Moreover, for each z, e &?, the function f(z,, t2) can by a second application 


of the Cauchy formula be written as 


3r F(h(ta, A), ta) Oh (te, A) 
f(a te) = z. HN) ay, T 


Combining these two one has for all points of the product domain the integral 
representation 

dt Sr f(h(te, A), te) Ol td 
(10) f(#s a2) =T ef 2 (A (te, À) Oh (tad) ay 


la — o! A es 
{to[=1 





But this last expression is the generalized Cauchy integral for the domain W, 
as given by Bergmann (Bs), and thus represents an a.f. 2 c.v. defined 
throughout Yt. Since the integral agrees with the original function in the 
product domain, it represents the. analytic continuation of f(z,,22) over W. 


5. Convergence in Qt-domains. Because the theory of two complex 
variables possesses no analogue to the Riemann mapping theorem it is not 
possible to pass directly from results for the bicylinder to statements con- 
cerning M-domains. An indirect method of surmounting this difficulty is to 
make use of a domain of comparison (B,)—in this case, of a small bicylinder 
contained in the Dt-domain—and to show that certain hypotheses as to con- 
vergence on the distinguished surface of the M-domain imply conditions as to 
convergence on the bicylinder to which Theorem 3a is applicable. It is known 
by Lemma 1 that any f. 2 c.v., analytic in the bicylinder, which satisfies 
certain hypotheses on the he surface of the M- domain can be con- 
tinued analytically throughout the M-domain. The first step is to insure the 
existence of a bicylinder contained in Mt, by the introduction of suitable | 
normal coördinates (B,). Such a system of coördinates is given by the 
transformations 
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These take all points z, == A (Z:, ào) into 2%, =r and all the inner normals 
to the curves (22°, À), taken at the point Ao, into the direction of the negative 
real axis. Thus by the use of these codrdinates it may without loss of. gen- 
erality be assumed that for a particular value À — const., say À = 0, the value 
of h (22, 0) is independent of Zo and that the inner normal to the curve (42°, À) 
(z2°— const.) has at the point A 0 a direction independent of the value 
of z4. Since by hypothesis (c) on the two dimensional sections Z, == const. 
of the M-domain the boundaries of these sections all have radii of curvature 
greater than or equal to some positive number r, it may now be supposed that 
the M-domain contains a bicylinder | z, | S r; | | € 1 which is tangent to 
the boundary of Ÿ along the two dimensional surface 2; == h (23, 0). 

As a further preliminary, it is necessary to state some results for the one 
variable case. Given a two dimensional simply-connected domain @? whose 
boundary g! satisfies the conditions imposed on the boundaries of the sections 
z: = const. of the M-domain. Then it is possible, given a sufficiently small 
are at of gt, to describe a circle * with center in ©? whose circumference cuts 
g? only at the end-points of a and which has a radius not less than the distance |. 
between these end-points; so that if b> denotes the arc of the circumference of . 
_§ which is subtended by a* and lies outside @?, then the central angle sub- 
tended by b* is not greater than 7/3. By Carleman’s extension principle 
(N, p. 63), the following inequalities are valid for all z e @*- R2: . | 


w(z, at, 6°) > o(z, a, ©- R) > w(2, BM); 


so that the set E[w(z, Bt, R2) > a] is contained in the set E[w(z, a, ©) > a]. 
(Since there is little chance of confusion, w is here used, as usual, to denote 
the harmonic measure specified by its argument). But the equipotential 
„© (z, bt, 8?) = y is known (N, p. 7) to be the circular are interior to Q whose 
“end-points are the same as those of bt and which ‘makes an angle (1 — p)r ` 
with $t. In particular, by reason of the above hypotheses, a semi-circle whose 
end-points coincide with the common end-points of at and bt makes an angle 
not greater than 21/3 with bt; so that for this case x 21/3. Applying the 
one variable form of the two-constant theorem, we thus have the following 
result : he E 


LEMMA 2. Given f(z) defined and regular in a domain ©? whose 
boundary g* has at all points a positive radius of curvature and is such. that 
about any sufficiently small arc a of gt it is possible to describe a circle whose 
center is in $, whose circumference cuts g! only at the end-points of at, and 
which has a radius not less than the. distalice between these end-points. Then 
if | f(z)| S1 on g and | f(z)| <e (0 <e< 1) on a, one has | f(z)| < 4 
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at all points of the domain bounded by a’ and the semi-circle whose end-points 
coincide with those of a+. 


Using this last result, it now becomes possible to pass from a majorant 
on the distinguished surface of an Nt-domain to a majorant on the dis- 
tinguished surface of the bicylinder to be used as a domain of comparison, 
and thus to use the result already obtained on convergence in the bicylinder 
to obtain a result on convergence in the Mt-domain. In what follows, the 
normal coordinates introduced in equations (11) with A—0 will be used 
without further explicit mention of the fact. 

Still considering the one variable case, let gt be any curve Z, = h (22°, A) 
and let z, == h (z, 0) be one end-point of the arc at. Let a circle €? of fixed 
radius 7, where r is the lower bound of the radii of curvature of g? (positive, 
by hypothesis), be drawn tangent to g* at z, = h (Zz, 0) and lying in ©, Then 
the semi-circle G? whose end-points coincide with those of a! cuts off an arc 
on the circumference ©! of ©? whose length is certainly greater than half the 
length of a’, provided the distance d between the end-points of a’ is not greater 
than r. For let e be the are cut off on è, and let a and e be the length of 
at and e! respectively. Obviously, the most unfavorable case is for d — r and 
for a! coinciding with the tangent to g* at z,==/(2°,0). In this case 
dSa< rd/3 and e = rd/4, so that a < 4e/3 and a fortiori a < 2e. 

To summarize these results: By the hypotheses on the M-domain and by 
the use of the appropriate normal codrdinates it is possible to assume without 
loss of generality that W contains a bicylinder | zı | <1, | z: | S1 (ra fixed 
positive quantity) to which it is tangent along the two dimensional surface 
zı = h(22,0) =r, < 1. Moreover, to any arc at == (h (23A), r) (à > 0) 
on a section z» = const., there corresponds an are (ret, r) (8 > 0) cut off on - 
| zı | =r by a semi-circle whose end-points coincide with those of at, the 
length of this latter arc being greater than half the length of a’. Thus the set 








Z2 








18) a 2a8[| (z2, A)—r |2 (1 + Reh (z2, A*)— 2r (Imh (22, A) ) (Imh (22, A*))] 
Rss al [1+ h(a, A*)|? Res à) — 1 |? 
4y? 
F Pion, < 0, 


| z2 | = 1, A* = const., A* 120, | Hz, A*) —r | > 2r | ett —1 |] 


corresponds in this manner to a set on the distinguished surface of the bi- 
cylinder which includes the set (7) of Theorem 3a. Hence, using the results 
of Theorem 3a, it is now possible.to state the following result for the case 
of an Mt-domain : 
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| ~ THEOREM 4. Given f(a, 22) ‘defined and regular in an M-domain given 
by equation (9) and satisfying the hypotheses (a), (b), and (c) following 
equation (9). If | f(z, 22)|S1 on the distinguished surface z, = h (za, À), 
Us | = 1 of M and | f(a, 22)| < e for {2 22} belonging to the set X? given 
by (12), then. | F(a, 22)| < HS for {z, a ro to the set (3) of 
Theorem 3a. 


This theorem has an interpretation for convergence wholly analogous to 
that previously given for Theorem 3. 


6. Extensions and ds These results can be extended’ in at 
least two directions. ‘First, the hypotheses on the set 3? of Theorem 4 can . 
undoubtedly be made sharper if necessary, and the hypothesis that f(z1,%) 
` be regular in the entire Dt-domain can be lightened in accordance with Lemma 
1 to the supposition that the function is merely defined, bounded, and con- 
tinuous on the distinguished surface of df and regular in the intersection of m 


` and a. product domain which includes the particular two dimensional surface 


zı = h (c2, 0), | z2 | S 1 where convergence is to'be studied. Second, reverting 
to Theorem-3, it is possible to use n overlapping parabolas and to suppose, for 
example, that, | f(z,2)| S1 on the real plane yı — ya = 0 and | f(2, 22) 
<ne in the overlapping portions of the parabolas. The doubly harmonic 
measure of the resulting region is obtained merely by addition of. the doubly 
. harmonie measures of the individual parabolas. 
| One of the interesting applications of the theory of functions of two 
complex variables is to the theory of pseudo-conformal mapping; i. e., mapping . 
ofa four dimensional domain by a pair of analytic functions of two complex 
` variables: The results here stated apply only to a single function, but can ` 
i gly be applied, in connection with some results of Bergmann (B;), to the 
an Study. of convergence to the boundary of the pseudo-conformal map of a 
re domain. vt peur to discuss these matters further at a later time. | | 
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ON THE NON-EXISTENCE OF THE EUCLIDEAN ALGORITHM 
IN CERTAIN QUADRATIC NUMBER FIELDS.* f 


a 
5 


By ALFRED BRAUER. 


Introduction. Let P be the field of rational numbers, and, m be a rational 
integer which is not divisible by the square of a prime. If for any pair of 
integers a, B with 8 ~0 of P(m#) a third integer Y of this field can be 
determined such that 
(1) [Nee < LB) 


where N (a) is the norm of a, we say 1 that the Euclidean algorithm exists in : 


the field P(m*) or that the field is Euclidéan. 


The problem of determining all Euclidean quadratic fields has not been . 


solved completely, although this question has béen studied a great deal during 


the last few years. In this paper I prove that the algorithm does not exist 


in certain cases in which the question has remained unsolved till now. 

If a field is Euclidean, then the greatest’ common divisor exists for any 
pairs of integers of this field; thus it is necessary that the class number is 
equal to 1. Dedekind? remarked that this condition is not. sufficient, because 
the class number is equal to 1 in the sed P(V—19 18), although this Sie is 
not Euclidean: 


For imaginary quadratic fields it was- ‘shown by L. È. Dickson * that the- 
Euclidean algorithm exists only in the cases’ m == —1, — 2,83, — 7, and i 
— 11. For real quadratic fields the Fan nagd not et. been. solved com- 


pletely. For 


4 


` (2) | m= 2,3, 5,6, 7, 11, 18, 1%, 19,21, 29, 33, 87, 41,57, —. 2. 


the algorithm exists. This follows from the investigations. of O. Perron! : ms 


A. Oppenheim,’ R. Remak,* E. Berg,’ and N. Hofreiter.* I. Schur? remarked: à 


*Received October 13, 1939. 


Cf. G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, + 


Oxford (1938), pp. 212-217. 


3P. G. Lejeune Dirichlet, Vorlesungen über Zahlentheorie, heravigegeben y Yon R. Pr 


Dedekind, 4. Aufl. Braunschweig (1894), p. 451. 
3 Algebren und ihre Zahlentheorie, Zürich u. Leipzig (1927), pp. 150- 151. 
. *“Quadratische Zahlkürper mit Euklidischem Algorithmus,” Mathematische An- 
nalen, vol. 107 (1932), pp. 489-496. 


3“ Quadratic fields with and withoyt Euclid’s algorithm,” Mathematische Annalen, 


vol. 109 (1934), pp. 349-352. 
: *“ Über den Euklidischen Algorithmus in reell-quadratischen Zahlkörpern,” Jahres- 
. bericht d. Deutschen Mathematiker-Vereinigung, vol. 44 (684), p pp: 238- 250., 
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that the algorithm does not' Sat for m=47. À. Oppenheim +° proved the | 
non-existence for m = 23 and m = 53; N: Hofreiter for m == 14 (mod 24), 
_ and also # for m == 77 and for m = 21 (mod 24) with m > 21, E. Berg ™ and 
J. Fox Keston + for m 321 (mod 4) except i in the cases (2). Some of these 


results are also proved in Hardy and Wright’s book mentioned above in the ` 


footnote i. H. Behrbohm and L. Rédei™* showed that, excepting the cases 
(2), the algorithm can only exist in the following three cases (p and q denote ` 
primes) 
I. m==p==18 (mod 24), 
II, m==p=1 (mod 8), . 
III. m= pq with p = q = 3 (mod 8) or p= q = 7 (mod 8). 


` Using analytical methods, P. Erdös and Ch. Ko™ proved that the algorithm 
does not exist in the cases I and II, if m is sufficiently large. ‘The corre- 
sponding fact in the case III was shown by H. Heilbronn.** Finally, L. 
Schuster ** proved that in the case III, the algorithm exists at most for 
m == 1 (mod 24) except for m = 33 and m — 57. | 
In this paper I improve the theorem of Erdés and Ko for the case I. 
I show by elementary methods that here the algorithm cannot exist for 
p> 109. In the cases p= 13 and p= 37 the algorithm exists; whether or 
not the fields P( V61) and P(V109) are Euclidean, I cannot decide. 
In their paper mentioned above, Erdés and Ko prove that the algorithm 
does not exist in the case I, if the two least quadratic non-residues u and v 
which are odd primes satisfy the condition 


T“ Über die Existenz eines Euklidischen Algorithmus in quadratischen Zahl- 
kürpern,” Kungl. Fysiografiska Sallskapets i Lund Förhandlingar, vol. 5 (1935), Nr. 5. 

3“ Quadratische Körper mit und ohne Euklidischen Algorithmus: Monatshefte far 

, Mathematik und Physik, vol. 42 (1935), pp. 3917400; 

9 OF, loc. oit. 6), p. 351. 

10 Loo. cit. 5). 

31 Quadratische Zahlkôrper ohne Euklidischen Algorithmus,” Mathematische An- 
nalen, vol. 110 (1935), pp. 195-196. 

12 Loc. oit. 8). 

13 Loc. cit. 7). 

131 Existence of a Puctidean algorithm in aiak fields,” Thesis Yale University 

, (1935); cf. Bulletin of the Amerioan Mathematical Society, vol. 41 (1935), p. 186. 

144 Dér Euklidische Algorithmus in quadratischen Zahlkörpern, ” Journal f. d. 
reine u. angewandte Mathematik, vol. 174 (1936), pp. 192-205. 

15“ Note on the Euclidean algorithm,” Journal of the London Af athematigal 
Society, vol. 13 (1938), pp. 3-8. 

184 On Euclid’s algorithm in real quadrate fields,” Proceedings of the Oambridge 
Philosophical Society, vol. 34 (1938), pp. 021-026. 

5 17“ Reellquadratische Zahlkörper ohne Euklidischen Algorithmus,” on f. 

Mathematik u. Physik, vol 47 (1938), pp. 117-127. 
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(3) Bun <p. 


Then they’ use analytical dani for proving that (3) is satisfied for all 
primes which are sufficiently large.* I heré prove (3) for all primes p > 421 
of the form 24n +13 in an elementary manner, using the inequality for the 
least odd quadratie non-residue u modulo a prime p of the form 8n + 5, 
(4) | u< 2{(4p)*" + (ap) #} +1, 
- which I had obtained by elementary methods in a former paper.t® 

After dealing briefly with the method of Erdüs and Ko in $ 1, I prove 
in § 2 that the algorithm does not exist in P(p%), if p is of the form 24n + 13 
and v > 8u. If, however, v < 8u, then the non-existence of the algorithm 
for sufficiently large p follows immediately from (4). The limit for p for 
which this holds can easily be given. In order to replace it by a smaller 
number, I’ show in 83 that the Euclidean algorithm‘ does not exist, if 
p = 24n + 13 > 12696 and v > 6u.. From these theorems, the non-existence : 
of the algorithm follows for the primes of this type, if 24u? < p, or if 
18u? < p and p > 12696. 

In $ 4, I improve (4) to 

u < 25/5p2/5 +. 2-8/5). 2551/5 4 3 for p= 8n -+ 5, 
u < Rp + 49 pi LT for p == 8n + 3. 

For p= 8n + 5 this is still further improved. On the basis of these theorems 
we obtain the result in $ 6 that the algorithm does not exist in P(p%*) for 
p = 24n + 13 > 3 300 000. The primes below this limit must be treated 
“directly. Using the above theorems we can see that the algorithm does not 
exist for p > 109. This direct treatment requires long computation, even if 
- properly arranged. In these computations I have been assisted by my mother 
and my wife. - 2 


1. The method of Erdis and Ko.. Erdös and Ko prove the following 
theorems: 


THEOREM 1. For a prime p of the form dn +-1, the Huckdean algo- 
rithm cannot exist in P(p*), if p can be written in the s form 


(5) . p= qm, + qama, 


where mi, Mz, Qi, Qa are all positive and quadratic non-residues (mod p), and 
where the q; are odd primes which divide qim; to an odd power for 11,2. 


Proof. We write the condition (1) in the form 


18 ber den kleinsten quadratischen Nichtrest,” Mathematische Zeitschrift, vol. 33 
(1930), pp. 161-176. a 
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T E [N(a/B—y)| <1. 
Suppose now : 
| a/B =r +s p*, 
D Cl 


where r,s are rational and x, y rational integers with omy (mod 2). This 
is possible, since y is any integer of the field P(p4) and p= 1 (mod 4). From 
(6) and (7) we obtain 


(8) E E eee < 4. 


Consequently, if for a pair of rational numbers + and s it is impossible 
to determine the rational integers x and y such that the condition (8) is 
satisfied, the field is not Euclidean. Since qim, is a quadratic residue, the 
congruence 2° = q,m, (mod p) has a solution z4. 

We now choosé 

r= 0, 8 == 2;/D, 


and it follows from (8) that 


| pu? — (py — 221)? | < 4p. 
Since here the left-hand side is congruent to 42,7 (mod 4p), we have either ' 


(9) pu? — (py — 22)? = — Ag, 
or, by (5), oy ? : 
(10) po” — (py — Ra)" = 4p — 4q:mi = 4gome. 


We have to show that (9) is not possible. Suppose first that ze= 0 (mod qı). 
Then also py —2z,==0 (mod qı) and qi ‘divides the left-hand side of (9) 
to an even power, but the right-hand side to an odd power. This is impossi- 
ble. Suppose now té 0 (mod g:). Then it follows from (9) that pz? is a 
quadratic residue of qı. This ig impossible, because 


(P/Q) = (G/p) = — 1. 
Thus (9) is impossible. In the same way it follows that (10) cannot be 
solvable. . | 
THEOREM 2. Let p be a prime of the form 24n+ 13. If u and v are 
the two least quadratic non-residues (mod p), which are odd primes, and if 
(11) . 8w < p, | 
then the Euclidean algorithm does not exist in P(p*). 


Proof. If we set 
(13) - p— Bue ed 
and . . 
(13) | p= uv + abe 
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then b, and bz are positive integers, honie of. (i). For primes p = 24n +13, 
the number 2 is a quadratic non-residue and 3.a quadratic residue. Hence, 
from (12) it follows that 


(2b:/p) = (— 3uv/p) — Or —1. 

Consequently, b, is a non-residue. In an analogous manner it follows from 
(13) that b; is a non-residue. Further, from (12) and (18), we have 
, P nd 302 RS bi. : 
Therefore, one of the two numbers b:, bə is odd. Let us first assume that b, 
is odd. Since b, was à quadratic non-residue, there exists at least one odd 
prime q which is a non-residue (mod p) and which divides b, to an odd power. 
Because of (12), the number g is different from u and v. We set in Theorem 1 

qi = U, mı = 80, q: =q, Mo = ÊD;/q. 
Then (12) yields a representation of p which satisfies the conditions of 
Theorem 1. Therefore the algorithm does not exist in P(p#) in this case. 

Tf, however, b» is odd, then there exists at least one odd prime ‘q’ which 
is a quadratic non-residue (mod p) and which divides b» to an odd power. 
Erom Theorem 1 for : ; 

Qi = U, My t, Ga, Mz = 2b2/q’, 
and from (13) it then follows that the algorithm does not exist in B pry in 
thig case. 

The Theorems 1 and 2 will be used in the following. The ae are 
given here again, in the first place in order to show that they are elementary, 
in the second because the Theorem 2 is not given explicitly in the paper of 
Erdés and Ko. There it is assumed instead of (11) that the three least 
quadratic non-residues u,v,w (mod p) which are odd primes satisfy the 
condition | f ; | 

uvw <p 
where n <: 001 is a positive constant. But for the primes of the form 
24n + 13, Erdôs and Ko actually use.only the weaker assumption (11). In 
my paper it will be important that the condition (11) is ue in this case. 


2. Elementary proof of the theorem for large i. We first prove the 
following theorem: | 

Taronrar 3. Let p be a prime of the form 24n +13. If the two least 
quadratic non-residues u,v (mod p), which are odd eee satisfy- the 
condition e 
(14) i ; v > Bu, 


then there does not exist an Euclidean algorithm in P(p#). 
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Proof. We have nothing to prove for p — 13, 3%, and 61, since (14) 
` is not true for these primes. We assume therefore p = 109. 

The least odd quadratic non-residue u modulo a prime p of the form 
8n + 5 satisfies the condition ` 


(15) u< (p+ 4) +2, 

as I have shown in the paper mentioned above.’® Hence, because of p > 96 
(16) p— 8u > p—8(p+ 4)4— 16 > 0, 

since 


(p — 16)? = p? — 32p + 256 > 64p + 256. 

Let 2ku be the largest multiple of 2u which is less than p. The eons 
eight even numbers 
(17) 2(k—3)u, 2(k —2)u, 2(k—1)u, 2ku, 2(k + 1)u, 

2(k-+2)u, 2(k + 8)u, 2(k + 4)u 

lie in the interval {p— 8u: - -p+ 8u} and are therefore positive because 
of (16). They form an arithmetical progression with the difference 2u. It 
follows that exactly. two of these numbers are divisible by 4 and not by 8; 
the difference of these two numbers is 8u. This implies that at least one of 
them, say 4lu, is not divisible by u®. Then 


(18) . (1, 2u) = 1. | | l 


Furthermore, since lu is one of the numbers (17), we have 
(19) ` p— 8u < Alu < p+ Su, 
(20) | p — tlu | < 8u. 


On the other hand, | p—4lu | is an odd integer less than 8u because 

of (20), hence less than v because of (14). It follows that | p— 4u | isa . 
quadratic residue (mod p), since | p— 4lu | is not divisible by u, and all the 
odd positive integers less than v, which are not divisible by u, are quadratic 
residues. Then lu also is a quadratic residue, and therefore ? a quadratic 
non-residue (mod p). Because of (18), l contains at least one odd prime w 
* which is different from u and a quadratic non-residue os p)- Conseguentiy, 
v & w, hence because, of (19) 

4uv = tuw S Alu < p + Bu, 

Buv + uv < p + Bu, 

Buv + u(v — 8) <-p. 
Thus, because of (14) 

` 3wv <a 


38 Loe. cit. 18), Satz 2. 
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Theorem 2 now shows that the Euclidean calgon’ cannot ee in P(p*). 
: This proves Theorem 3. 


I£, on the other hand, the assumption (ai is not satisfied, i e, if v< Su, 
then (4) implies the inequality 


(21). Buv < 4u? < 24{2 (4p)? + 2(4p) + 1}. 
However, if p is sufficiently large, we have 
(22y .  2A{2 (4p)? + (4p) +1} < p. i 


For all values of p for which (22) holds we have because of (21) 
| Bw < p. 


According to Theorem 2 the Euclidean algorithm does not exist in P(p*) for 


these p in the case v < bu we are considering. In connection with Theorem 3 
„this yields the theorem of Erdös and Ko. 


THEOREM 4. If p is a sufficiently” large prime of the form 24n + 18, 
` then there does not exist an Euclidean algorithm in P(p'#). 


Since (4) had been obtained. by elementary methods, we have given a 
proof which is free of analytical methods. More exactly, we see that the 
algorithm does not exist, when (22) is satisfied. We may easily obtain a lower 
bound for p from which (22) holds. We do not give the computation, since 
we shall later obtain a still smaller value of this lower bound. 

As an immediate consequence of Theorem 3 we have 


THEOREM 5. If the least odd quadratic non-residue u modolu a prime 
p of the form 24n + 13 satisfies the condition ?4u? < p, then there does not 
exist an Euclidean algorithm in P(p#). 


Proof. Let again u and v be the least quadratic non-residues (mod p} 


which are odd primes, u < v. If v > 8u, the statement follows from Theorem 3. 
If, however, v < 8u, then 


Buv < 24u5 < ? 
and the theorem follows from Theorem 2. 


3. Improvement of Theorem 3. The Theorem 3 can be improved in 
the following manner: ` 

THEOREM 6. If p> 12696 ts a prime of the form 24n + 13, and tf 
the two least quadratic non-residues u and v a p) which are odd primes 
satisfy the condition 


(28) ` 2>6u, 
then there does not exist an Euclidean algorithm in P(p*%). 
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| Proof. If u = 28, then 24u” = 12696, and the statement follows from ` 
Theorem 5. We assume therefore that 
(24) ‘ u = 29. 
As in the proof of Theorem 3, let 2ku be the greatest integral multiple of 2u 
less than p. We take here the following four integers 
(25) 2(k—1)u, ku, 2k+1)u, 2(k+2)u | 

s — 

which belong to (17). These integers lie in the interval {p—4u---p— 4u}. 
They are all positive because of (16), since p > 12696 > 96. There is exactly 
one of them which is divisible by 4, but not by 8. Suppose that this is 
the number 4lu. Analogously to (19) and (20), we obtain from (25) 


(26) p—4u<4lu < p+ 4u, ` 
(27) - | p—4lu| < 4u < 6u. 
Further, Lisodd. If we have 

(28) ; i (l, u) = 1, 


in accordance with (18), then the statement follows from (23), (26), (27), 
and (28) in complete analogy with the proof of Theorem 3. 
On the other hand, let us suppose that 
: | (1,4) > 1. 
. Then we have 
(i, u) =u, 
since u is a prime; hence 
(29) | Alu = 0 (mod u*). 
From (15) it follows that 
(30) p— du > p—24(p 4+ 4)* — 48 > 0 
since p > 672, and therefore 
(p — 48)? == p? — 96p + 2304 > 5%6p + 2304. 


We consider the interval 


(31) : ° I = {p — 24u; - - p}. 
There lie exactly 4 odd multiples of 3u in I. Let 
(32) ` 3su, “3(s+2)u, 38(s+4)u, 3(sp6ju “| 


“be these multiples. `The even integers : 
(33) p—8(s+6)u, p—3(st+4)u,*p—3(s+2)u, p—3su 


form an’ arithmetical progression „with -the difference -6u. It follows that 


t 
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exactly one of the bet (33) is divisible by 4-and not by 8. Suppose ` 
that this is p— 3tu; then we have 


(34) 4(p— dtu) = 1 (mod 2). 

The integer 3tu belongs to the numbers (32), hence we have ` 
(35) . t=1 (mod 2) | 

and 

(36) | ; 0 < p— Mu < 3tu < p 


according to (31) and (30). | 
Moreover 3tu and 4lu ‘both lie in the interval {p—?4u: - -p + 4u} 
because of (26) and (36). Their ne then is at most equal to 28u. 
Because of (24), we have 
37) || Btu Au | £ 28u.< wt 


oe to (35), 3tu is odd, and therefore different from Alu. It follows 
from (37) that 3¢w dnd 4lu are not both divisible by u°. Hence we obtain 


(38) 3iu 5 0 (mod u’) 
by (29). Furthermore, we have 
(39) ; D. <bu<v 


because of (36) and (23). 
The integer 4(p— 3tu) is odd, sigh divisible by u, and less than v because 
of (34) and (39). Consequently, it is a quadratic residue (mod p) ; so is 3{u: 
It follows that 3¢ and ¢ are quadratic non-residues. But t was odd because 
of (35), positive because of (36), and not divisible by u, according to (38). 
Then ¢ contains at least one odd prime factor w which is a quadratic non- 
residue (mod p) but different from u. For it, we have v = w, and therefore, 
because of (36) 
uv = Buw. < Sut <p. 


The statement of Theorem 6 follows now from Theorem 2. 
From Theorem 6 we obtain at once the following theorem which pape 


Theorem 4: R 


THEOREM 7. If os 12696 ts a prime of the jonni Rán + 13 arid tf the. 
least quadratic non-residue u (mod p), which ts an odd prime, satisfies the 
condition 18u” < p, then there does not exist an Euclidean algorithm in P(p*). 


4. Estimates for the least odd quadratic non-residue.. In my paper. 
mentioned in the introduction, I have shown. that the least odd quadratic 
non-residue u for a prime of the form 8n + 5 satisfies the inequality 


2 
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(40) Hi MN de 


It was mentioned there that this relation can still be improved for primes of : 
the form 8n +6. In this manner, we may obtain 


(41) u < 2{ (2p) + (Rp) + 1. 
We now have to improve (40) and (41) still further. 


‘ THEOREM 8. The least odd quadratic non-residue u modulo a prime p 
satisfies 
(42) j u < 23/5p?/5 4 278/5) . 25p/5 +. 3 for p = 8n + 5, 
u < p + 4% pT for p = 8n +8. 


Proof. _ We have nothing to prove for u = 3 and u = 5; hence we assume 
uv. The even numbers 


p+ip+3,--,p+u—2 
are quadratic residues. For a pof the form 8n + 5, the numbers 
prise <- p—u+e2 


are also quadratic residues. Let U denote the interval {p-- - P +u—i} . 
if p is of the form 8n +3, and the interval {p—u +1: -p—+u—1}. 
for p of the form 8n +5. Then alleven integers of U are quadteti residues. 
Tf z is an arbitrary odd integer such that 


pisrcm for p = 8n + 3, 
1<z<u/2 for p= 8n + ñ, 


then U contains integral multiples of 2z. Let 
(44) k- 22; (k +1)22,---, (k +1— 1)2z 
be those multiples of 2z; then 


(48) 


(45) es[Z] +2 


All the numbers (44) are quadratic residues as even numbers of U, 2 is a 
non-residue, and z a*quadratic residue because of (43). This implies that 
the numbers f 

(46) «kok +4,-+ +, k+i—i 


form a sequence of l non-residues. None of them is therefore a square. Hence 
we may find a positive integer a such tha} 


(47) . @ckSk+l—1< (a+1)% 
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Then it follows from (45) that 
2. z] | i 
= E $ 

(48) . | 


We divide now the interval À == {a*- - - (a + 1)°} into parts; we have 
to distinguish between two cases. 


I. a even: 
We determine the positive integer t such that 
(49) © (a +1)? — (BP)? > e > (a+ 1)? — (RE +2): 


This can always be done since an even square number cannot equal the 
difference of an odd and an even square number and since a= 2. Then 


(RE)? < 2a +1, - 


oy at < [Va]. 


The points 
(51) (a + 1)*— (2r)° (v == 1, 2,° | sY) 


divide the interval A into subintervals. The distance between two such 
consecutive points is 


(52) (@-+.1)?— (2v)*— [(a + 1)*— (2 + 2)*] = 8r +4. 
The largest of the subintervals of A is therefors either the interval 
Ie = {(a-+ 1)? — (2 — 2) - (a+ 1)?— (299), 
or the interval | 

Tey {(a + 1)? — (20)?- + as}. 
Because of (52), we find for the length | I+ | of Iv 
(53) | Iv | == 8 — 4. 

Since a was even, we have 
a? —[(a+1)?— (2 + 2)?] =3 (mod 4). 

Because of (49), we find then | i 

at —[(a +1)? (2 + 2)°] Z3. 
For the length | Irs: | of Its: we have therefore 
(54) | Iva | S8f +4—3—8/ +1 


because of (52). If s” now denotes the maximal length of a subinterval of 4, 
we obtain from (53), (54), and (50) 
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(55) fS84+154[ Va] +1. 
II. a odd: l 
In this case we determine an integer t” = 0'such that 
(56) (a+ 1)*— QE +1)? > @ > (a +1) — (RE + 3)% 
_ This again is possible since the sum of two odd squares can not be a square. 
Then 
S at” +1< Va +1, 
(57) : PA 
at” +1< [Vea]. 
The points | 
(58) (a+ 1)?— (2 +1) (= 0,1, > +, 2”) 
“again divide the interval À into parts, and the distance of two consecutive 
points (58) is given by | i 
(59) (a+ 1) — (2r + 1) — ((a + 1)? — (2 + 8)*) = By + 8. 
~” Consequently, if t” > 0 then either the interval 
Te = ((@ $1)? (8—1)? + (a+ 1)2— (2 + 14) 
or the interval 
lra = { (a + 1)? — (21% +1) a°} 
“ig the largest of the subintervals of A. If #” 0 then I: is the largest 
subinterval of A. If again |I: | and | Zes | denote the lengths of I, and 
Ira, then because of (59) 
(60) rl 88”. 
` Since a was odd, we have 
a — ( (a + 1)? — (2t” + 8)*) = 2 (mod 4) ; 
hence because of (66) 
a?— ((a + 1)*— (W 4 8)*) B2 
and. because of (59) 
(61) | Len | S80 + 8—2 — 81" 4 6. 


het s” denote the maximal length of the subintervals of A. From (60), (61), 
‘and (57), it follows that 
(62) 0° s S80" + 6S 4[ Vea] +2. 
We set now 
` (63) s = Max (s,s”) 


In both cases I and IT, the length of each subinterval of À is at most re to s 
and we have, because of (55) and (62): 
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(64) s S 4[ Va] +2. 

Tf we had now ` 
(65) . . w= Max {a + 2 + [V2a], ezs}, 
where : | 


: ex] for p = 8n + 5, 
(66) a orig 
then all the odd integers S a + 1 + [V2a] would be quadratic residues. 
In the case that a is even, the odd integers 


(67) Kee Diy — (a+ 1 + 2v)(a-+1—2yr) 


for v=—1,2,---,{ would all be quadratic residues because of aa Similarly, 
for odd a, the sad numbers 


(68) ‘(@+1)?— (2 4+1)2—= (a aa ENES 
for y= 0, 1,: : -; t” would all be quadratic residues because of (57). 

In both cases, we have divided the interval A by the points (51) and (58) 
respectively which are quadratic residues according to (67) and (68). The 
length of each subinterval of A was smaller than or equal to s because of (63). 
It follows that each interval of length s which lies in A contains a quadratic. 
residue, and hence A can not contain a sequence of s non-residues. 

On the other hand, we have because of (65). 


(69) u = 28. 


In the case of a prime p of the form 8n + 3, the interval U contained u con- 
secutive integers, and then, according to (69) and (66) at least s complete 
residue systems mod 2z and hence at least s multiples of 2z. For p == 8n +5, 
there appear at least s multiples of z among the u numbers p,p+1,:-:, 
p + u— 1 because of (69) and (66). The same is true for the u numbers 
p,p—1,-**,p—u+1. But here, we have z>1, according to (43), 
and this shows that z does not divide p. Thus, we have also in 
U=={p—u-+1---p+u—1)} at least 2s consecutive multiples of z, 
hence s multiples of 2z. | 

On the other hand, the number of HEADER of in U was ce tol 
because of (44), hence , 

. l=s | 

It follows from (46) and (47) that the interval À ‘contains a sequence 
k,k-+1,---,4-+1—1 of at least s non-residues. This gives a contradiction 
which shows ‘that (65) is not true., Hence - 


u < Max {a+ 2 + [V 2a], es}. 
and then according to (64) 
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u < Max {a +2 + [Via], (AL VF] + 2))}, 
u = Max {a + 1 + [Va], e(4[ V£a] + 2) —1}. 


Because of (48), we have then 


(a) (a) ue es) 
| D. u< Max | (2)" + (22)’ EE te(2pet)* + 2e —1 |. | 


For p of the form 8n + 5, we assume first that p > 2048. For p of the 
form Bn +- 3, no restriction is necessary. 
If we assume that 


(71) u E 2/e2/Bpy2/8 1. 29/8,8/8p1/9(3 1. et) p de — 1 
` and if we determine the odd integer zo so that 

| | 1 (pe\" Lo 1 gy” 

(72) 2(#) +1>0> LE(S) : 


then it follows from (71) and (72) that 4u > zo For p = 8n + 5, we have 
Zo > 1 because of (66), since p > 2048. For z= zo the conditions (43) 
are satisfied. Then it follows from (70) and (72) that | 


u < Max j 12e (5) + 182 (2) Eh à 
STORE a], 


(73) u < Max j 93/52/58 p” + 24e 1/59 1/5 + 1, dep! “(as (6) 


+a AE (E 6) +4 (2) a 


We state now that 


: 1 pe 3/8 8 ae 6 Gi 7 Za 
ee VE pes 6 2-(28/5) (12/5) 3/5 
ae) (5) i(i a e Mae ie LE 
Ki 3:9- «2/6)% (875) pis + 27: 9-(11/5) € 4/85 1/5 + 27 + 81: Q-(4/5) et/5p7 0/5) 
- e (2- (7/5) —- (8/5) po + 3-92- (2/8) 1/8 p a/20) Ve, 
This can be shown as follows. The first i terms of both sides are equal. 
Furthermore i 
.8: 24/5 (4/0) p1/8 < 3: 24/66-4/5) pus % mn 27 i 2-0/8) g (4/0) p1/5 
and | - ` 


16 < 27 + 81 DAD tga, 
Consequently (74) holds. From. (73) and (74), we obtain 
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u< Max {23/5e/p?/s + 24/51/69 1/5 + 1, 
dept (2 (1/5) 18/5) p°/2 + 3-2- Bas (4/20) ) + 9-(8/5) e/5p 1/8 ie 4e — 1}, 
u< Max {28/5 2/5 p2/5 os 94/5 1/6 pus Bose 
25/52/59) 2/5 + 20/8. 8e/59 4 + 2- (6/5) 1/8 p + de 1), 
Hence 
u< 28/8 _2/552/5 + 20/5e9/6p1/5 (3 an 2%e1) + 4e— 1. 
This gives a contradiction to (71), showing that (71) can not hold. Hence, 
according to (66) ; 
u < 29/5p7/8 +. 28/9 . 25- p> +3 for p= 8n + 5 > 2048, 
u < p + 4% ps LT for p = 8n + 3. 
This proves the statement (42) for all primes of the form 8n + 3 and for 
primes of the form 8n + 5 which exceed 2048. 


It remains to prove (42) for primes of the form 8n + 5 which are less 
than 2048. For these primes, we have, according to (15) 


u< Vp+4+2. 
It is therefore sufficient to show that for p < 2048 we have 
(75) Vp EEL < QE pV 4 8/0 . 25/5 + A 


But this follows from 
p+4< (23/5p3/5 4 ace. 25 pV) 2 = 29/5 4/8 4. 92/8. 2578/8 4 2702/0 . 625 p°/8 


which is true, since p < 2048 and therefore P < 25p*/5, Hence (75) holds 
and, Theorem 8 is proved. 
It will be necessary to improve Theorem 8 still further. We have 


THEOREM 9. Let p be a prime of the form 8n + 5 which satisfies 
(76) p> Hr 108, 


Assume that the two least quadratic non-residues u,v which are odd primes 
satisfy the condition 


(77) u <v < Eu. | 

If we set | 

(78) | pæv/u +1, | 

we have ¢ 

(79). u < (p/p)? + (24/p + 4) (p/p) + 9/p. 
Proof. From (77) and (78) we obtain 

(80) | RLp<T. 


For u < 16, the statement (79) is certainly true because of (76). For u > 16, 
we have ° 
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u — (2p—2) > 4 . 
according to (80). Let z be an arbitrary odd integer which satisfies - 
(81) Rp —8 ZE re <u. | 
. We consider the numbers of the form p + hvu with integral hy = 0 which 
lie in the interval {p—v+1,---,p-+»v—1}. We have here yu < v, 
and according to (78) For: 
(82) | hy < PTT F 
At most one of the numbers p + hvu is divisible by 2z. Indeed, if two of 
these numbers were divisible by 2z, so would their difference, which is of the 
form h*u with | h*| < 2p—2, because of (82). But u is a prime and 
therefore prime to’ 2z, according to (81). Hence h* would be divisible by 2z. 
This, however, is impossible because (81) implies ` 
[he | < 2p —2 S 2z. 
Consequently, either all the numbers p + hvu in the interval {p—v +1- p}, 
or all those numbers in the interval {p-::p+v—1} are not divisible by 2z. ` 
Let us assume that {p-::p+v— 1} does not contain a mumber p + hvu 
which is divisible bÿ 2z. In the other case we may argue in exactly the same 
manner. | De 
Let V be the interval {p—u+1---p-+v—1}, and suppose that 
(83) k: 2z, (k + 1)22,- +, (b+ 1—I1)2z 
are the multiples of 2z in V. : Since it has beeen assumed that the interval 
{p---p-+v—1} does not contain a multiple of 2z of the form p + hvu, 
and since the interval {p—u + 1: - -p—1} does not contain a number of 
the form p + hvu, none of the numbers (83) is of the form p+ hvu with 
integral hy. However, all the even numbers of V, except the numbers 
p + hyu, are quadratic residues (mod p). Since 2 is a How residue and z is a 
residue, according to (81), the numbers . 
(84) > k++, k+l—2 
form a sequence of | quadratic non-residues, analogously to-(46). From (83) 
it follows that (45) again holds. _ 
` Let a, A, and s ‘have the same significance as in the proof of Theorem 8. 
We see in the same manner as above, that (48) and (64) hold. Then 


i 4% 
(85) | e<(£) 
(36) | sS AVG] +2. 
If now ; 
(87) wz mas fope [vE Lei}, 


then we could show as above that A cannot contain a sequence of s non-residues. 
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Becausé of (83), V contains exactly 1 integral multiples of 2z. On the 
other hand, V contains exactly u + 0 — 1 consecutive integers, and therefore 


at least hae] multiples of 2z. Consequently, we have 


12 [ES] [FE JE p g 


becanse of (78) and (87), and there lies a sequence k, k + 1,:--,k+1i—1 
of at least s non-residues in À, according to (84). This gives a contradiction 
to the result above, and hence (87).is not true. We have, therefore 


EL, 





u< Max Í a +2 + [Va], 


Since the first expression on the right-hand side i is an integer, we see, because 
of (86) and (85), that j 


uS Max f a+1+4 [Vm], 2E via] nn 


{ [24 [2 2 ( 2p ) 1} 
u< Max { P+ - ve CE +2 +> ; 
Pa AP 8405 stl 
(88) u< Mex} E+N Ea kan . 
Let us assume that 
(89) wee (2)" 4 (4b) (Py +0. 
p/ . \p 27 P 
We then determine the odd integer zo such that 
(90) 7 SP 4.2 > to > E pp, 


Because of (80), we have p ae Y. From (76) it follows that 


(91) p> 211. Be 28-5 = eat 27° tt 
P 


ee = fl 
—1)5 à ‘ 
gince E= ie increasing with s for s = 1. From (90) and (91) we obtain 





a> =p" >[p—1], * 
(92) = [p] >p—1, 


since 2) was an integer 
Furthermore, we have to show that 


(93) | 2B, <u. 
But this is true, since 


2 . 
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Re < ze (2) 4<2 (2) "+ 4<2 (e)" e. 
+ +e <2 ("+ (+ +) (2 aes Za 
because of (90),. (80), fey ne (39). Pw 


According to (92) and (93) the conditions (81)- ‘are satisfied for z = 20. 
Hence it follows trom (88) and (90) that ; 


{ 4p \': 16p \¥ 
(oh <a | (apm) + (Gaps) +2 
1 pnp 4 pq + aop 4 16) à p TE 3 \. 
6 ee aid 16 . ‘ 2\p P 

We now state that 

A 12/5 8/5 : 8 8/5 72/6 4/6 451/85 abs 12/5 8/5 3 8/5n2/5 
(95) ggg POP A gg PPDA PPS + 16 < oer p p + Te pip 

+ TT plegas +27 + GROTTES G pèp??? F stages Y, 


This follows immediately from 3 < 27/8 ‘and 16 < 27, since the first two 
terms on either side of the inequality are equal. We obtain from (94) and (95) 


<a [a ("o(a 
1/pŅ\” 


8 Sp G 2/9/20 4. gpa g-(0/20) ) ye (e $ s ! , 


OO a E CRE 


Since p < 7, each term of the second expression on the right side is not 
smaller than the corresponding term of the first expression. Hence 


eT 


This gives a contradiction to (89) which canñōt then be true, proving (79). 





8 pls 


5. Proof of the non-existence of the algorithm for p > 3 300 000. 
Based on the results, of the preceding section we shall now show that there 
does not exist an Euclidean algorithm in P(p%), if p is a prime of the form 
24n + 13, and p > 3 300 000. 

According to Theorem 6, it is sufficient to assume v < 6u, and hence 
2< p< 7% because of (80). Theorem 9 gives then the inequality (79) for u. 

According to Theorem 2, the algorithm certainly does not exist, if 


(96) Buv == 3(p—1l)w<p 
or because of (79) 
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Act | 2/8 24 1 1/6 9)? 
—1)u? eds B EENE 
en sonecon fal PACHA) LT <r 
We now state that the function of P 
, A . ze 2/5 a LA i 
(8): (0) =3 f 2 (2) 0—1 
24 1/8 1 1/5 A 9 . 2 
+# (one +2 (2) one +2 (pay } 
p \p p P 


is increasing in the interval 2 < p & 7, if p > 33800000 is fixed. Since 
7 (p/p)*/*(p — 1)? is increasing for p > 2, it is sufficient to show that 


ni 0-0 


is increasing for 2 < p & 7, if p > 8 800 000 is fixed. Differentiating with 
regard to p for a fixed p and setting p'/* = y, we obtain 


V (0) = (p= 1) A gyt  2Agpr 4 9p) 


Re ge Ey En EE ypa + 90” b 
(99) 10(p—1) Py (p)= y*p*/* (2p + 8) + y (288 — 168p) + 450 (2 — p). 


-We have to distinguish between two cases 


1) 2<pS3. 
Because of p > 3 300 000 we have y > 20 and therefore from (99) 


(100) 10(p—1)/*p'¥/*¥’(p) > y(40p + 160 + 288 —168p) + 45 : 3*/*(2—p) 
> y(448 —128p) + 90(2—p)> 64y — 90 > 0. 
2) 3<p=T. 
From (99) we obtain 


(101) 10(p—1) py (p) > 8y (2p + 8)+ y(288 — 168p) 
+ 45 «7/8 (2 — p) > 2.2y? (2p + 8)+ y(288 — 168p)— 450 
= y (88p + 352 + 288 — 1689) — 450 > 80y — 450 > 0. 


Because of (100) and (101) we have #(p) > 0 for 2<pS7, hence 
w(p) and (p) are increasing in this interval, if p > 3 300 000 is fixed. It 
then follows from (98) for 2 < p & 7 that 


(102) $(p) < 72 (2)"+ (108, y 26) (e)" 


10368 , 1080 , 9\/p\**, (7776 , 162 ae 1458 
+ $ PEC) tat +) TE 
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_ According to (96), (97), (98), and (102), the Euclidean algorithm does 
not exist in the field P(p*) if , 


š 8/5 
duo 3(p— 1) < gie) <72(2)"4 (2) 


36297 /p\’/ A ey" Hs 
98 (2) + 49 ( +79" 
This is true for p > 3 300 000, since the polynomial 


1280 36257 8910 1458 
» 1/5 
has only one positive root, and assumes a positive value for  — er) A 


as can be seen by a simple computation. 


- @ The case p < 8800000. The primes p < 3 300 000 must be investi- 
gated directly using the Theorems 5, 7, 1, and 2. This investigation is tedious 
but the methods are straightforward. 

If the integer 5-is a non-residue for such a p, then we cannot have an 
algorithm, according to Theorem 5, if p > 24-5%==600. Analogously, we 
can argue-for p > 24:77, if (7/p) —=—1, for p > 24-11%, if (11/p) =— 1, 
and for p> 24-13%, if (18/p) =— 1. 

We can then form the 180 arithmetical progressions which contain those 
primes p = 24m + 13, for which 5, 7, 11, and 13 are residues. We consider 
those primes p < 3 300 000 using the modules 17, 19, 23,- - - as far as it is 
necessary for the construction of the least quadratic non-residue. For most 
of these p, it follows from Theorems 5, ? or 2 that we have no Euclidean 
algorithm in P(p*). The only exceptional cases are for p == 18; 37, 61, 109, 
181, 229, and 421. For p == 13 and p — 37, the algorithm exists, as has been 
mentioned in the introduction. For p= 181, 229, and 421, there is no 
. algorithm, as follows from Theorem 1, because of 


181 = 7:17 + 2-31, 229 = 7- 13 4- 6:23, 421 = 13-194 6-29. 
Whether or not the algorithm exists for p == 61 and pe 109, I cannot decide. 

We have thus the result 

Tasorsm 10. There is no Euclidean diyorini in the field P(p^), if 
p > 109 is a prime of the form 24n + 18. 

Tarore 11. Let p > 421 be a prime of the forni 24n + 13, and u,v 
the two least gaara atic non-residues (mod p), which are odd primes. Then 
we have ‘a 

buy < p. 
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PRINCETON, N.J. : 


POSTULATIONAL BASES FOR THE UMBRAL CALCULUS. F 
BYE E. T. BELL, 


As the somewhat condensed treatment of thé umbral calculus which I 
gave elsewhere’ has been misunderstood ? a fuller treatment than was given 
‘before is desirable. Incidentally, what follows validates the purely formal uses 

- of this calculus, or of its special cases, which have appeared in the literature, 
when such uses give correct results. There are immiediate generalizations to 
abstract commutative rings, obtainable by obvious modifications of the fol- 
lowing; but as such generalizations seem to be of no ‘use at present, it seems 
hardly worth while to develop them. 


1. . Rational operations on umbrae. 


{1.1) Real, or complex, numbers are called scalars. The sign == denotes 
either definitional identity or identity as in algebra; which, will be clear from ` 
the context. | n 

(1.2) Scalars are denoted by small Latin letters with PE AR integer 
suffizes, thus zy (N == 0,1,-°-), or by small Greek letters, a, 8,---. As usual, 
the sum, product of any scalars a, B are a + B, aß, and 0,1 have their usual 
meanings. 


(1.3) Latin capitals, A *,N,::: denote non-negative integers. 


(1.4) If zy (N =0, de - +) are any scalars, the one-rowed matrix 
{Toy Z" * *y%y,* * *) is denoted by z: T= (to t't, x, * *). 


(1.6) The (N -+ 1)-th element, W—=0,1,---, of z in (1.4) is denoted 
by æ: | 
Na (02). 


(1.6) “The z in (1.4) is called an umbra; x is the umbra of (To, %1, * *, 
- +), or of the sequence ty (N =0,1,---). Note that an umbra has 
neither exponent nor suffix. E iy 


(1.7) Equality of umbrae is matric equality: if æ is as in (1.4), and 


* Received April 8, 1940. | š 
14 Algebraic arithmetic,” American Mathematical Society Publications, vol. T 
{1927), pp. 146-159. 
2G. Temple, Journal of the Londo® Mathematical Society, vol. 12 (1937), p. Id. ` 
Professor Temple has seen the present note, and writes (Feb. 21, 1938) that it clears 
up the obscurity. 
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Y= (Yo Ys" + > Ym" * *), “x is equal to y; written z= y, if, and only if, 
ty = YN (Wo, 1,:--). Hence 

(1.71) sc. E 

(1 72) If cy, then y= 2. 

(1. 73) If z= y, and y = z, then z =z, 


(1. 8) The coefficient of 7,%--- 2,57 in the expansion of ve He + on) 
by the multinomial theorem, is denoted by Ms, ..., sp Note that exponents 


. and suffixes 0,1 are to be indicated precisely in the same way as ape 
and suffixes > 1. 


The next refer to rational functions of umbrae, and define ‘umbral | 
scalar multiplication ‘umbral addition, ete. The Seen ‘umbral’ 
will be dropped, as it is taken care of in the notation. 


` (1.9). The scalar product, az, of a and == (%,- ` +, m, * *) is 
aT E= a (To, * $ "5 TN’ e +) = (42o : ‘so AN, * ). 


By definition, ta = ax. 7 
Now az is an umbra, by (1.6), and it is a | compound symbol. To denote 
the (N + 1)-th element of ax in accordance with (1.5), we write oy thus 


(1. 91) {ar} Y = ac’ == agy. 


Similarly, if * is any compound symbol of scalars and umbrae, and if * 
is an umbra, the (N + 1)-th element of * is denoted by {*}". on 


(1.10) "The sum, s,s==aa-+---- &, of aa,’ - -, ér, where 


a= (to't tam *)s 7s r= (za +, 2y,° °°), 
is ; | f 

s= (am +: O H Etot aay + - LE Een," ++), 
‘Hence - ; X 
(1:101) {ant + EN = aay ++: + by; 


(1.102) Addition, e-, of umbrae is commutative and associative ; 7 


(1.108) There is a unique z, the zero umbra, such that ata for 
© every T: , To ; 
= ea p z= (0,3,0, O); 

(1. 104) For every x there is a unique g such that zt y =z; y is called 
the negative of x; y = (—1)z, and is denoted by — z; 


(1.105) With respect to + the set of all umbrae is an abelian group; the 
inverse of z in the group is — v, and the identity of the group is z. 
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(1.11) If no two of a,---,2 are equal as defined in (1.7), a+ -,2 
are said to be distinct. In (1.12)-(1. 126), a, : > ae are distinct. 


(1.12) Ha,- sare T distinct umbrae, oe i er à denotes the 
scalar py, ; 


(1.120) py= (aa +: CH iej ZM,- spn “rag: © Tgm 
(see (1.8)).. In particular; | à ae 
(1.121) | Po = o ` * ` To 

Hence, by (1. 5), | 


(1.122) (aa+:-- Hé) SM 0% : Sas. - +287 
the left of which is called the N-th power of the sum aa F- : --+ é&. Hence 
such powers are expanded by the multinomial theorem, and + is replaced by.. 
-+ in the result. . 

If py is as above defined, and P= (Pp ‘+, py,’ +), then pY — py,. 
by (1:5). Note the distinction, as shown in (1.101), (1.122),.between ` 


fat: té" (aa fe + ue 
only the second of which 4 Jee power; both are scalars.. — o, rs 
_By (1.121), i eo. oo 
(1.128). (at: Hé) 
In (1.122) replace N by N +R. The resulting scalar,- ue 
ad ae A 
ia called the product, i | | | 
Gt te CES +e), 


a 


of < 
(a4 He CEE go): 


(1.124) (aa +: -ep ge)": (ax +: ere (at: + gx) MR, +, 


It follows that this multiplication, - , is commutative and associative, and that 
it has the ‘identity’ (aa +>,- --+ ér)”. The right of (1.124) may be (and 
is) calculated from the left by expanding each of the*factors ( )¥, ()” by 
the multinomial theorem, multiplying the resulting (scalar) polynomials 
together as in common algebra and finally degrading all exponents of small- 
Latin letters to suffixes. For example, noting thari @° == B° — 1, end at mel 
B* E since be B are peat, we HANG : j 


O (aH e CETO T 
“(aabt F path!) - (atab? + 2a Bab? + Ba?) | ru Pigan 
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= gab? + 38a78a7b! + BaBa? + BSa°bs, 
== cash, + 3a Bab, + B48 lbe + Bobs, 
= (aa + 8b)’. 


As a mere convénience of notation we write | 


(1.125) (és): [(aa)™ + (Bb) +- > -+ (ye)"] 
= (É)N (aa) + (£2)N- (BB) Ff > > + (60) > (y0)8, 


the (scalar) sum of ‘scalars on the right defining the expression on the left. 
Similarly for an infinity of scalar summands. 

All in this section (1.12) refers only to the case in which the T umbrae 

a,-++,a@ are distinct. The contrary case is equally important in applications 
of the calculus, and requires special consideration. 
(1.18) If in ar ++: + & there are precisely T summands az,: - >, éz, 
each of which is a scalar product of a scalar and x, we replace (—) the T xs 
by T distinct umbrae, say a,- > -,2, in any order, and indicate this replace- 
ment by writing 


(1. 181) av +--+ +4 iraa t: o H ée. 


Then (aa $- - $ ér)Y is to be calculated by (1.122), and the exponents are 
degraded, as in (1.120). In the result, each of a, -++ , x is replaced (<) by z; 
the resulting polynomial is defined to be N-th power (ax +----+ éx)” of the 
sum aœ ++: ér.. 


For example, 


(az + pr)” —> (aa + Bz)’; 
(aa + BT)? == a?ao + 34 Baer, + 34B lT + Bouts, 

= @L3% + Ba? Barer, + BaB tTa + Pars; 
(as + Ba)? — (05 + B°) Tots + 3aB(a + B)ritz. 


The relation (1.124) holds also for powers (ax: +--+ éc) when 
therein the replacements z> are made. 

Similarly, if in a (+) sum s there are precisely 4 summands each of which 
is a scalar product of a scalar and z,---, precisely C summands each of which 
is a scalar product ofea scalar and w, and if these summands exhaust s, the 
S=A+---+0C as,:--,w’s, are replaced (—) by S distinct umbrae, say 
s—t. Then (¢)¥ is calculated by (1.122), (1.120), and the final replace- 
ment (<-)' of the S distinct umbrae by those introduced by (—). These 
powers (s)% also satisfy (1.124). | 


(1.132) Hence (1.124) holds for any umbrae a,-::,%, distinct or not. 
(1.14) oN was defined in (1.5); it denotes the scalar ty. Hence, since 
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| a of scalars is indicated (as always) by mere juxtaposition, 
without any symbol denoting the operation of multiplication, 


(1. 141) , INTE = EyTp. 


Since this multiplication is multiplication of scalars, it is commutative and 
associative. 

In (aa ++ £r)”, defined in (1.120), take a and each of the 
other scalars —0. Then by (1.9), (1.103), (0a +---+ 1x)" = (x)¥%, 
Note that ( ) is not omitted:on the right. By (1.120), (x) =g. Hence, 
by (1.5), (£) = zx By (1.124), (£)¥- (x)? = (x)NtR, and hence, by 
what has just been shown, oY +g? == gN+R oe gyn, 


(1. 142) ty’ Tr = TNiR. 


Thus, unless typ == ty p, TNUr = Ty' Tr The ‘dot multiplication,’ :, is an 
operation peculiar to the calculus, and will be explicitly indicated where there 
js any possibility of confusion. 

Similarly, (aa +-:-+ ér) (ar -+--+ &)*, without the dot, is the 
{scalar) product of the scalars (aa $: + ér)”, (ax +---+ ér), which 
are defined in (1.222); and this scalar product is different from the dot 
product in (1.124). To see the difference in an example, we compare the 
example illustrating (1.124) with the following: 


(aa + Bb) (aa + Bb)’, 
== (ado + Baobs) (a7debo + 2aBa.b, + B’aob2), 
== aid? + BDD, (241? + dots) + aP7aot, (2012 + bobz) + BP ay*bide, 
7 (aa + BD): + (aa + Bb)? 


(1.16) A particular case of (1.120) occurs so frequently that a special 
notation is convenient. If s == ax -4 -+ az is a sum of precisely A scalar 
products ax, we write ; 

(1. 151) A- ar=s =ar Luz. 


There can be no confusion between the dot in À : «x and that in (1. 124), 
since here the dot is between a scalar and an umbra, while in (1.124) it is 
between two scalars. If desired, the dot in (1.151) may be circled, thus ©. 
It would be incorrect to write Aar instead of À : az, since Aq is a scalar, and 
hence, by (1.9), Aaz is a scalar product. 


(1.16) Umbral multiplication can be defined in many (actually, an infinity 
of) ways to yield algebras simply isomorphic with parts of the common algebra 
of scalars, for example rings. Here we need mention only that species of 
-umbral multiplication which is directly applicable to the power series in § 2. 
It will not be used in the sequel. 
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- (1.161) © o E= (T/0l,m/11, --,æx/Nl,-:) 


r said to be of e-type (e=" exponential’). Hence, if y is of e-type, 

= ÿy/N1. If w is not of e-type, it is replaced by Ù, in which wY = N/N l, 
anti after all calculations involving #. have been completed, when iy is 
replaced by N lwy. 


Let s= (2/01, ce, an / Nl h y= (yo/01, -> + yy/N1,+ ++) be 


of e-type. The product, zy, of z,y (in this order) is the matrix p which is 
guch that 


(1. 162) PCT ; 


í ð . k 5 1 , N at 
(1. 163) g = (CEM, ery .. T EL. : ) 


` Hence umbral multiplication is commutative and associative. Thus 
powers may be defined as usual; A-th power of x is denoted by a), to 
distinguish it from zá, 


2. Power ae The set of all (formal) power series in the sedans 8 
is closed under the four rational operations. Division is immediately referred 
to multiplication, and need not: be separately: discussed. Irrational functions 
of these power series also occur, but as they are of less interest than the rational 
functions, and ‘are readily investigated if desired, they will not be considered 
here. The use of formal (disregard of convergence) power series can be justi- 
fied in detail, if not obviously legitimate in the present connection (for example, 
as in my paper, Transactions of the Americin Mathematical Society, vol. 25, 
1923, 185-54) ;.however, there is sufficient generality in the set of all power 
series in @ convergent in the same domain’ |a] >0 to show here how the 
definitions, ete., in 8 1 give immediately the PERS of. Blissard’s umbral 
calculus. oe 

Ti c= (20, * *,&x,° J we e write 


(2.1) o ot È 2x(6V/N D); 


where ¢ has its usual meaning (2. LES by (1. 5), 
oo 
(2.11) | | pose (AND. 
By ¢ either of these, foot is @ scalar. " Henceyif Alé: =g) isa polynomial in 


é, < +, 9 with scalar coefficients, A = AE, - + ne) is a scalar, as is alsó 
the N-th derivative a,%A of A with respect fe 8. By vs A as 8 me 
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series in 6, we express it in the form re“, and simildrly for the derivative. 
For any A (or its derivative) the appropriate re”? ig: built up by repeated 
applications of the elementary identities (2. 2)—(2. 4) in 6. 
(2.2) 620er — on = È (x 4 y)(6¥/N!),- 
o 
which, by (1.120), is merely the formal multiplication of two MacLaurin 
series to produce a third. Generally, for any number of factors on the left, 
(2.91) efe. + + em == glot.. nt S (ba + + ny) (GN/NT). 
For addition, (1.101) gives | 
DRA A , z e e ES - 7 
(2.2) ` 0 + e mefr am 5 {r Ha (ON), 
with the ions extension to any number of summands. 


Powers are obtained directly from (2: 21), or more rae thence 
by (1.151): 


a A : [ele] molt 600 a (4 £0) (OA. 
For derivation; we have. | hey 
DN of? = ON Py May (OM/M !), 
| 2 À Mon (8X/M1), 
= À aMna(M/M) [by (1.5)], 
= È (Er) (EMMI) [by (1. 124)], 
= (Ne È (Eo!) (OM/MI) [by (128), 
== (éx) ` Gisk l f 
(A | Dgn otet = (ér) Y - ets, 


in complete formal analogy with derivatives of dire (scalar) bal 
functions. From (2.8), (2.4), “7... 


(2.5) Co o DNJA = (A gr) Nod 8 | | 
and from (1.101), (1.120), `> > TS Toi 

ES D RE A I E ne 
(2.6) gtsi[guai 1. . ee ee i 


The coefficient of CM JN ! in the MacLaurin i expansion of the left of (2:6) is 
in fact ` 
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S ie) EN-Say_g[a8ag ++ + -+ yScs], 


È 
= È (g) (eat: Hyos, 
= (ed (ate + ye)”, 


which is the coefficient of 6Y/N! on the right of (2. 6). 
Many of the more interesting applications to special sequences of numbers. 
(like the Bernoulli or Euler numbers), arise in the following simple way. Let 


- A(6,a,- z -,ÿ) : 
(6, a, - ° sy) 
be a rational function of 0, «,: --,y in its lowest terms. Replace a,---,y 


by ae,- - -, ye, and let the MacLaurin expansion of the result be 

(8, aer?, . <, y) 
thus defining the numbers zy a = 0,1,:--). Let the MacLaurin expansions 
of A, be 

. A(8, ae, - +, 679) = ye, D(0, 0%, > >, yet) = fen’, 
thus defining yy, uy. Hence 
ny’ = & (x + u). 
Hence, if #(8) is a polynomial in 8, or a power series, if convergent, 
F(0 -+ y) = E&P (042 + u), 


in which, after expansion, exponents of y, x, u are degraded to suffixes. 

| In practice, the special notations { }, +, ():(),'A:ax are dropped, 
+, () (), Aar being written, as the notation is a sufficient guide to the 

‘correct use of the ‘algorithms. There are many extensions, in particular one 

to multiple suffixes, as in, £a,» o, and the corresponding power series, 


ss 


Finally, everything down to (2.6) goes through unchanged if scalars in 
(1.1) are re-defined to be elements of any commutative ring with a modulus : 
(= identity with respect to we 
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THE ABELIAN QUASI-GROUP#* 
By HARRIET GRIFFIN. 


Introduction. The purpose of this paper is to investigate the abelian 
quasi-group, which is a commutative system of elements closed under a single 
operation, when certain conditions with respect to coset expansions are imposed 
by stated associative laws as explained in the first section. Throughout the 
paper we show how the abelian quasi-group differs from the abelian group. 
In section two we study in particular the minimal quasi-group of units both 
when each element is the unit for an element of the minimal quasi-group and 
also when this is not the case. In sections three and four we study the orders 

. of elements in the case in which the minimal quasi-group is the identity ele- 
ment, and develop a method for setting up a quasi-group with an identity 
element and no subquasi-group other than the idéntity. We show that two 
conformal abelian quasi-groups need not be isomorphic, 

The subquasi-groups of an abelian quasi-group under the conditions 
imposed form a Dedekind structure only in a special case as shown in, section 
five and thus the abelian quasi-group differs greatly from the abelian group. 
Finally in section six we determine a necessary and sufficient condition that 
the cosets of an abelian quasi-group under the imposed associative laws shall 
form a quotient quasi-group. 


SECTION I. 
Associative laws of the abelian quasi-group. 


To facilitate the reading of this paper, we begin with a connected account 
of certain fundamental properties of quasi-groups drawn largely from the 
paper of Hausmann and Ore? upon which our work.is based. We do not 
always follow their exact wording, but we believe that any essential departure 
from their presentation is clearly indicated. 


1. Fundamental definitions and notions of the finite abelian quasi- 
group Q. A groupoid is a system consisting of a set of distinct elements 
d,b,+-: and one binary operation (multiplication) such that to every ordered 


* Received September 1, 1939; Reviagd November 20, 1939. 
1B. A. Hausmann and Oystein Ore, “ Theory of quasi-groups,” American Journal 
of Mathematios, vol. 59 no, 4. (October, 1937), 
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pair of elements a,b b, there corresponds a unique third (the product), C= ab, 
of the set. ; 

If, further, to each siasi Date, a,b there HAE a unique æ such 
that az = b, and a unique y such that ya == b, the HR is called a quasi- 
group. 

Since we are here interested in the spear quasi- group, we impose the 
kadad condition that ab = ba. Then the quotients x and y above are equal. 
| No identity element need exist. However for each element « there is a 
unique ĉa such that ae, = a, called the unit for a. 
pt 

If a is an element of Q, we define the powers of a as a” == a for n = 1 
and a” =at -a for n > 1. The order of a is then the least positive integer 
n for which a” == ¢,. Such a finite power of a exists, since Q is assumed finite. 

Let a,b,:-:;k be any subset of Q. Then there is a least subquasi-group 
of Q which. contains the elements a, b,- +, k. We denote this subquasi-group 
by {a,b,--:,k}. In particular the quasi-group {a} generated by a single 
element is called a cyclic quasi-group. It is to be noted that {a} may contain 
elements other than the powers of a. 


©? Fundamental properties of Q. We wish the abelian quasi-group Q 
to have certain: properties and hence impose the following conditions. 

The expansion of Q by means of disjoint cosets with respect to any 
subquasi-group A is to exist, i. e., Q—=A+qA+:-+-+qnd, where the qi 
are in Q but not in A. The associative law expressing a necessary and sufficient : 
condition for this property as proved by Hausmann and Ore is: 

P,. Ifaand b are any elements of Q and co and dy are determined so that 


: (ab) Cy == ado, 
then for any ¢ | 
` (ab)c = ad, 


. where d is an element of {¢y, do, c}. 
Any element of a coset is to define the same coset. Again a necessary and 
sufficient condition for this property as proved by Hausmann and Ore is: : 
Pı. For any elements a and bof Q 


a (ab)c == ad, 
where d belongs to {b, c}. 

It then follows from P, that c(bC) — (b0)c = bC = (cb) C for any O 
containing c, and consequently each subquasi-group C of Q contains all the 
units of Q' and each coset aC contains its multiplier a. Hence there is a 
subquasi-group of Q contained in all subquasi-groups of Q and containing all 
the units of Q. We call it the minimal subquasi-group Æ of Q. When Q con- 
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tains aù element a such that-a*=— a, thé minimal subquasi-group consists of 


` a which is then the identity element. It is to be noted that the minimal quasi- 


group Æ is generated by any one of its elements, but it is not necessary that 


each of its elements be a unit. 


_ Furthermore it is important to notice that P, implies Po | | 
© The decomposition of Q into cosets is to be transitive, and herein we - 
depart from the interpretation of transitivity given by Hausmann and Ore. 


. By transitivity -we mean. that for -every- A: and -B -such that -Q >-A >B, — 


l generated by any one of its elements, 


Q =A +p +H: Hg, and A—=B+aB+:..+aB, the expansion — 
of Q/B can be obtained by substituting the expansion of A/B in the expansion 
of Q/A. A necessary and sufficient condition for this property is: > ` A 
` Pa. When q is an element of, Q > {a, B} but is not in {a,B}, then ` 
for b in B, gon 
q(ab) = (qu), 

where b, is E T by b. : 


‘Proof: (a). Pri isa “necessary. condition. 
Let | 


Am BHaB+: +8, 
and i l 


| CoCr © + ged, 
where Q > A >B. - 
Then for transitivity of coset Ron 


QB OB + + aB + gB + RnB) +: + Gn (4B). 


Tf e is the unit for aj, qi(ase) = qua; and i is in 1 gu(asB). Bute since a à coset j is 


qe(ayB) = (quay) B. 


However for b in B, (o}s B. Hence, as for B, qs (ay{b}) —_ (quay ({B} 3 


~ 


so that for q in Q and not in {a; B}, q(ab) = (qa)b, where b generates b.. 
(b). P: is a sufficient condition since as b varies over the elements of | 
any B, b, varies over B, and thus q(aB) = (qa)B:- , 
It is to be noted that P, does not govern all the products of elements of 
a quasi-group. Transitivity of coset expansion is not different from coset 
expansion except when applied to the product of elements of a proper subquasi- 


group by an element not in that subquasi-group. This fact is exhibited by 


Table X where having determined subquasi-group H.as 1, 2, 3, and A as 
1, 2, 8, 4, 5, 6, 7, 8, 9, 10, 11, 12, P, governs blocks of products like 13(4F), 
but it does not govern 7(4#H') nor any of the products of 4 through 12 by any 
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r: 
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of 4 through” 12. This last set-of elements forms what we shall call a free 
square in the multiplication table because the products are not governed by Pe. 


TABLE X. ABELIAN QUASI-GROUP OBEYING P, AND P, f L 
5 86 8 9 101112 181416 16 17 18 19 20 21 22 23 24 


otre 


pt 
bo 
Hi hd fend 
ow oO 


pd 
bd 





nl 
ro pO D 
wp oo 


n 
o 


2 
1 
3 
4 
6 
5 
7 
8 












D I oj A a aj mw wj w 
e ot bi ti O O | vu pu paa 
art D OX OG 
to 
BR 





How AP al w we] WO 





oon anf, WN 


wy 
= © 


© 


pa iad 
DIS HY NM KH OOH] ROH 
ND ON mm ee pi pet 


AN oO RAH ON Go 
to 
me 0 


bed pod pot 
Qt oO 


D 
to 


Kam onw Nepos] 
w 
co 


D ON DER DOr OO] Oe oh] Pp 


ut bed feed 
wore © 
pi pi bed 
O m © 
bot 
O wf 


bé od 
OS ND +4 
bi 
ee] 


te 
oo 
rn 





eet ti 
oo ot A 
= = bi 
orn 


ww) e be ps 
OI AO © 
PB ED © OO whe 
D OR OON Fw 
a 


| be 
© 
or 


4 6 5 101211 
5 4 6 111012 
6 5 4 121110 


MO bi] ba pa pu D NN eet Hd pa 
D D ba | pi pod pd bu ji js NNW] ee 
WNrY yowo naa ND OMG ER ADE SSRIS 


4 
6 
9 
1 
8 
3 
l + 
2 : 





SECTION II. 5 
The subquasi-group of units. | 


. In this section we shall omit all proofs which refer to special pice: 
They may be found i in the complete paper. T 


1. The ‘Cayley square. It is to be soi that due to the symmetry of 
the Cayley square of an abelian quasi-group, if an element is not placed in - 
the principal diagonal space of a row, the placing of the element causes a 
- second row to be supplied with the element. On the other hand an element 
‘placed in the principal diagonal space o$ a row eliminates just that row. 
. Hence if the order of the quasi-group, which is the number of elements it 

possesses, Is even, an element which appears in the principal diagonal appears 
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there an even number of times. If the order of the quasi-group is odd, however, 
each element appears just once in the principal diagonal. 


2. The quasi-groups €,. We consider first the minimal abelian quasi- 
group consisting of more than one element, i.e., where there is no element 
` such that a? == a, and such that each element is the unit for one of the ele- 
‘ments of the minimal quasi-group. We denote this quasi-group by Ea, while 
the general minimal quasi-group is denoted by Æ or Æ, where the subscript 
gives the order. 

Since any element has but a single unit, and since in €, each element is 
the unit for one element of Es, when a is the unit for b, b must be the unit 
for some c, c for d, etc. If we continue in this fashion and include all the 
elements of €, upon returning to a, we shall say that the units set up a single 
cycle. Otherwise it is clear that the set of units-will be separated into two or 
more cycles. It is evident, however, that no cycle may have but two elements 
since if a is the unit for b, ab — b, and then b cannot be the unit for a. Hence 
in the case of €, there can be but a single cycle. ‘ 


THEOREM 1. There are no €, of order 2 nor 4. 
THEOREM 2. An €, exists for every n not 3 nor 4. 


Proof. By first setting up the cycle of units, 2 for 1, 8 for 2,---, for 1, 
and then the principal diagonal, we have developed general methods for build- 
ing abelian quasi-groups of any odd order, of odd order greater than 5, and 
of even order greater than 6. €, is a special case. There is no subquasi-group 
in these tables since the unit for an element must be in any subquasi-group 
with it. 

THEOREM 3. All €, and all Es are abstractly identical. 

Proof. Having chosen the units the tables are uniquely determined. 

THEOREM 4. The €, are abstractly identical. 


Proof. The set of units must form a single cycle, and the two seemingly 
- distinct quasi-groups that it is possible to build on a cycle of units are 
cyclic permutations of one another. 


Tueorem 5. Although the set of units for €; must form a single cycle, 
the €; are not abstractly identical. 


THEOREM 6. The €, where n > 7 are not abstractly identical. 
e 


Proof. For any even order we can set up an abelian quasi-group €, in 
which the set of units is broken into cycles 1 through n— 3 and n—2 
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through n. For odd integers (not 8 nor ay of the ree 4n n 3a rer 
method shows how to break the set of units of Em, into cycles 1 through 
2n-+ 1, and 2n-+ 2 through 4n + 3, while for odd integers greater than 5 
of the form 4n + 1 we can set up an En with the set of units broken into 
the cycles 1 through 2n + 2, and 2n + 8 through 4n + 1. In all cases the 
diagonal elements are so chosen that there is no subquasi-group. 

_ These methods together with Theorems 2 and 5 show that there are. at 
least two distinct €, for any n= 7 since a quasi-group in which the set of 
units is broken into two cycles cannot be isomorphic with one in which a single 
- cycle includes all elements. 


3. The order of an element of the minimal quasi-group. 
THEOREM %. No element of €, can be of order 1 nor n. 


Proof. The order of an element cannot be one since a? Æa. It cannot 
be n since the element for which a is the unit cannot occur among the powers 
of a. 


THEOREM 8. In a minimal quasi-group E if an element e is the unit 
for just s = 0 elements of E, the order of e is at most n —s, and it cannot 
be n—s— l1. 


Proof. Let e be the ùnit for s distinct elements. Then since noné of 
these s elements can be powers of e, the order of e is at most n —s, and may 
be n — s if the powers of e exhaust the n — s elements for which e is not the 
unit. If the order of e were n —s—1, the remaining element when multi-' 
plied by e would have to give itself and e would be the unit for s + 1 ele- 
ments. On the other hand if the order of e is less than n —s—1, e need 
not be the unit for any of the remaining elements. 


` COROLLARY. In En an element cannot be the unit for n—1 elements. 


When n is 6 or 7, we have set up Es in which there are elements which 
are units for one or two elements and which are of all orders not excluded by 
Theorem 8. We also have examples of elements which are not units for any 
element of En» such that their orders anplude all integers from 2 through n 
` except n—1. ° 


SECTION TII. 


The abokan quasi-group with an identity element and no subquasi-group 
other than the identity. 


1. Order of I. If an abelian quasi group of order n has no subquasi- 
group except the identity element 1, we designate it by 74. 


THE ABELIAN QUASI-GROUP. : 731 


Tarore 1. There are no abelian quast-groups I, of orders 2, 3, nor 5 
except when they are groups, and there are none of even order > 2. 


THEOREM 2. An I, exists for every odd order > 5. 


Proof. Set up the multiplication table of a cyclic group by putting 1 
through nin the first column and row and by filling in all diagonals from the 
upper right to the lower left with the number in the first row of the diagonal. : 
Make the last column n,1,2,3,---,2—1, and carry the diagonals through 
as before. Then by the method of formation the powers of 2 are 2,3,4,---,n,1, 
so that. 2 is always of order n. Now interchange the elements of.the last row 
with the printipal diagonal eleménts immediately above them. This operation 
makes n'the identity, so call 1, n and n, 1 throughout the table. The order’ 
of 2 remains n. The result is quasi-group L. 

There is no subquasi-group other than 1 since for any element except 
1 and 2, a? = a — 1, and hence any element not 1 generates 2. But 2 is of 
the n-th order. Therefore each element except 1 generates the quasi-group D. 

‘Furthermore the quasi-group is not a group since 2(3-4) — 1, while 
(2-3)4 = 3 and this part of the table remains for all orders greater than 5. 


2. The order of elements of L. The manner of forming L makes it 
possible to find the orders of its elements. 


THEOREM 3. The order of every element a except 1, 2, and n of the 
quasi-group L of order n set up by Theorem 2 is n or n/g + 1, where g is 
the greatest common divisor of a—1 and n, according as a—1 is or is not 
prime to n. 


THEOREM 4. The order of the element n of quasi-group L is less than 
n except when n is a prime and 2 is a primitive root mod n in which case 
the order of nis n. 


3. The order of the elements of In. 
THEOREM 5. There is no element of In of order 2 nor n — 1. 


In some cases it is possible to set up a quasi-group lacking a subquasi- 
group except 1 without interchanging all the elements of fhe principal diagonal 
and the last row as in Theorem 2. In particular except when n is a prime 
and 2 is a primitive root mod n, there is a set of less than n interchanges 
which include the 1 and n which always gives a quasi-group. We have used 
these methods to set up J, when n is 7, 9, and 15 and these quasi-groups 
afford examples of elements of every order except 2 and n— 1. 
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SECTION IV. 
The abelian quasi-group in general. 
We turn now to some facts about the general abelian quasi-group Qn. 


THEOREM 1. If a quasi-group Qn has an element of order n— 1, the 
element generates the quasi-group and there is no identity element. 


THEOREM 2. Two quasi-groups of the same order and having elements 
of corresponding orders need not be abstractly identical. 


Proof. We demonstrate this fact by an illustration. Take R with the 
elements 1, 2, 3, 4, 5, 6 such that the products of these elements in order by 
1 are 5, 2, 8, 1, 6, 4; by 2 are 2, 6, 1, 4, 5, 3; by 8 are 3, 1, 4, 5, 2, 6; by 4 
are 1, 4, 5, 6, 3, 2; by 5 are 6, 5, 2, 8, 4, 1; and by 6 are 4, 3, 6, 2, 1, 5. 
Take § with elements 2, 3, and 4 as above but let the products for 1 be 
6, 2, 3, 1, 4, 5; for 5 be 4, 5, 2, 3, 6, 1; and for 6 be 5, 3, 6, 2, 1, 4. Since 3 
is in each the only element of order five, if the quasi-groups were isomorphic, 

` these elements would have to correspond to each other. But then 4 of R must 
correspond to 4-of 8; 5 to 5; 2 to 2; and 1 to 1. But in #1-1—5, while 
in S 1:16. | 

This fact is interesting since we recall that two abelian groups which are 
conformal are isomorphic. 

Under certain conditions which we discuss in Section VI the cosets with. 
respect to a subquasi-group B of Q always appear in blocks throughout the 
multiplication table due to the fact that Q/B forms a quotient quasi-group. 
In this case the cosets themselves form an abelian quasi-group with an identity 
element consisting of the subquasi-group B, and since the blocks must combine 
as do any elements of the blocks, the blocks obey the associative law of the 
original elements. We can under these circumstances make certain general 

„statements about the order of elements of Q. 


THEOREM 3. The order of any element of Qu with blocks of cosets with 
respect to a subquast-group Bm throughout the table may be only: 


a. Those orders permitted in the identity element I = B of the quotient 
quasi-group Q/B. 

b. 1,2,---, or m times the order of any block except I of Q/B. 

If the cosets themselves form a quasi-group without a subquasi-group 


except J, the theorems of Section ILI apply. But what is the order of a coset 
of the quotient quasi-group if there are subquasi-groups other than 7? This 
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question is the same as asking: What is the order of an element of a quasi- 
group of order n with an identity element where there may be subquasi-groups 
other than 1, and where the blocks of cosets with respect to any subquasi-group 
need not appear throughout the Cayley square ? When there is an identity 
element, beyond the fact that the order of an element cannot be n — 1, and 
that: if n is odd, there can be no element of order 2 (due to the coset expan- 
sions), we can state no law which the order of an element obeys. On the other 
hand if Qn has a minimal subquasi-group of units E < Qna and the block 
formation does not prevail throughout the multiplication table, no elements 
of E are of order n — 1, and the order of an element a not in E cannot be 
n— 1, since if b were the element not among the n—1 powers of a, ab == b, 
and then a would be a unit. 

Herein lies the difference between the multiplication table of the abelian 
group and the abelian quasi-group, for the cosets of an abelian group always 
form a quotient group and hence the elements of the cosets form blocks 
throughout the multiplication table. But more than that, due to the associative 
law a(bc) = (ab)c, the elements take the same positions in the blocks through- cen 
out the table. It is this orderly characteristic of the Cayley square for thes 7 a 
abelian group which distinguishes it from that of the abelian quasi-group. fe if y H 


SECTION V. 
Structure theory. 





1. Structure definition is satisfied. A structure is a partially ordered 
system in which every two elements have a union and a cross cut. In the 
case of the quasi-group Q the union [4:,---, An] of subquasi-groups of Q is «> 
the smallest subquasi-group which contains each 44, while the cross cut_ 
(Ai,°++,4An) is the largest subquasi-group contained in every Aj. It is evl- 
dent that the subquasi-groups of an abelian quasi-group form a finite structure. 


2. Dedekind structure. In order that a structure be a Dedekind struc- 
ture, it must satisfy the axiom: oe 
When 4, B, C are any three elements of a Structure such that 
A <C< [A,B], then 
C = [A, (B,C)]. 


When the abelian quasi-group has the properties given by P, and Ps, the 
subquasi-groups do not in general satisfy this axiom. To show that it is vio- 
lated we use the abelian quasi-group W of twenty-four elements built as the 
direct product of two of its subquasi-groups. We let A be the subquasi-group 
of elements 1, 2, 3, 4, 5, 6; B be 1, 2, 3, 13, 14, 15; C be 1, 2, 3, 4, 5, 6, 7, 
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8, 9, 10, 11, 12; while the minimal subquasi-group of units is 1,2, 3. Although 
[A, B] = W, [4, (B, C)] — A40. Hence the subquasi-groups of W, which 
-obeys the laws of. coset expansion and transitivity of coset ce be do 
not in general form a Dedekind structure. 

. The simple case where every subquasi-group of Q belongs to the single 
principal chain Q > A, > Ag:-: >E is an exception since in this case every 
-element of [A, Aa] is an a, and the Dedekind axiom must be satisfied by the 
method of the following Theorem 2. 

In view of the fact that the subgroups of an abelian group always form 

a Dedekind structure and since coset expansions and transitivity of coset 
decomposition are such outstanding properties of the group, it is interesting 
to find that the subquasi-groups of an abelian quasi-group having these 
properties fail to form necessarily a Dedekind structure. 

If, however, we strengthen P, to read: 
P;. For any three elements a, b, c of Q 


a(bc) = (ab)c, 
where c, is an element of {c}, we have, as proved by Hausmann and Ore, a 
Dedekind structure. 
THEOREM 1. When P, holds, the elements of [A,B] take the form ab. 
Proof. Any a= aea Furthermore: 
(11) (@ab2) = ( (Q1b1) Q2) bs = ( (a182) bs) ba == as (babs) = dads 


where b, is generated by bz; by by b1; ete. 
It follows then as proved by Hausmann and Ore that: » 


‘Torrone 2. When À > B, the abelian subquasi-groups A, B, C of Q 
obeying P, satisfy the Dedekind relation 


(A, [B, CT) = CB, (4,0)]. 


Therefore, as pointed out by Hausmann and Ore, the analogues of the 
Zassenhaus-Schreier refinement theorem, of the Jordan-Holder theorem on 
the invariance of the lengths of principal chains, and of the Schmidt-Remak 
theorem on direct decomposition must hold for the subquasi-groups of an 
abelian quasi-group obeying P, We can say, as does Ore with respect to the 
group, that in these respects the theory of the abelian quasi-group is more a 
property of the subquasi-groups than of the elements themselves. . 
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SECTION VI. 
The quotient quasi-group. 

1. The abelian quotient quasi-group Q/B. When all and only the ele- 
ments of a subquasi-group B of Q occupy the upper left-hand corner of the 
Cayley square of Q, due to P, there are always blocks of cosets of B at the 
top and left side of the table of Q. If these blocks are maintained throughout 
the table, they form an abelian quotient quasi-group with the identity element 
B. In order that blocks of cosets with respect to B be maintained throughout 
the table of Q any element of the coset g.B when multiplied by an element of 
g2B' must give another coset and that coset must be the one that contains 
91Q2, Which is (q.g2)B. Conversely if (9:B)(q:B) == (qiq2)B, there are 
blocks of cosets with respect to B throughout the table of Q. Hence 
(q:B) (q2B) == (qig2)B is a necessary and sufficient condition for blocks of 
cosets with respect to the eubquasi-group B throughout the multiplication 
table of Q which obeys P, and Ps. 


2. The quotient quasi-group under P;. Under the law P, we have: 
(aC) (bC) == ((aC)b)C = ((ab)C)C = (ab)C. 
Furthermore if (aC) (bC) = (ab)C, by applying Ps we have: 
(ab)C = ((bC)a)C. 


But a(bC) is a coset with respect to C, and if it is multiplied by an element 
of C, it must give one of its own set. Therefore 


(ab) C = a(bC). ee 


Thus if C == {c}, for any a, b, c, (ab)c — a(bc:), where c generates cı. 
In like manner if we assume for any a, b, c, that (ab)c—a(bce,) where 
cĉ, is generated by c, it follows that: 


(ab) C = ((ab)C)C = ((bC)a) 0 = (0) (a0). 


Furthermore 
a(bC) = (a (b0) )O = (b0) (aC) = (ab)C. 


Hence it follows that: 


Turorem 1. If for every a, b, c, (ab)c—a(bc;) where c, is generated 
by c, then there is a quotient quasi-group with respect to any subquast-group 
C and further c is generated by cx - 


2 Hausmann and Ore, loo. cit; 
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8. The quotient quasi-group under P.. If the abelian qifigi-group obeys 
the law P,, we do not in general have blocks of cosets throughout the table 
since there are the free squares which are not governed by Ps: ” However it is 
interesting to notice that P, forces products of an element not in a subquasi- z 
group B of Q by the elements of a coset of B/E where E is the minimal 
subquasi-group to give the elements of a coset of B/E. | 

Consider subquasi-groups B and E of Q such that E < B < Q. Then P, 
applies to the product of q not in B by any two elements of B and q(bc) . 
= (qb)cı. When all the cosets of Q/E have been determined, the product qb ' 
is determined only in the case where.b is an e. Take b Æe of the coset bE. 
Let qb == d which may not be in gE nor in bE. Then q (bes) == d = (qb) eq. 
For another e, q(be,) — (qb)er — der, and e- eq. Hence as e, varies 
through the e, so does er, and der must vary through the elements of dE. 
Hence the products of g by bE give another coset dE. These cosets must then 
combine to form cosets with respect to the next larger subquasi-group. : 

It is to be noted, however, that another element of qE when multiplied 
by bE may give the elements of a coset different from dE. This fact: is 
exhibited by quasi-group X, and shows that blocks of cosets with respect to F- 
need not exist in this part of the multiplication table. | 

Let C be a cyclic subquasi-group next larger than Æ. Then all elements 
of C not in E generate C. When we assume g(bc) = (qb)c, where q is not in 
{b, C}, if c is in F, then c, is in Æ and one generates tlie other. But if c is 
in C and not ‘in F, then too cı must generate c, and as c varies through these 
c’s, c, must vary through the same c’s. After repeated applications of this 
argument we may conclude: | 


THEOREM 2. For q not in {b,C} and c in O, q(bc) = (gb)c; where c 
generates cı implies that c, generates c, and where c, generates c, c generates ci. 


If we assume P;, how may we strengthen this law in order to have a 
quotient quasi-group with respect to any subquasi-group of Q? First consider 
the case above where gb — d'and which showed that the elements g(bE) 
form a coset with respect to E but that blocks of cosets are not necessarily 
formed. We may take these as the elements of a row. To complete the block 
we consider that qb == bg and b(geg) = (bq) eng. Then if we change eg, qêr 
varies through the coset qE and the products are to give dÈ so that b(qer) 
must give (bg)e, where e, varies through all the e’s. Hence if the column is 
to be dE, b(ge) = (bq) &. 

If we consider the blocks which are to be obtained by multiplying q, by 
qE where neither qı nor q is in B, a like argument permits us to conclude 
- that if we have a quotient quasi-group with respect to #, then a(be) = (ab)e: 
for any a, b, e. 
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In order #0 obtain a condition for a quotient quasi-group with respect to 
every subquasi-group, we take C > FE with no subquasi-group between C and Æ. 
Then as noted, any c in C but not in Ẹ generates C. We assume that we have 
blocks with respect to Æ. Then (qC)(q,C), where q and q, are not in C, 
must give (qq.)C, i.e, g(qiC) must equal (qq,)C. But q(q.e) = (qq) 61- 
Therefore g (qıc) = (qq:)¢, where c generates c;, for any c of C but not in E. 
Tf we call these elements a set of a row, g.(qC) will fill the column and give 
again c, generated by c. By repeating this process we see that: 


THEOREM 3. When an abelian quast-group satisfies P, and Ps, tf a quo- 
tient quasi-group exists for every O, then for every a, b, c, 
a(bc) == (ab)c:, 
where c generates c. 


Together with Theorem 1 of this section we now have: 


THEOREM 4. When an abelian quasi-group satisfies P, and Pa, a neces- 
sary and sufficient condition for a quotient quasi-group with respect to every 
subquast-group C of Q is: 

For any a, b, c, 

a(bc) = (ab) cx, 
where c generates ci. 


New York UNIVERSITY. 





REFERENCES. 





Oystein Ore, “On the foundation of abstract algebra,” I, Annals of Mathemattes, 
vol. 36 (1935), pp. 406-437; TI, ibid., vol, 37 (1936), pp. 265-292; “Structures and 
group theory,” Duke Mathematical Journal, vol. 3 (1937), pp. 149-174; “On the appli- 
cation of structure theory to groups,” Bulletin of the American Mathematical Society, 
vol, 44 (1938), pp. 801-806. ‘ 

Hausmann and Ore, “ Theory of quasi-groups,” American Journal of Mathematics, 
vol. 59 (1937), pp. 983-1004. 


` THE GAUSSIAN LAW OF ERRORS IN THE THEORY OF ADDITIVE 
NUMBER THEORETIC FUNCTIONS.* + 


By P. Ernüs and M. Kao. 


The present paper concerns itself with the applications of statistical 
methods to some number-theoretic problems. Recent investigations of Erdös 
and Wintner* have shown the importance of the notion of statistical in- 
dependence in number theory; the purpose of this paper is to emphasize this 
fact once again. 

It may be mentioned here that we get as a DR case of our main 
theorem the following result: ' i l 

If v(m) denotes the number of prime divisors of m, and K, the number 
of those integers from 1 up to n for which v(m) <iglgn+oV2?lgign 
(w an arbitrary real number), then | 


w 


_ Kn > 4 

lim = —7TÈ f exp(— u*) du. 
-00 

This theorem refines some known results of Hardy, Ramanujan è and 


Erdôs.* 


1. In what follows p will denote a prime and w will denote a real number. 


Let f(m) be an additive number-theoretic function, so that f(mn) 


—f(m) + f(n) if (m,n) = 1. Suppose thet (ys) F(p) and | f(p)| 31. 
Obviously 


Jimy ar (p). 


Furthermore put X p“f(p) == 4, and ( X pf (p))* = Ba. Then our main 
pon pon. 


theorem may be stated as follows: _ 


* Received December, 7, 1939. 
+A preliminary account appeared in the Proceedings of the National Academy. 

, vol. 25 (1839), pp. 206-207. 
aP, Erdës and A. Wintner, “ Additive arithmetic functions and statistical in- 

dependence,” American Journal of Mathematics, vol. 61 (1939), pp. 713-722. 
? Srinivasa Ramanujan; Colleoted Papers (1927), pp. 202-275. 
“P. Erdés, “ Note on the number of prime divisors of integers,” Journal of HIS 

London Mathematical Society, vol, 12 (1937). pp. 508-314. 
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. THEOoREx. If Ba œ asn — wo, and Ka denotes the number of integers 
m from 1 up to n for which | 


f(m) < An + oV2 By 
then 


lim 2? gt J=- jen DO: 


a> n 


2. We first prove the following 


Lemma 1.° Let 
| fi(m) = f(p)- 


p<i 


Then denoting by 8: the density of the set of integers m for which fi(m) 
< dı + w V2 Bı one has 
i lim 8; = D (o). 
1-00 


Let pp(n) be 0 or f(p): according as p does not or does divide n. Then 
fi(m) = Z pp(m). 
p<t 


Since the pp(n) are statistically independent, fı(m) behaves like a sum of 
independent random variables and consequently the distribution function of 
fa(m) — Ai/V2 By is a convolution (Faltung) of the distribution functions 
of pp(m) — p“f(p)/V2Bi* (p< 1). It is easy to see that the “ central limit 
theorem of the calculus of probability” can be applied to the present case,’ 
and this proves our lemma. ` 


3. Lemma 1 is the only “statistical” lemma in the proof. Using this 
iemma, the main result will be established by purely number-theoretical 
methods. 


LEMMA 2. If my tends to œ (as n— œ) more rapidly than any fired 


5 Loc. cit. 2, where statistical independence of arithmetical functions is defined 
and discussed. See also P. Hartman, E. R. van Kampen and A. Wintner, American 
Journal of Mathematios, vol. 61 (1939), pp. 477-486. 

Cf. for instance the first chapter of S. Bernstein’s paper, “Sur l’extension du 
théorème limite du caléul des probabilités aux sommes de quantités dépendantes,” Mathe- 
matische Annalen, vol. 97, pp. 1-59. "See also M. Kac and H. Steinhaus, “Sur les 
fonctions indépendantes Il,” Studia Math., vol. 6 (1936), pp. 59-66. 
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` power of Sa, then the number of integers from t up to my which are not 
divisible by any prime less than 8, ts equal to 


mye € Mn 
Ig sn ae (= À 


where C denotes Euler 8 constant. 








The proof of this statement is implicitly ae in the reasoning of . 
« V. Brun on page 21 of his famous memoir “ Le crible d’Erasosthène et le 
théorème de Goldbach”? and may therefore be omitted. 

Let #(n) represent a function which tends, as n— œ, to 0 in such a 
way that n?™ —> œ. The function n#(#} will be denoted by a, and nV? ) 
by Bu. Let a,(nm),a2(n),- >- be the integers whose prime factors are all less ` 
than an, and let #(m; n) be the greatest a; which divides m, We then have 
the following . ‘ 





. Leusa 8. The number of integers m&n for which y(m; n) == ai (ñ), 
where m(n) S B, is equal to é 


Cp n 
ai (n) p(n) Ign à i Ign ). 


This is a direct consequence of Lemma 2. For consider all those integers = n 
which are of the form r-a;(n) and such that r is not divisible by.any prime 
<a,. Evidently, the integers thus defined are all the integers <= n for which 
u(m;n) = a(n). Their number is equal to the number of integers r which 
are Sn/ai(n) and not divisible by any prime < a. The restriction 
a(n) < Ba makes n/ai(n) tend to œ more rapidly than any power of as 
and therefore Lemma 2 can be applied (put Min == n/u (n) and ri 

. This completes the proof. 


Lemna 4 The number y of integers = M divisible by an a(n) > Ba 
is less than bH V p(n), where b is an absolute constant. (It follows from this 
that the density of the integers which are divisible ” an a(n) > Bn ts less 
than bV ¢(n).) 


We have ° 





` 


i wim; ima Il pA 3 ee < I ph; 
j m=) eas 
and since | 
lg IE PRAN: 3 PTE p— 2M (mn) Ign. 


p< an 


T Skrifter Videns, Kristiania, 1920. 
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one has 
M 
IL gym; n) < nHn, 
s1 $ 
Hence, finally d 
(Bn)¥ = (nVO™))¥ < Mod, ie, y <LAHVE(n). 


4. Lenya 5. Denote by ln the number of integers from 1 up to n 
for which | 


(i) ` fa,(m) < Aa, oV? Ba,. 
Then 
. 


Divide the integers from 1 up to n which satisfy (i) into classes #,, Ea, + 
so that m belongs to Æ, if and only if y(m; n) = a(n) ; and denote by | F; | 
the number of integers in #3. One obviously has 


h=3|E j= 3 |A|+ 3 |B |. 
i ai <Bn ai > Bn 


By Lemma 4 3 | E: | < bn V@(n) and therefore it is sufficient to prove 
at > Pn 
that nt 3, | E4|— D(w) as n— æ. On the other hand by Lemma 3 


(ii) x (Bt ( elon +o n )) E 
a1<Bn p(n) lew p(n) ign uapa UCN)? 

where the dash in the summation indicates that it is extended over the a’s 
satisfying fa (a1) < Aa, + oV? Ba. In order to evaluate 3’, divide all the 
integers- into classes F, Fat - + having the property that m belongs to F; 

if and only if y(m; n) == a(n) and let {F;} denote the density of Fi. TE 
Consider now the set 3’F;, where the dash in summation has the same meaning 

as above. By putting l =—= «„, and using Lemma 1 we have that {2 Fi} > D (w) 

as n—> œ or {3’F;} = D(w) +0(1). Now © 


(iii) XF; ¥ Fip Y F; 
aSpa a> Êa 
and by Lemma 4 
(iv) { ¥ F} <bVé(n). 
t > Bn 


Furthermore there is only a finite number of aps which are less than 8, and 
therefore { ¥ Fi} = ¥ {Fi}. But 
Gi < Ên t1 < Bn 


{Fi} “ay Be (G- =) = ne Gus ne Gr) | 
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and this implies that 


5 ee 1 A 1 
o Care (seat? e)a 
Finally (ii), (iv) and (v) give 


ETERN ee 1 7 1 
Dio) VE < (mer) + Go) Ext <P +00 


The combination of this formula with (ii) completes the proof of our Lemma. 





5. We now come to the proof of the main theorem. Notice first 
that for m<n, | f(m) —fa,(m)| <1/p(n). In fact, | f(p)| 51 implies. 
that | f(m) —fa,(m)| is less than the number of those prime divisors of m 
which are a, This number is obviously < 1/p(n), since (a,)t/#(n) == n. 
Notice furthermore that | f(p)| < 1 and the well known results concerning 
the sum > p> imply that | An — Aan | <—Cilgp(n) and | By — Ba, | 

pon 


< — 0: lg p(n), where C, and C2 are absolute ‘constants. 

Now choose ¢(n) so that 1/¢(n) —0(B,). Evidently every m&n 
satisfying the inequality f(m) < 4» + oV 2 B, also satisfies, for sufficiently 
large n, the inequality fa (Mm) < Aa, + (w+ €) V2 Ba, In addition every 
mn satisfying fa,(m) < Aa, + (w—e) V2 Ba, satisfies, for sufficiently — 
large n, the inequality f(m) < An + oV? Ba. Hence, by Lemma 5, 


Dileep Slim tet Ae < limsup as <D(w-+¢); 
and this proves the theorem, since € > 0 is arbitrary. 


> 6. The theorem mentioned in the introduction is obviously a particular 
case of our main theorem. It corresponds to the case f(p) —1. Because of 
the large number of applications of v(m) it is of special interest. It should 
be mentioned that the assumption f(p*) —f(p) can be removed; also 
| f(p)| = 1 may be replaced by a much weaker condition. This however, would 
complicate the statement of the main theorem 
We may perhaps point out that Lemma 2 (Brun) is the “ deepest” part 
of the proof and that tħe “statistical ” part is relatively superficial. However, 
the statistical considerations seemed to be suggestive and fruitful in leading to 
new and perhaps striking results. 
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ON THE STANDARD DEVIATIONS OF ADDITIVE 
ARITHMETICAL FUNCTIONS.* 


By PHILIP HARTMAN and AUREL WINTNER. 


1. If p= px denotes the k-th prime number and 1 a positive integer, 
then, on starting with any double sequence of. real numbers ais, put, for every 
positive integer n, 


(1): f(n) Sn) ae 
where | | 
(2) Fal) = 3 ln), 
and | Me 

= 0, if n=<0 (mod p+), 
6) fimes | yo if pe!|n and pain, 


finally f (pk) =a. It is clear that the functions f(n) , thus obtained, and 
only these functions, are additive, i. e., such that : 


(4) flmne) = f(m) + f(n) whenever (num) =1;- (f(1) =0). 
In fact, the series (1) is convergent for every choice of. the double sequence 
{{awx}}, since the series has, for every fixed n, at most a finite number of 
non-vanishing terms. It is also clear that an additive function f and either ` 
of the two sequences of additive functions {fx}, {fr} of n determine each other 
uniquely. The additive functions to be considered will be assumed to be 
real-valued. - 

For a given y == f(n), define yt==f*(n) by placing 
(5) y*—y or y*= 1 according as |y| <1 or |y| 21. 


Then the question as to the existence of an asymptotic distribution function 
of an f may be answered as follows: + ` 


(I) An additive f(n) has an asymptotic distribution function o(z), 


— © LT< a co, if and only if both series ° 
+ | 8 
(6) xh), (62) 2O 
p P ; ; p P 
are convergent. 
* Received April 8, 1940. ° 


1P, Erdds and A, Wintner, “ Additive ‘arithmetical functions and statistical in- 
dependence,” American Journal of Mathematics, vol. 61 (1939), pp. 713-721. 
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It is clear that if fi, is an additive function depending only on the ‘k-th 
prime number (i. e., if it is of the type (3)), then it always has the asymptotic 
distribution function 


1 


(7) ox(t) = mi OTL, (p= pm; m—0,1,-- *). 


— 1 ju) <a P 
Furthermore, it is easy to see that the terms f,(n) of an additive function (2), 
which depends on a finite number of prime numbers, are statistically inde- 
- pendent; so that, in particular, the additive function fs always has an asymp- 
‘totic distribution function é&(r),-—- 0 <2 < œ, and the latter is represented 
by the convolution l 


(8) Ok = 01 a. ¥* e e Boy. 


_ It was shown loc. cit.t that the infinite sum (1) cannot have an asymptotic 
distribution function o(x) unless it has the asymptotic distribution which one 
would formally expect in virtue of (1) and (8), i.e., that (I) may be replaced 
by the following theorem: 


(T) An additive f(n) has an asymptotic distribution function o(s), 
— o <T < ++ oo, if and only if the infinite convolution o, * o2 *: + - of-the 
asymptotic dattbulion functions (7) of tts terms (3) is convergent, in whieh 
case one necessarily has 


(9) i o = 0, ort :, 


2. For the more restricted class of daoa periodic functions (B?), the 
following theorem was recently established : ? 


(II) An additive f(n) is almost periodic (B?) if and only tf both series 


1) 2 
(10,) si), (102) 3 site 
p 11 p P 
are convergent. 
Since this theorem is analogous to (I), there arises the question whether 
or not it is possible to replace (II) by a criterion (IT) which relates to it as 
(T) does to (I). We shall prove that such is actually the case: ` 


(Il) An additive fen) is almost periodic (B°) tf and only if tts asymptotic 
distribution function o(x) possesses a second moment 

+00 
(11) ” f sdo(t) < ©; 


—20 
. 


3P. Erdds and A. Wintner, “ Additive functions and almost periodicity (B*),” 
American Journal of Mathematics, vol. 62 (1940), pp. 035-646. 
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in which case one necessarily has 


(12) MP} = f do(a). 


It is understood that M{g} denotes the mean 


(18) Mig} — Jim À (9(1) H9) ++ + +-9(n)), 


if this limit exists. Incidentally, it will be clear from the proof of (II’) that 
the condition (11) of almost periodicity (B?) is satisfied if and only if f has 
an asymptotic distribution function and is such that 


(14) HEP) < œ, 
where J {g} denotes the upper mean 


(15) i (g} = im sup É (901) +9(2) +:::+g(n)), if g 20. 


2bis. It is very striking that the moment criterion (11) of (II) can 
insure almost periodicity (B?). In fact, there will be given at the end of 
§4bis an example of a series of independent almost periodic (B?) func- 
tions, with the property that the series is convergent everywhere to a limit 
function for which the square mean is infinite, although this function possesses 
an asymptotic distribution which is represented by the corresponding infinite 
` convolution and which has a finite second moment. This example shows that 
the possibility of replacing (II) by (IT) depends essentially on the properties 
of prime numbers, and not merely on the statistical independence of the 
terms of (1). = 
Similar remarks hold for the das of (I) and (T). 


3.. For a given distribution function p(s), — co <z < + œ, and for a 
given positive integer i, let u (p)denote the i-th moment 


(16) Aa f: cape) (it fi |z |‘do(2) < 20). 


Thus, p has a finite standard deviation if and only if pa(p) < «©; in which 
case the square of the standard deviation of p is ` 
(17) v(p) = Ha(p) — mp)? = 0. 


‘Before proving (IT), it will he convenient to establish the ae 
theorem : 


4 : i 
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({II*) An additive f(n) is almost periodic (B°) if and only if the asymptotic 
distribution functions (7) of its terms (8) are such as io make both numerical 
series 


(8) Salon), © (181), Bron) 


convergent; in which case (18), (182) necessarily represent p(o), v(o), 
respectively, where o is the asymptotic distribution function (9) of f(n). 


Notice that the convergence of (18:) implies that v(ox) < œ, i.e. 
pelog) < ©, for every k. | 

For a given distribution function p(z), — œ <r < + œ, let p; p 
denote the non- negative numbers 


. (19) ` -< #=p(—1), g =1—p(1), 
and put : 
j ` 0, if — œ < tS —], 
p(z)— p, if —1<r<0, 
p(t) +i 0<r<1, 
1, f1£z<+o; 


(so that p(z), — œ < & < ++ œ, obviously is a distribution function). Then 


_ one can express the theorem which relates to (I’) in the same way as (II*) 
` does to the equivalent formulation (II’) of (II), as follows: 


(I*) An additive f(n) has an asymptotic distribution function if and only 
tf the asymptotic distribution functions (T) of tts terms: (3) are such, as to 
make the three numerical series 


(20) {= 


à oo oO 
(21) 3 (ox + ox”), 3 pi (ox), 3 v(ox) 
k= ; ki 1 
convergent. | 
In fact, it is known ? that an infinite convolution cı * o2 ** * - is con- 


vergent if and only if the three numerical series (21) are convergent. Hence, 
` (I*) follows from (T). 


4, In order to avoid an interruption of the proofs, there will first be 
proved a relation between the existence of time averages and space averages 
of an arbitrary function g(t), 0S # < œ, which is almost periodic (B3) for 
some fixed A21. The case of number-theoretical functions f(n) may be : 


3B. Jessen and A. Wintner, “ Distribution functions and the Riemann zeta func- 
tion,” Transactions of the American Mathematical Society, vol. 38 (1935), pp. 48-88, 
Theorem 34. 
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thought of as the particular case in which g(t) =f(n) for n—1<i<n, 
where n==1,2,.;- (and f(0) —0, say). As will be seen in §4 bis, the 
theorem to be obtained is not obvious in itself. In fact, it is to the effect that, 
in a case of almost periodicity, the general inequality of Fatou becomes an. 
equality. 

THEOREM. If o(x), — œ < z < + co, denotes the asymptotic distribu- 
tion function of a real-valued function g(t), 0Zt< œ, which is almost 
periodic (B>) for a fired X= 1, then ` 


(22) Milgi = f | 2 |Ado(2) 


-0 


for every positive exponent pd (which implies that 
(23) {gt} — f do(a) 
. | —00 


for every positive p SX, if p is an integer). 


Proof. For a fixed T >0, let or(z), — œ <x <+ œ, denote the 
distribution function of the function (of class Z>) which is equal to g(t) for 
0S t< T and has the period T. Then, by the defintion of the Ai 
distribution function (x) of g(t), 


(24) . © oro as I> 


holds at every continuity point z of a; while obviously 


+00 | r ; | 
Cf le pdoe) = à f | a(t) par | _ 
00 o 
Since, by Fatou’s inequality, 
+00 +00 | - 
f | #[dlimen(z) Slimint f |z Pder(2), 
ie T-H00 T® ee 
it follows that co i . 
v +00 š 
(25) f lepote) SH g < ©. 
= -00 


On the other hand, it is known * that if gz(t), 0 Æ t< œ; denotes, for 
a fixed positive number X, the function which is equal to —X, g(t) or X 


4A. S. Besicovitch, Almost Periodio Functions, Cambridge, 1932, p. 100. ` 
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- according as f(t) < — Z, |g(t)| SX, or g(t) > X, then gx is almost 
periodic (B>) and one can find an. X. > 0 such that 
(26) Mj g— gr} < 6 if XEX,. 


In view of (25), one can choose X. so large that 


-X +7 +00 i 
(27) f | x [do (x) +f la Pdo(z) < e it X Z Xe 
-% x 


Since o is monotone, one can assume that z = + X, are continuity points of 
o(s). Then it is clear from (24) that if e > 0 is given and X denotes Xe 
one can choose a positive T == T (Xe «) = Te so large that | 


(28) Lor(z) —o(r)| </I> for s=+X, (XX). 
In addition, T == Te may be so chosen so large that 


p X X 
9) |f |e Pdon(z)—f laldt)l<s (X=); 
-X Í -5X 
(this is clear from (24) and from Helly’s-term-by-term integration theorem, 


since X == X, is fixed). Furthermore, (26) assures that, since XY = X. is 
fixed, one can choose T = T so large that i 


T 
eo BF l(t) —os(t)Pat<g (XX). 
Finally, since A1{| g |>} exists (< co), one-has 
4 ay 
(31) ZS sopaga] 


if T = T, is chosen sufficiently large. 
Since the definitions of oy(#) and gx(t) obviously imply that 


x T 
Île Piore) 22 f gr Pat —Dlor(— Z) + 1— 07 (X)], 


x 


it is clear from (27), (29) and (28) that 


+00 : ` T 
AS lepät) 2% f | get pat | < £4 Dio) +1 —0(X)]} 
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so that, since X*[o(— X) + 1—o(X)] is certainly not larger than the sum 
of the two integrals on the left of (27), one has 


+00 


ae T 
(32) f lepi) à fopa | <4e pet 





a= aS omii f iaoa | 


Since it is seen from (30) and from Hôldér’s inequality that this y is majorized 
by a function of e which tends to 0 as e—> 0, it follows from (31) and (32) 
that (22) is true for p == À. 

It is clear from this proof of (22) for p= à, that (22) holds a fortiori 
for 0 < p < À, and that (23) is valid for every positive integer y not greater 
* than A; so that the proof of the Theorem is complete.. 


- Abis. That in the Theorem, just Dove the assumption of almost 
periodicity cannot be replaced by a mere average assumption, is shown by the 
following example: 

In terms of a sequence of non-negative numbers 4@;,d2,: °°, define a 
function g(t), OS t < œ, by placing g(t) — 0 for every ¢ not contained in 
any of the intervals n & t < n+ n°1, where n == 1,2,: - -, and g(t) == an if 
t is in the n-th of these intervals. Clearly, the asymptotic distribution func- 
tion, o(z), of g(t) exists for every choice of the, sequence {a,}; in fact, 
o(z) = $(1+ gna), so that the Stieltjes integral on the right of (16) 
exists and vanishes for every value of the exponent p. On the other hand, 
one can choose the sequence {an} so that (i) there exists a finite non-vanishing 
mean value M{g*}; (ii) the mean value M{g*} = + oo ; (iii) there does not 
exist a mean value M{g*} = + œ. In order to see this, it is sufficient to 
choose i 


. Le. E Ge cee Lin if n= 2"; 
(i) a» nm; . (ii) an n; (iii) an 19 if n 542”, 


In all three cases, (22) fails to hold for p— 2, os the integral on the 
right exists for p == 2. > > 

In order to obtain a series of the type estion i in 8 2 bis, it is sufficient _ 
to consider the function g(t), 0Æ t< œ, defined by the convergent series 
p(t) + gaft) +++ +, where ga(f) has, for à fixed n, the value a, or 0 
according as ¢ is or is not in the interval n S t < n 4 n^. 
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5. Besides the theorem proved in §4, a parallel theorem on infinite 
convolutions will be needed. 


` Lemma. If an infinite convolution o,*02*- : - converges to a distribu- 
tion function o for which p(o) < ©, then the series (181), (182) are con- 
vergent and represent m (0), »(o), respectively. 


The proof of this Lemma will depend on the following known criterion: 5 

If a sequence of distribution functions os, 02,° © * is such as to make the 
series (182) convergent (so that, in particular, pa (or) < œ for k—1,2,: - :), 
then the infinite convolution o, * o3 *: - -is convergent if and only if the series 
(18,) is convergent; in which case the distribution function e == 01 # og*: >: 
has a second moment a(o) < oo, and the series (181), (182) represent m(o), | 
v(o), respectively. 

It is easily verified from the definition of the convolution « * 8 of two 
distribution functions a = a(z), 8 == B(x), that | 


(33)° pe(a* B) < œ if and only if pe(a) + pa(B) < œ. 
Furthermore, R 
(84) if m(a* 8) <œ, then v(a #8) = v(a) + v(8). 


The Lemma may now be proved as follows: | 
ie that u2(v) < oo holds for a given convergent infinite convolution 
o==o,*o2%---. Then repeated application of (33) and (34) shows that 


(35) vos) H: + ++ v (on) + v (one * ons t * *) —Y(0) 


holds for-every &. In fact, it is known® that if oo, #o,%- -+ is a con- 
vergent infinite ‘convolution, then the infinite convolution own * one *** *, 
where & is arbitrarily fixed, converges to a distribution function, and that the 
convolution of this distribution function and of o, *o2*- + + *o,% is o. 

Since (35), (17) and the assumption z(o) < œ imply the convergence 
of the series (182), the criterion quoted immediately after the penne shows 
that the proof of the Lemma is complete. | 


6. Next, it will,be shown that, in virtue of the T a of 


® This criterion is implied by the proof, though not the wording, of Theorems 4 
and 6 in the paper of B. Jessen and A. Wintner, loo. oit4, pp. 56-58. The method 
applied there is that of the Fourier-Stieltjes transforms. For the above wording and 
for a proof which does not make use of Fourier-Stieltjes transforms, cf. E. R. van 
Kampen, “Infinite product measures and infinjte convolutions,” American Journal of 
Mathematics, vol. 62 (1940), pp. 417-448; more particularly, (9) on p. 442. 

° B, Jessen and A. Wintner, loc. cit, Theorem 2. - 
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the asymptotic distribution functions o:(2), o:(t), © > + of the terms 
fi(n), fa(n),- ©- of an arbitrary additive f(n), the two series (10,), (102) 
are convergent if and only if the two series (181), (182) are convergent. - 


It is clear from (7) and eee that if k is fixed and p= px denotes the 
k-th prime number, then 


= pu 3 eo" 
provided that the series (36,), (36:) are absolutely aes where it is 
understood that if the non-negative series (362) is divergent, then p2(ox)—= œ. 
Furthermore, it is clear from the Schwarz inequality, ` 


f(pt) (h gy fle)? 1 
ee (2% toy s re = t= icon dr ay > À qy i 
that the absolute convergence of the series (36:) is implied by the convergence 
of the series (362). l Consequently, both relations (36,), (862) hold for a 
fixed & whenever pa(ox) < œ. And (16), (17) show that pa(ox) < œ is 
equivalent to v(ox) < œ (a condition which is certainly satisfied for every k 
whenever (182) is a convergent series). Suppose that (ox) < œ for every k. 
It is clear from (361), (362) and (37) that 


mhor) = = (p 


by the definition (17 ) of » On “ui pa (on) from (36:) into the last 
inequality, one obtains ` 


ren); so that v(m) (1 —F oe) salon, 


(38) v(on) 20,3 ier, where 0, = ES FP 


© On the other hand, since a7) implies that (ox) S valor), one sees from 
(86,) that 
(39) (c SA ier where A, pe ae 

v(ox) SA pon Ta as p— Pr— o. 

The relations (38) and (39) imply that either both series (103), 8) 
are convergent or both are divergent. It follows that ia order to complete the - ' 
proof of the last italicized statement, it is sufficient to prove that if the series 
(102), (182) are convergent, then either both series (10:), ated are con- 
vergent or both are divergent. 

To this end, notice first that, since ab < a? vi b?, 





(40) gy SUM) as gt as 2 
51 va p 2 


~ 
1 


1, 
P ? a pt 
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But the first double series on the right of (40) is majorized by the series (102), 
which is supposed to be convergent; so that, since 
œ% í 1 
3 l~ =l 
peep! p p(p—1) 
the double series on the left of (40) is convergent. It follows, therefore, from 
(86,) that the series (18,) is convergent if and only if the series 


: p s TO) | 
ise, ae eee AS 
is conversant: 

Hence, the statement that either both series (18 ), (10;:) are convergent 
or both are divergent is equivalent to the statement that either both series (41), 
(101) are convergent or both are divergent. Since the difference of the series 
(41), (101) is 

f) 
LR > Pb — 1)? 
it follows that all that remains to be shown is that the series (42) is con- 
‘ vergent if either (41) or (10,) is a convergent series. But both {p>} and 

{(p—1)*} are monotone and bounded sequences of numbers (which tend,. 
in fact, to 0). Hence, it is seen from a standard convergence criterion ( partial 
summation), that-the convergence of either of the series (41), (101) implies 
the convergence of the series (42). 

This completes the proof of the last italicized statement. 


7. The proofs of (II*), §.3 and (IT), § 2 are now immediate. In fact, 
it is clear from the Lemma of § 5 and the Theorem of 8 4 (where A = 2m y), 
‘“™that, because of (I’), $ 1 and the italicized result of § 6, both (II*), $3 and 
~ {IT}, 82 are equivalent to (II); 82. 
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ON THE ALMOST PERIODICITY OF ADDITIVE NUMBER- 
_ THEORETICAL FUNCTIONS.* 


By PHILIP HARTMAN and AUREL WINTNER. 


1. By an additive function f — f(n) is meant a sequence f(1),f(2),: - 
defined for every positive integer n in such a way that- 


(1) (ming) = f(a) + f(a) whenever (m,n) 1; (f(1) 0). 


Thus, if p == pe denotes the k-th prime number and I is a positive integer, 
the correspondence au — f(x") establishes.a one-to-one correspondence be- ' 
tween arbitrary additive functions f and arbitrary double sequences of numbers 
{{aw}}. With every additive function f(n), there is associated the sequence 
fi(n), fa(n),: > - of additive functions, where the double sequence {{f;(px!}} 
of f;(n) is defined as follows: | 


(pet) if km, 2, +, tn. 
(2) fi (De) 0 if k> j, | (t= 1, 2, ). 

It is known? that the real additive function f(n) has an asymptotic dis- 
tribution function if and only if both series _ 


F(p) . f*(p)? 
3 3—4; _ (82) zy 
OR G) a 
‘are convergent, where y* == ft is defined by placing | 
(4) y =y ory according as |y| <1 or |y|=1. 


> 
It is also known? thet an additive function f(n) is almost periodic 
{B) if and only tf both series 


$ (ee 1312 
(51) 3/@), ~ (5a) 33 Fp) 
p P wti p P 
are convergent. _ | i | 
By a suitable modification of the proof of the latter theorem, it will be : 
shown in the present paper that an additive function’ fn) is almost periodic’ 
(B) tf and only if the four series 


* Received April 8, 1940: | in À 
1P, Erdôs and A Wintner, “ Additive arithmetical functions and statistical in- 
dependence,” American Journal of Afathematios, vol. $1 (1939), pp. 713-721. : 
3P, Erdôs and A. Wintner, “ Additive functions and almost periodicity (B*),” 
American Journal of Mathematica, vol. 62 (1940), pp. 685-645. . ; 
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(61) 3 tlp). (62) S | FF, (63) S 3 fle} (64) z | f(p) 
» P > P mp P > olz P 
_are convergent. 
Notice that the exterior summation index runs in (52) and (6s) from 
l = 1 and l == 2, respectively. Mn —d 
Tf fm fi + ifn, where fr, fu are real, then f is additive and almost 
- periodic (B) if and only if so are both functions fr, fu. Furthermore, since 
(4) implies that 
. lf |S (fr)? + (fut)? S2 (P| F= fit tn), 
the 4 series (6:), (6:), (6s), (6s) are convergent for f = fr + tfn if and only 
if so are the 4 -+ 4 series which one obtains by writing fr and fr for f. Hence, 


it is sufficient to prove the italicized theorem for the case of a real-valued f. 
This restriction will always be assumed. f 


2. In order to prove first-the sufficiency of the criterion of the italicized 
theorem, suppose that the four series (6,)—(6,) are convergent. 

Let F(n) be the, additive function for which the double sequence 
{{F’(px?) }} is given by | 


fæ) if |f) 21, 
@) F(p)= | F(p) — F(p) if FCP) <L (P= Pr). 
Since the proof given loc. ctt.® (beginning of § 5) for $ 
(8) | Ja sE 3 


on the assumption of the convergence of the series (52) actually uses the 
convergence (6,) and (6,) only, (8) is satisfied. Hence, 


(9) im 3 E0 5 
Le J>% Fr pt 

Since obviously 

no co n f 

3 3 IrIS 3 — | F(p) 

mat pla à =1p>3 P- 


for every j, it follows from (9) that 


(10) lim sup + Š 3 | F(p')| 0 as j> oœ. 
n->00 os . 
eon e 


But if F;(n) denotes the function which belongs to F(n) in the same way 
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as the function f;(n), which is defined by. (2), EA to TO then, since 
F and F} are additive functions of n, 


= | Fm) —Fy(m)| S23 3 5 FEI 


Hence, (10) implies that 
(11) H{|F—F,|}—0 as joo, 


where M {g} denotes the upper mean value 
= : 12 
H {g} = lim sup = 2 g(m) 
#00 m=i 


of a non-negative function g of n.~ 
Let G(n) denote the additive function 


(12) G um f — F, 
Then (7) implies that, for every prime p, 


0 if [ f(p)| 21; 
(13) HO) — À Hep) Sile 
so that | | 


60) à fe 

p P?  Iftpl<1i P 
Since the series on the right is, in view of (4), majorized by the series (62), 
which is supposed to be convergent, it follows that f 


2 
(14) - 3 Her < o. 
; » P f i - 

On the other hand, it is clear from (13) and from the convergence of the 
series (6,) and (64), that | 
(15) x se ) je convergent, 
since i | 

' si _ 4) +s f). 

? P ; P It(p)|21 Pe 
But (14) and (15) imply, as shown loc. ctt., § 8-§ 8 bis, that 
(16) H{| G—G; |} +0 as j> w, 


where G,(n) denotes the additive function which belongs to G(n) in the same 
way as the function f;(m), which $ defined by (2), belongs to f(n). 
Since (12) obviously implies that G; = f; — Fy, it is clear that 
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#{|f—fy |} ERR |} + #{| @—G |}; 


it follows therefore from (11), (16) and from an application of the Schwarz ` 
inequality to M{| G — G; |}, that 


ay Hilft |} as j> w. 


Finally, it is known ° that if an additive function g(n) is, with reference . 
to a fixed prime number p = yj, such that - 


(18) g(n) —0 whenever p;fn, 
then g(n) is almost periodic (B) if and only if 
a n 1 
(19) S vi rae 
1 


‘But it is clear from the definition (2) of the additive functions f; of n, that 
the additive function f of n which is defined for j = 1,2,--- by 
(20) FO (n) = film), FO (n) = fa(n)— fln), FO (n) =fa(n)—fa(n),* > 


is such that (18) is satisfied by gf‘. Furthermore, it is seen from the 
‘definitions (2), (20) of the additive function f of n, that the convergence 
of the series (63) implies that 


(N (pt i 
3 it Hors every p= pj. 
1-1 


"This means that (19) is satisfied by g =f‘) for every j. Consequently, f') is 
almost periodic (B). Since (20) implies that 


frm POFFO $e EPO, 


and since the functions which are almost periodic (B) form a linear space, 
it follows that the additive function f; of n is almost periodic (B) for every j. 
Hence, it is clear from (17) that the poor for the almost periodicity (B) 
of f(n) is now complete. 


3. This proves that the convergence of the four series is a sufficient 
condition for the almost periodicity (B) of f. In order to prove the necessity 
of this condition, suppose that f(n) is a given real additive function which 
is almost periodic (B). 

Since f(n) then has an asymptotic distribution function, both series 


2E. R. van Kampen and A. intner, “ Onethe almost periodic behavior of multi- 
plicative number-theoretica] functions,” American Journal of Mathematios, vol. 62 
{1940}, pp. 613-626, Theorem II, à = 1. 
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(31), (32) are convergent. And, in view of (4), the convergence of (3) 
implies that 


(21) 8 3 tco. 
(le P 
In terms of the given f(n), define an additive function D(n) by placing 
(22) D=f—H, 


where H = H (n) denotes that additive function for which the double sequence 
{{H (pa!) }} is given by 


| F(p’), tf 151 
(23) H(p') = 4 f(p)  1=1 and | f(p)| 21, 
0 if.J—1.and | f(p)| <1. 
Accordingly, 


0, if 141 
D(p!) = f if l= 1 and |f(p)|21 

f(p), if l= 1 and |f(p)| <1. 
and so it is clear from (21) and from the convergence of the series (3,) and 
(32), that one obtains four convergent series by writing D for f in (6:)-(6,). 
It follows, therefore, from the result proved in §2, that D(n) is almost 
periodic (B). Since f is almost periodic (B) by assumption, one sees from 
(22) that H is almost periodic (B). In particular, 


(24) {| H|} < œ. 


But (21) and (24) imply, after an obvious adaptation of the estimates carried 
out loc. ctt.?, $ 11, that 


co 5y] 
. (25) I3 HOD 2 we, 
‘ 1 p Pp a 


this series (25) being the analogue of the last series loc. ctt.*, § 11, in case 
the almost periodic class (B*) is replaced by (B). 

It is now easy to prove the convergence of the four series (6:)— (64). . 
In fact, it is clear from (23) that (25) may be written in the form 


HOT p 2 
Furthermore, both series (81), (32) are in view of the existence 
of the asymptotic distribution function of the function f.. Since (32) is 
identical with (6.), while (26) implies the convergence of (63) and (64), 
the convergence of the three serieæ (6,), (62),°(63) follows. Finally, since 
(21) and (26) imply the absolute convergence of the series 
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ie P i(p) 
27 — 38 =; 27 03 EL 
Phy) ir) lz P | ( 2) Item P’? 


respectively, the convergence of (6,) follows from the convergence of (34) 
and from the fact that, in view of (4), the series (6,) may es. be 
-written as the sum of the three series (8:1), (271), (272). 


4. A careful perusal of the proofs, applied Loc. cit.? and sears: shows 
that, by standard applications of the inequalities of Hélder and Minkowski, 
one can generalize the criteria (5,)—(52) and (6:)-(6:) of the respective 
almost ‘periodic classes (B?) and (B) = (B+) as follows: An additive func- 
tion f(n) is almost Epona (BY) fora fized À Z 1 if and only if the four 
seriés 
(28) sf); (28) gL PI (8) Ss EOP. as) x [tP 

"p P p P mp P Male P 
are convergent. (Correspondingly, the convergence of (284), (282), (28s) 
‘is equivalent to the convergence of the single series (52), if à == 2.) > 
_ À consequence of the italicized theorem is that if the double sequence 
{{f (px!) }} of an additive function f(n) is bounded, then f(n) either is almost 
periodic (B>) for arbitrarily large À or is not even almost periodic (B) = (B+). 
This is not obvious in itself, since f(n) is not in general a bounded function 
. when its double sequence is bounded. 
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ON THE SPHERICAL APPROACH TO THE NORMAL — 
DISTRIBUTION LAW.* 


By Pam HARTMAN and AUREL WINTNER. 


. Introduction. There are two classical “ geometrical ” approaches.to the 
normal distribution law. One of these is represented by the theory of the 
addition of independent random variables or, equivalently, by the theory of ` 
convolutions. This approach, followed in a general and precise manner by 
P. Lévy and his followers, is based on the consideration of a product distribu- 
tion on an n-dimensional cube, a distribution which is then projected orthogo- 
nally on the principal diagonal of this cube.1 The other approach, which is 
due to Boltzmann and is reproduced in some of Borel’s elementary text-books 
on the calculus of probability, has as its starting point, not the theory of in- 
dependent random variables, but rather the simplest model of the Maxwell 
theory of velocity distributions.* This approach, which plays a fundamental 
rôle in the investigations of P. Lévy ë and N. Wiener * in functional analysis, 
is based on the consideration of the equidistribution on the surface of an n- 
dimensional sphere, a distribution which is then projected orthogonally on a 
diameter of this sphere. 

If the unit of length is increased in the proportion 1: Vn in case of the 
first approach, and decreased in the same proportion in the second approach, 
there results, as n —> œ, a symmetric normal distribution in both cases. 

It is known * that the Fourier-Stieltjes transform of the equidistribution 
on the surface of an n-dimensional sphere of radius r is the Bessel function 
I*yn2(1|U])/J* in. (0), where J*» (2) —=2*Jy(z), and that this Bessel funce 
tion is also the Fourier-Stieltjes transform of the 1-dimensional distribution 
which represents the projection on a diameter.. In fact," any 1-dimensional. 


* Received April 15, 1940. | 

Ct, e.g, Lévy [11]. As to the geometrical interpretation of the convolution 
process by means of orthogonal projections, cf. Sommerfeld [17], where the simplest 
case of the “Abrundungsfehler ” is considered. 

*Cf. Boltzmann [2], vol. 2, pp. 96-100 and, e. g., Borel 131, pp. 44-50; also Borel 
[4], pp. 90-93; and Borel and Deltheil [5], pp. 134-136. 

3 Lévy [12]; also Lévy [13]. 

t Cf., e. g„ Wiener [18], pp. 135-143. 

š Ci., e.g, Wintner [20], p. 313, where references are given to the principle of 
Huyghens; Jessen and Wintner [9], p. 59; Wintner [21]. Some of these things were 
recently rediscovered by Schoenberg® (e. g., Schoefberg [15], Lemma 4); cf. also 
Blumenthal [1]. i 

8 Of., e. g., Jessen and Wintner [9], p. 65; Wintner [21], p. 76. 
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distribution which represents the projection of an n-dimensional distribution 
function of radial symmetry (i. e., a distribution which is built up by means 
of an arbitrary Stieltjes weight factor from spherical equidistributions be- 
longing to varying r) has the same Fourier-Stieltjes transform as the projected - 
distribution. 
In view of the multi-dimensional * analogue of Lévy’s inversion formula 
-of Fourier-Stieltjes transforms, this spherically stratified decomposition of an 
arbitrary distribution function of radial symmetry into spherical equidistribu- 
tions is known to be equivalent to the Se formula for spherical 
-waves in n-dimensions.® 
The results of the present paper concern certain questions connected 
partly with the above topics and partly with a problem ° suggested by the 
most primitive approach to Maxwell’s law of velocity distribution. If 
- (Ve, Vy, Vo) denotes the density of probability at the point v == (vo, vy, Us) 
of the velocity space, then Maxwell’s assumptions imply that 6 is a function 
Of | v |= (v + vy? + v,2)à alone, that the probability densites of each of the 
velocity components vs, vy, ve depend on the respective components alone, and 
that the latter densites are represented, up to adjusting factors of propor- 
tionality, by the same function 8 as the probability density of the speed | v |. 
In fact, this condition of the preservation of the density function under 
projections is obviously satisfied if log 8(| v |) is proportional to | v |*. There 
rises, therefore, the question whether or not this property of preservation of 
the probability density under projection is in itself sufficient to assure that 
8(|v|) defines the Maxwell distribution. The result of § 5 will imply that 
the Maxwell law may be deduced from this functional condition alone. 
§ 3 deals with the class of distribution functions which may be represented 
ns stratifications of a given sheaf of distribution functions. The results obtained . 
in this section are illustrated in 8 4 by their application to the special case of 
stable distribution functions.. The simplest and least restricted case of this 
particular case is the one where the underlying stable distribution is normal. 
This limiting case will be separately studied in § 2 by an elementary approach. 
As to this approach, which in § 3 will be extended to the general case, 
a few methodical remarks seem:to be of interest. Recently, Schoenberg +° has 
rediscovered the above-mentioned Cauchy-Poisson decomposition and, in par- 
ticular, the fact that the Fourier-Stieltjes transform of the spherical equi- 


* Haviland [8], I. a 
$ Cf., e. g, Wintner [20], pp. 316- 319; Tab and Wintner [91, P. 55; Wintner [21], 
p. 76. © . 
°? For more refined approaches, cf. e. g., Boltzmann [2], vol. 1, chap. 1. 
19 Schoenberg [15], p. 816; ef. Blumenthal [1] , 
# Schoenberg [14], p. 791; cf. Blumenthal [1]. 
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distribution of radius r is J*y,.(r|u|)/J*yn.(0). Actually, Schoenberg’s 
considerations 1? concern all functions which are Fourier-Stieltjes transforms 
of radically symmetric n-dimensional distributions for every n. Since Borel’s 
approach to the normal distribution law also has escaped Schoenberg, he redis- 
covers ** the spherical approach to the Gaussian law by applying to the Bessel 
` functions, mentioned above, the continuity theorem! of Fourier-Stieltjes trans- 
forms, instead of proceeding.directly as Boltzmann, or Borel and Lévy do. 
And he applies the same method to the spherical stratification formula of 
Cauchy-Poisson. Now, it will be seen in 8 2 that the intuitive and elementary 
method ‘may be transferred without much effort to this general case of an 
arbitrary Stieltjes factor of spherical stratification. In particular, the method 
of Fourier-Stieltjes transforms, which is so fundamental in most problems of 
mathematical statistics, turns out to be a ballast in. the present case. 


1. Let ¢,(#,) denote a distribution function on the n-dimensional 
Euclidean space R,, that is, , is a completely additive, non-negative set 
function defined for all Borel sets #, of Ry in such a way that ¢(#.) — 1. 
It will be supposed that $n is radially symmetric, i. e, (E'n) =¢(En) if 
i’, is the image of any Borel set #, under an arbitrary rotation of the space 
R, about the origin. For k—1,2,---,n—1,,a k-dimensional radially sym- 
metric distribution function x(Æx) can be associated with $,(Æ,) in the 
following manner: 

Let Rr be a k-dimensional hyperplane through the origin of Rn, and Er a 
Borel set on Ry, finally P, (2x) the set of those points in Ra whose orthogonal 
projection on. Rx is in Hy: Then a distribution function x is defined by the 
relation ** 

(1) be (Ee) = bn(Pa(Ee)). à 


x is called the k-dimensional projection of #, ; in virtue of the radial symmetry 
of on, it is independent of the choice of the hyperplane Ry. Tt is clear from 
- this definition that de is also the k-dimensional projection of ¢,, for 
j oa i + losin | G 
The set functions ¢1,° * -,¢n of radial symmetry may be replaced by the 
non-decreasing point functions p;(r),: * *,pn(*) which are defined for 
OZ + < œ as follows: i , 


(a) > px(r) — (Fi), f#>0;  px(0) —0, 


aa Saab [15], pp. 816- 821. 

14 Incidentally, Schoenberg’s [15] gentral ous (2.4), which ia due to Laplace, 
may be found on p. 421 of Watson’s Treatise on Bessel Functions. 

4 Cf. Jessen and Wintner [9], p. 55; Wintner [21], p- 76. 
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where Ex" denotes the k-dimensional sphere of radius r about the origin of Re. 
In the case k — 1, the function p, is usually replaced by the symmetric dis- 
tribution e(z), — © < z < + ©, e 
(2 bis) o(t) ps), . —æ<r<+e, — 

where Æ,(x) denotes the half-line (— ,2) on R,. Obviously, these distribu- 
tion functions pr, o satisfy the boundary conditions 

(8) —pe(0) = 0, (+ 0) —1,  (pr(r) = 0 for — œ <r<0), 
(3bis) o(—o)=—0, o(+) —1; 
‘also, in virtue of the symmetry of ¢:, 

(4) =: o(— z) = 1 — e(r). 
. Inthe sequel, the functions pk e k <n) and o also will be referred to as 


projections of dx. 
It is clear that the relation between px and pris given ‘by the formula 


i 00 ir 
(Bm) (r) pt) + Bab S E f (c080)**(sin @)"#rarldps(t), r > 0, 
rv are cosr/t 
where | 
(6) Bt = A,' As: E te cee i ++ Ab, 
and 15 i f 
Doo. Ayo [ f sin ada] = ~ (2x) 3, 
6 


since the integral Bef (cos 6)*+(sin 6)**"dé in (5yz) is that portion of the 
(n —1)-dimensional area of the boundary of the sphere of radius ¢ (>r), 
ALU projection on the hyperplane Es is on the sphere Ex". The relation 

“between o and pr is given by l 


(8). a(z) = 3 + 4p: (2), where æ>0; cf. (4). 


The formula (ör) may be rewritten ay introducing the distribution ` 
function 


w/z i 
(9) Yne(T) == B of .(cos 6)* (sin 0)**"d6, if OS rS1; 
arc cos r ' 


KA: if r > 1, 


(a function which obviously bears the same relationship to the k-dimensional 
projection of the n-dimensional spherical equidistribution of radius 1, as the 
function (2x) does to gx). The formula x) then becomes 


15 Cf, e. g., Borel and Deltheil [5], p. 135 and p. 187. 
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(10) pelt) = f palid) r> 


In view of (2%), the formula (10) represénts, in terms of the arbitrary 
Stieltjes weight factor p(t), the stratified decomposition of an arbitrary 
n-dimensional distribution function of radial von into equidistributions 
on surfaces of spheres of varying radii. ` 

While it is obvious that the distribution functions px and dr determine 
each other uniquely, and that px is uniquely determined by pn, it is not so 
obvious that p, is uniquely determined by px. That such is the case, never- 
theless, may be seen by considering formula (10) as a convolution on a 
logarithmic scale; the uniqueness of p, then follows from the uniqueness 
theorem of Fourier-Stieltjes transforms. This remark, depending on Fourier- 
Stieltjes transforms, is not used in the sequel. 

For the sake of brevity, a 1-dimensional distribution function which is | 
the projection of an #-dimensional radially symmetric distribution function 

for arbitrarily large n, will be called a distribution function of class Q. 


2. On the basis of the elementary geometrical relations collected above, 
it is easy to prove that a distribution function e(z), — œ < x < + œ, is of 
class Q if and only tf there exists a distribution function r(t), — oo <t < + 0, 
such that 


(11) Oye: aeaa 
and PER | ‘ 
2) o ses f “ot (2/t) a(t), 


where o* (x) i8 the symimeine normal distribution function, of unit standard, 
deviation, i.e., 


(13) > o* (a) = (27) f ei dy. 


In order to prove this, suppose first that o(x) is a distribution function 
of class 2, Then there exists a #,(Æ,) and corresponding functions (2,), 
(21), such that (5,1) and (8) hold for n—1,2,- : -. If n is fixed, these 


relations may be rewritten in the form : ° 

| œ. s/t A 
(14) o(2) =E fonla) + Aant: f g (1—y'/n) dy dpa (rt), 
| wig © “> 0, 


if one changes the integration vatiables from 6 to y = mh cos 9 and from t 
to nit. 
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According to the selection theorems of Hely, there exists a non- 
decreasing function +(¢) and an increasing sequence {mn} of positive integers, 
such that, asn— œ, 

(15) pm, (màt) > T(t), 


where the sign — is meant in the sense of theory of monotone functions, 
(i.e, in the sense that one has convergence at every continuity point t of 
the limit function r). It is clear from (15) that 


(16) - pm, (£) —> 7(+ 0), for all z > 0, as n— oœ. 
In view of the term-by-term integration theorem of Helly,’® it is also 


clear from (15) that 


(17) Jo (2/8) png matt) > [ot (a/t)ar(t), 
if & > 0 is fixed and e is an arbitrary positive number such that t= e« is a 
continuity point of z(t). On the other hand, since 
(18) Ann (27) #, 
holds in virtue of (7), and since 
(LaF /n 80) Se, 
holds uniformly for | y | S const., where const. is arbitrary but fixed, one sees, 


by choosing const. => 2/e, that 


w/t 
Amd f "(1 —y'/nidy > o*(2/t) —4 
0 
qolds uniformly for «Si < œ. It follows that 
œo alt 
(19) Aster à f f (1— y?/tn) A) dy dom, (mnit) 

€ 4/0 

> fot @/t) — art) 


holds for every > 0 and every «> 0 (such that t =e is a continuity 


point of 7). 
Furthermore, if x S 0 is fixed and ¢ = nr, then, by (7), 


w/t ` ni 
ans f (y/n idy S Ayn f(A — g/m) dy =h; 
0 o 
so that l f 


16 Of., e. g, Wintner [22]. 
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ama f i (1 —y"/n 8 dy-dpn (nit) Lon (ne) — pa (2)]. 
Hence, | 
e a/t 
(20): lim sup Ant f fO (1—g2/mq) dy don (mat) 


nee? < &fr(e) —7 : 
Finally, from (13), = = ae 


eD S [o*(2/t) Har) LE fl art Hr) — 74091. 


The relations (14), (16), (19), (20), (21) obviously imply 


(22) o(2) =+ (40) + ff “[o*(2/t) — Hdr(#), if 2 > 0; 


while (11) is a consequence of (3 his), (15) and (22). But (22) is equivalent 
. to (12) in virtue of (11) and (4). This proves the second half of the state- 
ment italicized at the beginning of this section. 
In order to prove the converse, notice first that if the distribution function 

r(t) in (12) is the function r*({) defined by 

(23) or) ent), 

the corresponding distribution function ø in (12) is precisely o*. It is well 
known that o* is a distribution function of class ©; in fact, the function 
o==o* is known to belong, in virtue.of (5m) and (8), to the function 

+ : 
pn == pn = ; 


(24) p*na(r) = f i (2m) he dy, where Sy <T° 
: k=1 : 1 be 


(Gauss, Bravais, Maxwell; also Schoenberg*’). Hence, it is seen +8 that the 
function (12) is the 1-dimensional projection of the n-dimensional radially 
symmetric cons function ¢, belonging to 


27 ' Schoenberg [15], p. 817, (top). 
129 This statement is an obvious consequence of (10) and the fact that if Try Tay Ta 
are three distribution functions such that 7,(0) — 7 (0) =a(0) = 0, then 


S 1 (@/t) dra(#) =f” ra(a/t)drs(2) 
= o 
and j 


s n (a/t) af” nemani f° op” eller ON 
o 


The first of these relations is merely an integration by parts; the second clearly is true 
if r, is a atep-function, so that the relation holda in on in view of the definition 


of Stieltjes integrals. 
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f wo 
(25) par) = f oPa(s/t)ar(t) 
in virtue of (2,). This completes the proof of the italicized statement. 


8. In the sequel, it will be nevessary to make use of the fact that if o 
is a distribution function of class Q, the function r occurring in (12) is unique. 
This is easily proved by using Fourier-Stieltjes transforms; cf. the remark in 
$ 1 concerning px and pn. Incidentally, the uniqueness of r, when combined 
with standard application of the theorem of Helly, obviously implies that (15) 
is valid without applying any selection, i.e., by placing Mn == n. 

The problem of replacing the sheaf of normal distribution functions 
o*(a/t) in the representation (12) of a distribution function of class Q by a 
sheaf of arbitrary distribution functions w(z,y) of class Q will now be con- 
sidered. Let r(t,y) denote a function which is defined for OS t< œ, 
OZ y< œ% in such a way that, for every fixed t= 0, r(t,y) is a Baire func- 
tion of y, 0 Æ y < œ; and that it is, for every fixed y= 0, a distribution 


function, i.e., a non-decreasing function satisfying the boundary conditions | 


7(0,y) =0, r(+ «©,y) = 1. Let w(z,y) denote, for a fixed y, the distribu- 
tion function of class Q corresponding to r(t, y) in virtue of (12), so that 


e) way) — f o*(z/t)dir(t, y). 


It is clear that w(z, y) is, for a fixed z, a Baire function of y (20). As above, 
-it can easily be shown that if é(¢), — œ < t < + œ, is a distribution func- 
tion satisfying (0) — 0, then the distribution function ~ 


e27) o(z) = f olz al) 


is a distribution function of class 2. On the other hand, it will be proved that 
tf o(x) is a distribution function of class Q associated with the function r(t) 
in virtue of (12), then o(x) has a representation of the form (2%) if and only 
tf there exists a distribution function E(t) such that §(0) == 0, (4+ œ) —1 
and i 


. (28) 7 am S rydt). 


Suppose first that o(z) has a representation of the form (27). Define a 
sequence of distribution functions r™(t), which tend to £(t) as m —> and 
which are of the form gn Ae à 


m(t) = 2 tina" (t/ha), (aim > 0, him > 0, À ain — 1), 
=1 
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where r* is defined in (23); so that by (27) and the definition of Stieltjes’ 
integrals, 


(29) o(z)—lim f ? a(z, 8) dr™(t) = lim S ajmo(z, hem) 
M->0O «/ 0 m> il 


= lim Zain f ot (z/t)dir(t, him) 
0 


m> 41 


lim f"ot(e/t)ael_{~(t,9)dem(y)] 


| = [bat [rt ydo). 


Hence, (28) follows from (12) in virtue of the uniqueness of the distribution 
function r in (12). And also the converse of the italicized statement follows 
from (29), since the preceding steps are obviously reversible. 

Suppose that the sheaf of distribution functions w(x, y) has the property 
that, for some fixed L >.0, the function o*(x/L) may be represented in the 
form (27); so that there exists a distribution function é,(y), such that 
£L(0) = 0, (+ 0) =1, and 


(30) o*(2/L) = Í. * w(x, t)dér(t). 


Then, by the italicized statement just proved, the function r == r*(t/L), which 
corresponds to (30) in virtue of (12), satisfies 


P (31) 7*(t/L) = f(y) dey). Les is A 


Let y = T? denote an arbitrary point in the spectrum? of é,(y), and let 

t<L,e>0. Then, by (31), (23), 
TL+e $ 

Om s*(t/L) =f r(ty) ably) 

TL-e X 

= TL — É(TL — i ty). 

2 [e(Te e) e(Te] finint ay) 

Hence, : 

lim infr (t, y) =0 if t< L. - 

TL è 


It follows that there exist a distribution function r,@), — œ < t< + œ, 
and a sequence of positive numbers T, such that Ta — TL and such that 


T(t, Tn) >ti (t), a8 n— œ; finally, 1(t) =0 if t< L. 
Thus, by the term-by-term integration theorem of Helly, 
PEE EAN EES e . 


1° A point is said to belong to the spectrum of a function if the function is not 
constant in any interval containing this point in its interior. 
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lim w(2, Tn) = fe) an), ` (Tu TD). 


Tt is similarly shown that there exist a sequence of positive numbers 7”, such 
that T'a > TL, and a distribution function 7.(/) such that 7.(t) —1,ift>L 
and r(t, Tn) — raft) as n— 0; so that 


lim w(2, T’x) — {“o(a/t)dra(t), (Ta TL). 
n> ` Q 
. It follows that.if the distribution function w(x, y) tends lo w(x, yo) as 
Yy —> Yo (in the sense of monotone functions) for every yo. 0S Yo < ©, then 
every distribution function of class Q may be represented in the form (27) 
if and only if there exists for every L > 0 at least one T = TT such that 
w(r, T) = 0*(x/L), —% < T < %. | O 
Suppose, in particular, that the sheaf of distribution functions (2, t) 
is of the form w(x, t) == w(x/t), where w(æ) is an arbitrary distribution func- 
tion of class 2; so that, by § 2, 


w(2) = fo (a/t)dro(t) 


holds for a suitable distribution function r == ra satisfying (11). A distribu- 
tion function o which may be represented by means of w in the form 


Hu o(s) = f o(/D) 220, 
0 


where é(¢) is a distribution function satisfying é(0) — 0, will be said to be 

of class Q(w). In this particular case, the preceding results are seen to be. 
‘to the effect that a distribution function (12) ts of class Q(w) tf and only if: 
œthere exists a distribution function £(!) which vanishes at t — 0 and satisfies 


(32) r(t) = f” rolt/y)ae(y) 


furthermore, every distribution function of class Q ts of class Q(w) tf and only 
if there exists a positive number T* such that w(x/T®) — (x); — œ < z 
<+ œ. (The italicized statement of § 2 is the particular case T” = 1). | 

If, in addition, uge is made of the Stieltjes-Fubini relation which is the 
second formula of footnote 18, one sees that if the distribution function o(s) 
is of class Q(w) and if the distribution function p(x) is of cluss Q(c), then 
p(z) is of class Q (w). 


4. As an application of these statements, consider the symmetric stable 
distribution functions; that is, the distribution functions whose Fourier- 
Stieltjes transforms are exp(— | u |7), 0 < y 2 (the distribution function, 
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whose Fourier-Stieltjes is identically 1, also is symmetric and stable, but will 
be excluded as trivial). It is known * that these distribution functions are 
of class Q. Let oy(x) be the distribution function whose Fourier-Stieltjes 
transform is exp(— | u |Y), 0 < y& 2. Thus, there exists a distribution func- 
tion r(t) such that 77(0) == 0 and 


(33) oy(2) = [ot (2/t)drv(t), where of — os, 


if o* denotes the same distribution function as in (13), except that the unit 
of x is different, 


, ate 
(13 bis) o2(&) = (Rr) 74 f eh dy. 


The relation (33) implies that 


(34) exp(— |u|") = f° exp(—| ut |*)dr¥(t), —ao<cuc+o. 


This merely states that the Fourier-Stieltjes transform of the function on the 
left of (83) is the same as the Fourier-Stieltjes transform of the function on 
the right. 

On replacing | u | by | u |67, 0 < BS y£ 2, and changing the integra- 
tion variable from ¢ to (4/7, one can write (34) in the form 


(85) exp(—| |?) — [ep | ut |B) dry (tP), — © Lu < +, 
or 


(36) osla) = f° oa (a/) (t), O<BSyS2 


Since B, y are arbitrary (0 < 8 S y 2), this relation is equivalent to the 
first half of the statement: oa (z) is of class Q(og) if and only if a = B. 

To prove the second half of this statement, suppose that ca is of class 
Q(og), O0< B<eS2;3 so that there exists a distribution function rag(t) 
which vanishes for t == 0 and satisfies 


e 
3 Wintner [21]. This result was rediscovered by Schoenberg [16], pp. 532-533 
(cf. Blumenthal [1]), who used methods equivalent (cf. Haviland [8], II, p. 382) to 
those applied loc. cit. [20], where the proof, in fact, was based on the multidimensional 
analogue of Lévy’s continuity theorem (cf. Haviland [8], II). Incidentally, cf. Wiener 
and Wintner [19], pp. 241-242. | | 
It may be mentioned in this connegtion that Thearem 3 of Schoenberg [16] is merely 
a corollary of the classical representation of the infinitely divisible laws which is due 
"P. Lévy (who, in fact, does not assume the symmetry of the distributions). 
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(37) ca(t) = f ” op(2/t) drag (t). 
0 
Then, by the same reasoning which deduced (36) from (33), 
(38) o2(z) = 0% (£) = Jf, casa (8/1) arent). 
i $ 0 


This would imply that there exists a positive number T = T'(csg/a) such that 
Gsp/a(0/T)=00(z), or exp(—| uT /4)==exp(—|u|*), — o <u < + o. 
This contradiction establishes the theorem. 

The theorem just proved and the last italicized theorem of g 3 imply that 
if a < B, the class Q (ca) is a proper subset of class O (og). 


5. The standard methods described at the beginning of the Introduction 
represent asymptotic approaches to the .normal distributions. Another ap- 
proach to these distributions is connected with the well-known fact that the 
stable distribution oy has a finite standard deviation only in the normal case 
y= 2. In what follows, there will be considered still another approach to 
the normal distributions of radial symmetry. This approach might be of 
interest in view of Maxwell’s deduction of his distribution law of velocities. 

Let ¢, be an arbitrary radially symmetric distribution function on the 
Euclidean space Ry. Let x be the k-dimensional projection of n, finally 
pu, px the corresponding functions (2a), (2x). Since the function (9) is 
absolutely continuous, it follows from (10) that if k < n, then pn is absolutely 
continuous on the open half-line (0,-+ œ); so that. 


(39) p(z) —pe(-+0) +" (don(r) /dr)dr, 2 > 0; (keds nd). 


æence, the k-dimensional distribution function dx, where k= 1,- >- ,n— 1 
may be decomposed into a linear combination of two k-dimensional distribution 
functions pl, ox! of radial symmetry, 

(40) pe = Ade! + (1—d) dv", sasl; (k = 1, -+,n—1), 
where px (Es?) — 1 if Fr’ denotes the Borel set consisting of the single point 


which is the origin of Rr, and ¢x” is absolutely continuous; so that there exists 
a non-negative function : 


ds = 8: (21, ° À. *, 2%) = & (| a+... + mi |) 
of the position (x,,: : >, T+) in Ry for which 
(41) ge! (Ex) — (lat tse ae? Bde des 
.It follows from (2+), (39) and (40) that À = px(+ 0) = pa(+0) and 


‘ 
4 
K 
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(42) Eo paz ren FL, (ior almost'all #50), 
(2r)#/T(3k) being the Euclidean measure of the boundary of the k-sphere 
of radius 1. 

Define, for 0 < r <, œ, a Sa T function, ver), k==1,---,n, 
by placing . 


(43) ae v(t) =] -f7 ATES r> 0. 
Thus, for k= 1,- - - n, 

7 (Rar) i -1 - : 
(44) a(t) = 1—0 eG atdy(z), © r>0; 
also, for k = 1, + ,n— 1, fin 
(45) dk (1) = dvx(r)/dr, (for almost all r > 0) 


The relation (45) is meaningless for k <n, unless the arbitrary function #, 
has a decomposition similar to (40), implying the existence of a 84. 

It will be shown that tf ọn is an n-dimensional radially symmetric dis- 
iribution function such that for some fixed k (0- < k <n), and for some pair 
of positive constants c, C, one has 


(46) va(or) == Clynx(r), 17>, 


then $, ts a radially symmetric normal distribution except for a possible jump 
at the origin. This means that ¢, may be written in terms of a non-negative 
constant À = 1 in the form 


(47) Pn = Ada’ + (1—A)gn, OSASI, i 
where s! (E°) — 1, and $” is a radially symmetric n-dimensional normal 
distribution. : 


In order to prove this theorem, note that, in view of (43) and (46), the 
absolute continuity of pax for r > 0 implies the absolute continuity of vex, vn, 
px for r > 0. .Thus, under these conditions, equations similar to (40), (41), 
and (45) hold for k =n. Since ¢y-+” is the projection of ¢,”, it is clear that 
(48) Bat) af e (PHH Ha l)du > - da; 
so that E 
(49) lr = f ae setae: saa? [Ade 


if 6(r) denotes the common value of 8,(r) and 08, (r/c) ; cf. (45) iia 
(46). Since ô is a density, repeated untegralion: of 69 shows that 


eee AREAS ani dm den < oo, (820) 
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for every positive integer m. It follows, therefore, from (48) and (49) that, 
up to a factor of proportionality depending on m, the functions C*8(r), cè (er) 
are the densities of an m-dimensional radially symmetric distribution function 
and its (m— k)-dimensional projection, respectively (unless 8(r) — 0 for 
every * > 0; but tbis trivial case may be discarded, for in this case the state- 
ment (49) is satisfied by A = 1). Thus, one can introduce a 1-dimensional 
distribution function by placing 


(50) ele) = f" adra f aly lay, —o<z<+ o. 


Clearly, this o(æ) is, for m = 1,2, - -, the projection of an (mk+1)- 
dimensional radially symmetric distribution whose density is proportional to 
8(r). Consequently, by (12), 


(51) Jaj {- o*(a/l)dr(t), ~—ocecta, 


where 7(/) is a distribution function satisfying (11). Since (50), (51) and 
(13) imply that 


Ge) 8 D/L Day (ei ft exp(— 4/0) dr(4), 
it follows from (49) that . 
Bel |)/ f 8C y Ddy = (en)a f? as exp(— 4r) dr (2). 


On integrating this relation between r == — œ and 1 = %, one sees from (50) 
and (13) that 


o(cx) = O(a) # {© oF (2/t) de(t). 


It follows, therefore, by comparison with (61) that 


(53) r(cy) = CC) f” Här(t), OSy< oe. 
0 
But it is clear that (53) cannot hold unless c == 1; in which case 
v(t) = *((2e) #02), 
where r* is defined as in (23). Hence, from (62), 
8(1rD/ f aly Ddy — LC exp(— 0/4). 
This completes the proof of ane last icad statement. 


6. It is clear from the ped and Ko the Helly theory of monotone 
functions that the theorem just proved may be generalized as follows: Let 
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o(z) be the 1-dimensional projection of an n-dimensional radially symmetric 
distribution function w, n—1,2,: : -. Let va(r) be the function (43) 
associated with pr, and: suppose that there exists a non-decreasing function 
v(r) which ts the limit of the sequence of functions 


{Dn(r) — va (+ 0) ]/[1 — (+ 0)7} 
in the sense of the theory of monotone functions. Then there exist two con- 
stants c, C (0 < C,0ScS 2C/m) such that 


D Se Í " exp(— Cr°) de. 


Tt is also clear from the above proof that an absolutely integrable solution 
8(r) of the equation (49), i.e., of the Abel integral equation 


c3(cr) = CE Í Va) (1° — reji t-dg da, 


exists only if c == 1; in which case it is proportional to exp (— 0°r?/4r). 

7. The italicized statement of $ 5 implies a characterization of the 
n-dimensional distribution functions which are product distributions with 
respect to every coördinate system. | 

In § 1, the k-dimensional projection of an n-dimensional radially sym- 
metric distribution function was defined by considering a k-dimensional hyper- 
plane Rs through the origin of the n-dimensional Euclidean space Rn. Because 
of the radial symmetry, the projection was independent of the choice of the 
hyperplane Rx. It is clear that if one considers an arbitrary n-dimensional 
distribution function #,(Æ,) (not necessarily of radial symmetry), one can 
obtain a sheaf of k-dimensional projections yx(Ex; Rx), where the argument 
of the distribution function y» is a k-dimensional Borel set Hy on the hyper. 
plane Rx on which #,(Æ,) is projected. An n-dimensional distribution func- 
tion yw(En) is said to be a product distribution *4 if there exists a positive 
integer k <n, a k-dimensional hyperplane Ay and its (n— k)-dimensional 
normal hyperplane Rn- such that if En == Ex X En x is an n-dimensional Borel 
set whose projections on Rr, Rua are Ex, Bnw, respectively, then 


(54) Un( Bx X Enx) = Yu ( Be; Be) bua (Enz; Rae). 
e 
The Fourier-Stieltjes criterion for #, to be a product distribution is that there 


# This is slightly more general than the usual concept of a product distribution, in 
which (54) is replaced by 
; Ya (EL X ... X En) =, (83; R) HE y, (En; Ra), 
where Riyas Ra, are n mutually Perpendicular lines. A distribution which is a 
product distribution in this sense is clearly & product distribution in the sense of (54), 
but not conversely. ; l 


y 


774 PHILIP HARTMAN AND AUREL WINTNER. : 


exists at least one rectangular coördinate system in R, with reference to which 
the Fourier-Stieltjes transform Au, : -,u,) of ẹya can be written as the 
product 


(55) A (Us te, * ` " Un) = Au, ,uxr, 0,° -+,0)A(0,-- "3 0, ter, * ` Un). 


(In this coérdinate system, the hyperplane Ry in (54) is defined by the 
equations Ten = 0,° > +, ty = 0). : 

It will now be proved *? that tf ya (En), where n = 2, is an n-dimensional 
distribution function, then, for a fixed k, (54) holds for every pair of or- 
thogonal hyperplanes Rr, Rax if and only if either Ya (En?) — ] or there exist 
constants a (> 0), bı, + +; bn such that 


(56)  ya(Es) — a ep[— a à (&— by) dm : - dap. 


` It is understood that H,° denotes the Borel set consisting of one PEN in En 
(not necessarily the origin). 

The first half of the theorem is trivial. In order to prove its second hate 
let P — (a, - +, un) be a point in the space of the Fourier-Stieltjes transform 
A(t," © *, un). Then the assumptions of the theorem imply that. 

(57) : A(P) = A(PE)A(P**), 

where P*, P*-* are the projections of P on an arbitrary pair of orthogonal 
k- and (n — k)-dimensional hyperplanes through the origin of (th, * >, Um)- 
space, respectively. 

Suppose first that the distribution function Yn is symmetric with respect 
to the, origin of Ra, i. e, A(t,’ © +‘, Un) = A (— th,’ ` -,—u,). It will be 
shown that yẹ» is then of radial ea In fact, let P, Q be two distinct 
- epoints on any sphere with its center at the origin O of the (w:,: : -, un)-space. 

‘Consider the plane POQ, the pair of lines which bisect the angles formed by 
the lines OP and OQ, and a pair of orthogonal hyperplanes containing these 
` lines and having the dimension numbers k and n— k, respectively. Then 


A(P) —A(P*)A(P**) and A(Q) = A(Q*)A(Q**), 


where PE, Pat, (Qt, Q*~* are the projections of P and Q on these hyperplanes, 
respectively. It is clear that if the points P*, Qt do not coincide, then they 


22 This problem was considered by Maria-Pia Geppert, “ Una proprieta charatteris- 
tica della distribuzione de Bravais,” Giornale dell’ Istituto Italiano degli Attuari, 
vol. 7 (1936), pp. 378-391. Her considerations were recently rediscovered by M. Kac 
[10]. Actually, the final result of Kac is incorrect, since his conclusion is that either 

_Ÿ, has a jump of 1 at the origi’ or (56) mud hold with b =6b,---=b,=0. In 
the case of polar symmetry, Kac used Cauchy’s functional equation, which will now he 
avoided by applying the theorem of § 5. 
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aré situated symmetrically with respect to the origin O; the same holds for 
the pair P», Q+, In virtue of the polar symmetry of y», it follows that 
A(P) = A(Q), which establishes the radial symmetry of y. 

Since yr(Er; Rx), Vu &(Enx; Rnx) are projections of the radially sym- 
metric distribution yn, they are absolutely continuous on Rs, Ra», respectively, 
if the origin is removed. It follows, therefore, from (54) that yẹ» is absolutely 
continuous on À, with the origin removed, and that the density 8,(2:,° ``, 2n) 
of Wn then is the product of the densities 8,(21,: °°, Ze), Sx-u(Zma,’ °°, Zn) 
of yr and yx. Consequently, the radial symmetry of y» implies that there 
exist a function (€), — œ <T < + œ, and two positive constants c, ce 
such that 


S(Iat+- at E EE B)= cide (T ta) 


and ‘ l 
B(| Trn + H ant |È) = Cône (Tess * + Zn). 


Thus, it is easy to see that the assumptions of the theorem of $ 5 are 
satisfied; so that y» is a radially symmetric normal distribution except for a 
possible jump at the origin. However, the product condition implies that the 
jump at the origin is either 0 or 1. This concludes the proof of the last 
italicized statement in case #, is symmetric with respect to the origin of Ry. 

In order to complete the proof, consider the n-dimensional distribution 
function ¢,(#,) whose Fourier-Stieltjes transform is _ 


Au,” ++, Un). A(— us: ty — Un). 


Then ġn(En) is symmetric with respect to the origin and satisfies the condi- 
tions of the theorem. Hence, #,(Æ,) either is a radially symmetric normal : 
distribution or #, has a jump of 1 at the origin. Thus,* | o 


A(t, © ; tn) A(— 0, : : y — Un) — exp[— af (1n? + - +b tts?) ], 
f 0<a< oa. 


It follows that A does not vanish for any (u, **, Un); so that 


(58) Aus, ---, tn) = exp[— at (m? -H> - =- Un?) + g(t,- -p tm) J 


holds for a suitable continuous function g(u,'*-,14) which satisfies the 
condition ee. 
gs p Un) = — g(— Us; —Un). 


2° The balance of the proof could be based (cf. Kac [10]) on an application of a 
theorem formulated as a conjecture by®Lévy and subkequently proved by Cramér [6]. 
But this rather deep theorem, for which only a complex function- theoretical proof. is 
available today, may be avoided in this case. 


776 PHILIP HARTMAN AND AUREL WINTNER. 


It follows from (57) and (58), by choosing 
P= (us: A ~, un), P: = (ta; 0, + d 00e Prk wm (0, Ua, ° d Un), 
that g(t, - °, Un) = g (u0, © -,0) + g(0, ua: - “sinj le, 
(59) JLU > `, Un) = gi (t1) H: > + gaun), Where 
giltu) = 9(0,- < +, 0, u, 0," + -,0) 
is a continuous odd function of ui. 
On applying (57), (58) and (59) to P = (w? + v°,0,: + -,0), 
PE == (uf, ur, 0,---,0); Pr (v?,— w, 0,: -,0), 
-one obtains oo 
| gi (uw? + 0°) = gi (w) + gr), 
if use is made of the fact that gs is odd and gi(0) 0. This implies that 
there exists a constant c, such that g,(u?) == c,u?; so that, since g, is odd, 
fu) = cu. Similarly, ga (u) = cju for j == 2,---+,n. Hence, (58) reduces, 
in view of (59), to | 


n 
A(t, es a Un) = epea (au; — cy) ]. 
by =1 


But since A ig a Fourier-Stieltjes transform of a distribution function, 
| A | <1, so that the constants cy are purely imaginary, i. e., cj == iby and 


(60) Athy: ++, tm) = exp[—3 (atuj*—ibyus)], OS a< o. 
R st 


Since (60) is known to be the Fourier-Stieltjes transform of an n-dimensional 
distribution of the ‘particular type mentioned in the theorem, the proof is 
complete. 


8. For a positive number p which need not be an integer, let Sq? be the 
solid. characterized by the inequality 


(61) SP: Sa (PS 
gal 


in the Euclidean space Rn: (2,° © °, 2n). It will be shown that tf àn? (£), _ 
— œ < r <+ mo, denotes the one-dimensional distribution function which 
onè obtains by projecting on a codrdinate axis of R, the n-dimensional egui- 
distribution on S,r, the density of probability of An? is 


const. (1 — | 2 |?) -9/2 if | s| < 1, where 
‘spr (1 -+n/p) 
d donnons PEER 2 
AE ie e ° r(t) (1+1) 
. P P i 
0,if|z|>1, 
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In order to prove (62), let n and p be fixed, and let s denote a continuous 
parameter which varies between 0 and 1. A straightforward homogeneity 
consideration shows that the n-dimensional volume of that infinitesimal portion : 
of the solid (61) which'lies between the two hyperplanes 2, = s, t, = s + ds 
is proportional to (1—s)ds, where q = (n—1)/p. Since the whole solid 
(61) is contained between the two hyperplanes 2, — + 1, and is symmetric 
with respect to the hyperplane s, = 0, it follows that (62) holds for some 
const. > 0. Finally, the value of this constant is obvious from 


1 R es 23 (n—1 1\ > 
1—|z oder f 1 — y) YD Ay/r-1g. -2s( 1,2), 
IX | 2 | Fig ne ee 


and from the fact that the total probability represented by àn? (z), — co < x 
< + œ, is unity. This completes the proof of (62). 

A corollary of (62) is that if Sy?(1) denotes, for a fixed r > 0, the solid 
which one obtains by writing 1? instead of.1 on the right of the inequality 
(61), the projection on a codrdinate asis of the equidistribution on Sq? (n/P) 
tends, as n—> œ, to the distribution function which has a density proportional 
to exp(— | z |P/p) for — œ < x < + ©. 

In fact, if p is fixed and n — œ, then 


(nf) 

















(Stirling) ; while 
: £ p (n-1)/p | f | g |» 
lim (1— ne ) — exp (— Lt) for — œ << +0. 
Hence, from (62), 
; a ` . 
E z \ _exp(—|2|?/p), | 
(65) Ing (=) apara p TOSSE 


But it is clear for reasons of homogeneity that, if r > 0, the distribution func- ' 
tion 1 Ay? (z/r), — co < x < + œ, belongs to SP(r) in the same way as 
An? (z) belongs to 8,9 Sa” (1). Hence, (63) is equivalent to the last 
italicized statement. i 


Remark. If Læ (u), — œ < u< + œ, denotes the Fourier transform 
of àP (z), — œ < s < + œ, then, according to (62), 


os cP fuerte 
P aP ' 


x 
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It is seen from the integral definition of the Bessel functions J se) that 
(64) reduces for p == 2 to 


(65) La (u) = Ty(u)/T (0), 


if J*,(w) denotes Jy(| u |)/| u|”. On the other hand, if L,*(u) denotes the 
Fourier transform of the distribution function A,*(z), — œ < æ< + œ, 


which belongs to the equidistribution on the boundary S. pik 1 of 


Bat: 3 z; = 1 in the same way as A,”(x) belongs to Bn? itself, then, as men- 
ja 
tioned in the Introduction, it is well known. that 


(66) | LE (u) = J* in- (4) /J* in: (0). 


Since comparison of (65) and (66) shows that Lm, (u) = La?’ (u), it follows 
that | CU 
(67) ua (7) — Àn’ (2), —o <T<+o; (n == 1,2, - +). 


In other words, the distribution which is the projection on a diameter of the 
equidistribution on the interior of the n-dimensional unit sphere is identical 
with the distribution which is the projection on a diameter of the equidistribu- 
tion on the boundary of the (n + 2)-dimensional unit sphere. (Needless to 
say, this fact may be verified also by calculating the volumes of the spherical 
- segments involved.) Actually, the explicit relation (67) may be interpreted 
as an essential refinement of a known phenomenon in functional analysis; * 
that is, of the fact that, as n—> œ an overwhelming portion of the sphere 
T? +: +H a? < 1 concentrates on its boundary m? p> + ++ 2,7 = 1, 
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ON UPPER LIMIT RELATIONS FOR NUMBER THEORETICAL 
FUNCTIONS.* 


By PHILIP HARTMAN and RICHARD KFRSHNER 


There are, in the literature, several results on the limit superior of number 
theoretical (i.e., additive or multiplicative) functions, giving results of the 
‘following mae 


(1) lim sup f(&)g(x) = 1, 


where f(x) is a number theoretical function and g(z) is elementary. All these 
results have in common the fact that the functions f(x) and g(a) considered ` 
are of such a nature that 


(2) lim f (tn) g (ta) == 1, 
n>% 
. where 
Ta = Pip2' * * Pn 


is the ere of the first n primes. 


The purpose of this note is to delimit a simple class of functions for enti 
results of this nature can be obtained. This possibility was suggested to us by 
Professor Wintner. The greater portion of the paper will deal with additive 
functions; although multiplicative functions may, of course, be treated by 
applying these results to their logarithms, this consideration leaves something 
to be desired, since from 


j log f(x) S (1 + 8)/g (7), z > X(8), 
one can | infer only 


fe) S exp [(1 + 8)/9(2)], z> X(è),. 


and not 
f(æ) < (1 + 8) exp (1/g(x)), z >X (8), 


which would be needed to prove a corresponding limit relation. Corre- 
spondingly, the direct treatment of the multiplicative case seems to be more 
difficult than that of the additive case, and we were unable to establish for 
- multiplicative functions a result of generality comparable to that obtained for 
additive functions. Thus we have confined the consideration of multiplicative ` 
. O 
* Received March 14, 1940. 
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functions to one very simple case; which, however, does imply the known limit - 
result for the Euler ¢-function. 

The treatment will be based on a very simple lemma stating the Tauberian 
conditions needed in order to infer (1) from (2). 


Lemma. Let f(z), V<a<-+ œ, be a real-valued function of the 
integer z, and let {rz} be a sequence of integers, with the following properties: 


(i) 0 < Tr < Tea; (hast oe <<), 
(ii) 1% ©, as k> ©, | 
(iii) f(t.) /f (te) — 1, as k> o, 


(iv) fon every 8 > 0 there exists an N = Ns such that 
(3) - iS (A+ 8)f (ra) whenever Sry and n>N. 
Let g(x) be a non-increasing function such that 


(2 bis) Fra) g (in) — 1, as n> o. 
Then 
lim sup f(z) g(a) =1. 
@->00 


In order to prove this lemma, notice that the conditions (iii) and (2 bis) 
imply that | 
(iii bis) ` g(rns)/g(rn) — 1 as n> o. 


If x and n are integers such that the relations r,1 < T'S 7, and (3) hold, 
then, in virtue of the monotony of the function g, 


F(z)g(t) S (À + 8) EF (ra) 9 (rx) Lg (ra) /9 (1) J. à 


Hence, it follows from (2 bis) and (iii bis) that limsupf(x)g(x) is not 
greater than 1. On the other hand, (2 bis) alone implies that it is not less 
than 1. This completes the proof of the lemma. 

Before proceeding to the general class of additive functions mentioned 
above, to which this lemma is applicable, two special cases which become im- 
mediately obvious when thought of in connection wéth this lemma will be 
mentioned. These are the cases of strongly additive and strongly multiplicative 
functions. ‘An additive (multiplicative) function is called strongly additive 
(strongly multiplicative) if f (p”) = f (p) for all v = 1,2,: : >. (Throughout 
the paper p will denote a prime and p, the n-th prime.) 


THEOREM I. Letf(x),zx = Qo, be aivongly multiplicative, so that 
I (pe) =F (pe), (pipe) =P (pi) f(x), GAR). Let f (Pra) 2 f(x) > 1 as 


782 PHILIP HARTMAN AND RICHARD KERSHNER 


k—> œ. Then the conditions (i)-(iv) of the Lemma are satisfied by the 
SEQUENCE Ta = PıP2' °° Dn 

The proof is obvious, in fact (3) is satisfied for N == 1 and 8—0, It 
should be mentioned that the requirement of monotony, f(71) = f(x), 
cannot be dispensed with. This can be seen by the example 


fpe) =14+1/n, 
f (pe) = 1 if k542* for any n. 
In spite of the simplicity of Theorem I, the known case? of the function 
- f(z) =z/p(x), where (x) is the Euler ¢-function, may be treated as a. 
particular case of this theorem. In fact, «/¢(x) is strongly multiplicative and 
. _ P/$(p) = (1 —1/p)*, 
so the conditions of Theorem I are satisfied. Consequently, the relation 
Pips ates Pa , 1 

P(Pipz' ** Pa) eC log log (p:P2° ` * Pa) 


(where C is the Euler constant), which is a consequence of Merten’s asymptotic 
formula 


— 1, a n> o, 


U (1—1/p)7 ~ ec log x 
re a 
and Chebyshev’s inequalities, implies by the Lemma, that 


li Ream) oo oe ia 
ton p(s) eloglogz z 
The corresponding theorem for the additive case is the following: 


THEOREM II. Let f(x), 1,2, - :, be strongly additive, so that 
pe) = f (Pa), f (pipe) = f (pi) +F (pe), (FAK). Let f(x) =f (p) > 0. 
Then the conditions (i)-(iv) of the Lemma are satisfied' by the sequence 
Tn == Pip2` ` ` Pa k 


The proof is again obvious. Notice, in connection with our earlier remarks 
on the comparative difficulties of the two cases that the requirements of this 
Theorem II are much® weaker than those of the corresponding Theorem I. 
It might also be mentioned, in this same connection, that in this case the 
requirement of monotony can be considerably modified. 

As an applicàtion of Theorem II, consider the strongly additive function 

f(n) = fa(n) defined by 


ZE. Landau, Handbuch der Lehre von der Verteilung der Primeahlen, Leipzig 
(1909), pp. 219-222. 
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fa(P) =— log (1—1/p*), fa Pars ° pure) =È fa (Pa). 
The conditions of Theorem IT are christ satisfied. Also, one has 
falra) =3 ta) ~X Mpe, (0<a<1); 
but, in virtue of the prime number Lie | 
tri (1/08 log n — 242 (1-— a) logs, (0<a<l), 
80 that A bis) is satisfied by the function | 


g(n) = (1— a) log log n/ (log n). 
Hence, by Theorem II. and the Lemma, © 
lim sup fa (z) log log z/ (log z)**—_1/(1— a), (0<a<1). 
æ->00 \ 


The strongly additive function f,(n) in this relation may be replaced by the 
additive function log [oa(n)/n*], where og(n) is the classical (multiplicative) 
function defined, as the sum of the a-th power of the divisors of n. For 


a(p) /pi mm 1+ 1/p5 + 1/p +> + + 1, 
which implies, for 0 < a < 1, that | 
log [oa(p*)/P*] < fa(p*) = fa(p) and log [oa(rs)/ra6] ~ fa(rs). 
Consequently, the result obtained for fa(n) may be transcribed as 
iim tenp log [oa (x) /x*] log log z/ (log x)1-4# = 1/(1 — à), (0<aæ<1), 


which was first proved by Gronwall * (using a refinement of the prime numbere 
theorem). 

As another example of the use of Theorem II, consider the function 
f(x) = p(x) defined as the number of distinct prime divisors of z. It is easily 
verified that this function is strongly additive. Since p(ps) = 1, the condi- 


3 This is a consequence of the standard procedure of base 
Z1/pa = È 1/pa = 2 ts — 9 (n — 1) ]/na log n, wiftre 0(@) == log p, 
rao pax re 


applying the Abel ue formula to the last sum, and using the prime number 
theorem in the form (l—e)n < @(n) < (1 + e)n, if n > X. (Cf, e. g., loo. oit. 1; p. 25.) 

ST. H. Gronwall, “Some asymptotic expressions in the theory of numbers,” Trans- 
aotions of the American Mathematical Sooiety, vol. 14 (1913), pp. 113-122. Gronwall 
also considers the functions o (n}/n«, for a=1. However, these cases are simpler 
than the ones treated above; in “fact, they are easily handled in the multiplicative form, 
i.e, without resorting to logarithms. On the other hand, the upper limit is not 
approached on the sequence f, =p, P,- - Py, 28 is the situation above. 
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tions of Theorem II are satisfied by this function f(x) —p(«). Consequently, 
the relation ‘ 
log log (P1P2° ` * Pa) 


log (pipe ` * Pa) TE Ge 


P(P:P2° * * Pn) 
i. e., the relation 


(4) . n log log (PPs: ` ' Pa) _. 


no), 
log (P:1P2° ` ` Pa) ( ) 


` which is an easy consequence of the elementary inequalities of Chebyshev, 
implies, by the Lemma, that 

; a log Tes 
(5) lim sup p(z) long 





. We now proceed to the main result. 


Tuorem III. Let f(x), z= 1,2,:--, be an additive function such 
that, for some e OSe < 1, i 


(6). fx) Sr, ER AEMET 
and m3 i i 
(7) ae LORS 'k> ©. 


Then the conditions of the Lemma are e satisfied with Ta = Pips’ ` * Pa and 
g(x) = log log z/log x. no k 


Proof. The conditions (i)-(iii) are obviously satisfied. In order to prove 
(iv), let 5 > 0 be fixed and let 


a | a Pa pu” Pr < Tn = Papa ` Pre 
‘Now (7) implies that 
flra) _ f(pi) t:i l) 








>], n= ©; 
n n 
Bo that, for any m > °, and for sufficiently large n, g 
(9) | Am 2 ft) = 2 cya 


On the other hand, by (6), 
k 
` f(z) = F vm°, 
m1 
so that . ° 
k 
F(z) £ 3 (vm* 108° Pim) (1087 pan). 
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It follows from the inequality of Hélder that 


j k. k 
(10) F(E) S (Bm log pau)" (È log 0A pin), 
« MAI m=1 
Now, by (8), | | 
i » tk. | n 
(11) a Ym log Pin = 3 log Pm- 


On the other hand, by the inequality of CHR for any 72 > 0 and for 
sufficiently large n, 


(12) 3 log pm S (1 + m)n log n 
mal 

Also, for any ys > 0 and for sufficiently large n, 

'k n 
(18) Slog /C) pin S X log (m +1) S (1+ m)nlog tn, 

m=1 mal 
If (11), (12), and (18) are substituted in (10), one has 

© f(t) S (1+ ma) (1 + y) (n log 1) (nv log 0n) = 


or 


(14) fe) S (1 + m)°(1 + mn 
for n sufficiently large. Combining (9) and (14) gives 


f(t). Ss Am) + ne) (1 + ms) f(r), 


. where m > 0, 72 > 0, ņa > 0 may be chosen arbitrarily small if n is sufficiently 
large. Thus, for any 5 > 0, there is an Na such that 


(15) f(z) S (1 +8)f (ra) if n> Na. 


This shows that the condition: (iv) of the Lemma is satisfied in the present case. 
‘The fact that the function g(x) in the Lemma may be chosen to be ` 


g(x) = log log z/log x 
follows from:(4), in virtue of (9). This completes the proof of Theorem III. 


It should be mentioned that, in view of (7), the requirement (6) of 
Theorem ITI may be replaced by the condition 


(6 bis) ' F(p) S vf (Pr), (v, k =1,232,: ‘), 
and, in fact, in view. of the asymptotic character of the result, (6) or (6 bis) 
need only be required for sufficiently large k. The same is not true, however, 
with regard to v'and it is quite egsy to constryct an example where (6) fails 
only for v == 2 but where the result (15) n no longer holds if z is chosen of me 
form g = p?p? + - fn’. : 
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Tt seems that the requirement (6) or (6 bis) is somewhere near the best 
estimate of its kind which can imply (15). In fact, it is easily seen that if 

fpe) > v/ (log v) f (pe) for some «> 0, | 

then (15) fails if v is chosen to be a power of 2. | | 

An example which satisfies the conditions of Theorem IL is the function 
f(x) =log d(x) /log 2, where d(x) =s (x) is the number of distinct divisors 
of z. In this case f(z) is additive and 

f( pe”) = log (vy + 1) flog 2, (r, k =1, 2, re). 
Thus, the result, | 
lim sup log d(z) log log z/log x = log 2, 
12-00 

due to Wigert,* follows from Theorem ITI and the Lemma. 


QUEENS COLLEGE, 
UNIVERSITY OF WISCONSIN. 


“Cf. loc. cit. +, pp. 219-222. > - 


ON THE PROPERTIES OF A COLLECTIVE + 


By Z. W. BreNBAUM and HerBerr S. ZUOKERMAN. 


1. R. v. Mises? gives the following definition of the simplest collective 
which he also calls an alternative: A simple collective is an infinite sequence 
of observations, the result of each of which may be represented by one of two 
symbols, say 0 or-1, which satisfies 


Postulate 1. If nọ and m are the number of observations, among the 
first n, for which the results are 0 and 1 respectively, then the limits of the 
relative frequencies, lim no/n == w, and lim n/n = w, shall exist; and 

now #00 


Postulate 2. If an infinite subsequence of the total sequence is formed 
by a “selection” then, for this subsequence, the same limits exist and their 
values remain unchanged, Tim im no/n = Wo limni/n = wi 


The numbers w, and w, are us probabilities of the appearance of the labels 
0 and 1 in the collective. 

These postulates have become the object of considerable discussion. Most - 
of these discussions have centred around the second postulate and a number 
of investigations have been made in attempts to prove the consistency of the 
concept of a collective, in connection with the difficulties encountered i in inter- 
preting this postulate.” 

It is the aim of the present paper to prove that a sequence which fulfills 
the first postulate, fulfills also, generally speaking, the second postulate. “Tho 
precise formulation of this statement is given in the following 


THEOREM A. The set of all infinite selections can be interpreted as a 
space © in which a Lebesgue measure is | defined, so that tf a sequence of 0's 


* Received February 21, 1940. 

1 Presented to the American Mathematical Society, Febryary 24, 1940. 

3R. v. Mises, Wahrecheinlichkeitsrechnung und thre Anwendungen.in der Statistik 
und theoretischen Physik, Leipzig u. Wien 1931, p. 14. 

3 Certain special cases of our Theorem A are included in some of these investi- 
gations e.g. in A. H. Copeland, “Point set theory applied to the random selection of 
the digits of an admissible number,” American Journal of Mathematics, vol. 58 (1938), 
pp. 181-192. A special case is also formulated by, H. Steinhaus, “Les probabilités 
dénombrables. et leur rapport à la théorie de la mesure,” Fundamentu MMathematicae, 
vol. 4 (1923), pp. 286-310, especially p. 306. 
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and 1s fulfills the first postulate of v. Mises, the second postulate ts ful- 
filled for the subsequence determined by every selection with the exception 
of a set of measure zero in ©. : 


Theorem A follows from a more general theorem which will be- formu- 
lated in the next paragraph, | 


2. Let K be an infinite sequence (a;,@2,:--) of Os and Ps. The 
number of 1’s among the first n elements of that sequence is Š a. To each 
à 4=1 $ 


sequence K we ascribe the real number k = a,/2 + 0/2? p: > 

Let S be a selection which, if applied to a sequence K, preserves only the 
4,-st, t-nd,- + > terms. The result of an application of. S to K is, therefore, 
the sequence ai, Qi’ * * which we shall denote by K C S, in accordance with 
a-notation introduced by Copeland# ` 

A selection 3 is completely described by a sequence -(b,, bs, : : ) where 
by, =— b =: © + == 1, and b; = 0 for all other values of 7. We obviously have. 


4=1 


We shall consider only selections § which preserve infinitely many terms of a 
‘sequence to which they are applied, i.e. selections 9 with b; = 1 for infinitely 
many values of 4. A one-to-one correspondence between the aet © of all such 
` selections $ and all real numbers s of the intervall <0, 1> can be established by 
ascribing to the selection S ==(5;, ba, : +) the number s = b,/2 + 2/2? +... 
We introduce a measure in © by calling a set 3 in © measurable if and only 
if the set ø of corresponding numbers in <0, 1> is measurable in the sensé of 
eLebesgue, and by defining 


measure of 3 = m(32) — measure of o= mo). 


The relative frequencies of the 1’s in K are 


(2) | h(E) FE Sa 
while those in K C cA are given by 


Treorem B. If F(K j- ts the set of points a condensation of the sequence 
fi(K), fe(K),: ++, then F(K) =F (KC 8) almost everywhere in ©, ie. 
for all 8 except those of a set of measure zero. 


“loo. cit. $. 
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Proof of Theorem B. We denote by ri(t), i= 1, 2,: - -, the well known 
Rademacher" functions which are defined for 0S¢S1 as follows: if 
t=—=1,/2 + t:/2° +: > > is the infinite dyadic expansion of t, then r(t) = 1 
if t = 1, and r(t) = — 1 if 5 —0. We evidently have 
(4) tı = $ (r4 (t) +1). 


Using as arguments for those functions the numbers k and s which correspond 
to K and 8, we find, from (2) and (4), 


(5) f(E) (+R È RG), 
and, from (1), (3), and (4), 





LZ ÉnGon( +2 noti Sawa] 


(6) fa(K C8) = 
1 + 

= ï m(s) +1 

| mn i 

The functions r.(s),r:(8),: > : are a normed orthogonal system. It is known ° 

that for such a system the relation 


(7) lmg 2 Sri(s) — 0 

‘holds for almost all values of s in <0,1>. Similarly, the functions p,(s) 
== 9, (k)r, (s); po(S) = 12(k)re(s),- > +, form a normed orthogonal system, 
and therefore we again have 


ae ee ; 
(8). | im y 2 ri (k)ri(s) — 0, 


for.almost all a. From (5), (6), (7), and (8) we see that 
(9) (+ Lim {fa(K C 8) —fi,(Z)} = 0, 


for almost all S. Hence every point of condensation of the sequence 
{fn(K C 8)} is a point of condensation of the sequence {f;,(K)}, ss there- 
fore F(K C 8) is contained in F(K) for almost all 8. 

We shall now prove that F(K C 8) also contains FK) for almost all S. 
It is easy to see that if F(K) contains two numbers + < u, then it also con- | 
tains all numbers t with r<t<u. We let æ be the smallest and 8 the 


sH. Rademacher, “ Einige Sätze über Reihen von allgemeinen Orthogonalfunk- 
tionen,” Mathematische Annalen, vol. 87e(1922), pp. 142-138. 

°S. Banach, “Sur la valeur moyenne des fonctions orthogonales,” Bull. Ao. Orao., 
1919, pp. 66-72. 


“790 | ` Z. W, BIRNBAUM AND HERBERT S. ZUOKERMAN. 


largest number in F(K). It will suffice to prove that a and B belong to 
F(K C 8) for almost all 8. i 
Let {h} be a sequence of such indices that lim fr, (K) =a. If a is not 


contained in F(K ea 8) for a certain S then it is au a point of condensation 
of {fa(K C 8)}-and, by (9), it is not a point of condensation of {fi,(H)}. 
Therefore only a finite number of the indices i, are equal to some k&n, and, if 
8 is (bı, be: * -), then we have.b:, — 0 from a certain m on. We now let 


T be the set of all S such that br, — 0 for all m =r, and T= È Tr. All § 


for which « is not a point of condensation of {f,(K C S)} bine to T. 
However, it is easily seen that each set T, is of measure zero, and hence T is 
also of measure zero. From this we see that the set Ea, of all 8 for which « 
is not a point of condensation of {f,(K C 8)}, is of measure zero. By the 
_ same argument, the set Eg, of those S for which £ is not a point of condensa- 
tion of the sequence {f,(K C S)} is, too, a set of measure zero. If # is the 
set (of measure zero) of those © for which (9) does not hold, then ` 
EH’ ==: <0, 1 — E — Ea — Eg is of measure one and contains only selections 8 . 


for which both «a and £ belong to F(KC 8). If both « and £ belong to. . 


F(K C38) then F(K C8) contains every number between a and £, and 
| hence contains F(K). Since the measure of W’ is one, this completes the 
proof of Theorem B. 


‘8. Theorem B states that, for a fixed K, there is a set of measure one 
of selections 8 which leave the set of points of condensation of the sequence of 
relative’ frequencies invariant, ie. F(K C 8) = F(K). : 


The dual statement is also true: 7 for a fixed selection S and almost all K > 


we have F(K)=F(KCS). To see this we note that by. a classical theorem’: 
due to Borel, for almost all K, the set F(K) contains only the number 4. 
On the other hand from (6) and (7) we find that, for a fixed S and almost 
‘al K, we have lim fa (X C 8) =4. 


The question may be asked whether it is possible to find a set M ‘of 
sequences K and a set N of selections § such that each set is of measure one 
and that F(K) —F(K C 8) for every K in Mand every Sin N. The answer 
to this question is négative as may be seen from the following argument: 

We first discard the set of measure zero of those K which contain only a 


TFor a more general treatment of such “dual” problems i.e: those with a fixed 
selection and sets of collectives, see Z, W. Birnbaum and J. Schreier, “Eine Bemerkung 
zum starken Gesetz der grossen*Zahlen,” Studta Mathematica, vol. 4 (1933), pp. 85-89. 

*E. Borel, “Les probabilités dénombrables et leurs se arithmétiques,” - 
Rend. Ciro. mat.- Palermo, vol. 27 (1908), pp. 247-271. 


ON THE PROPERTIES OF A COLLECTIVE, . 791 


finite number of 1’s. Now, if the same sequence of 0’s and 1’s is used 
for K and for S, i. e. if K = S, then K CS is a sequence consisting only of 
Vs. Therefore, for every K, we have f,(K CK) =1, n=1,2 5. 
Hence, if 1 is not a point of condensation of {fn(K)} we have F(K) 

~AF(K CK). For almost all K the set F(K) contains only the number #, 

therefore F(K) 34 F(K C E) for almost all K. It follows that, if M is a set 

of measure one of sequences K, and N a set of selections S such that for all K 

in M and all Sin N we have F(K) = F(K CS), then N must not contain 
any 8 = K with K contained in M, and therefore the measure of N is zero. 


UNIVERSITY OF WASHINGTON, 
SEATTLE, WASHINGTON. 
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ON SYMMETRIC BERNOULLI CONVOLUTIONS.* 


By Tatsuo KAWATA. 


1. LetA(t;o), — 0 < t< + œ, denote the Fourier-Stieltjes transform | 


(1) A(É: 5) = [7 ettedo(z) 

-0 
of a distribution function e(z}, — co <z <+ œ. Let B(x) denote the 
symmetric Bernoulli distribution, which has at either of the points z == + 1 
the jump +; so that A(t; 8) — cost, and so the Fourier-Stieltjes transform 
of the distribution function £ (z/b), where b > 0, is cosbt. Thus, the 
infinite convolution 
(2) of) = B(a/bs) * B(z/ba) * B(a/bs) *: ++, where br > 0, 
is convergent if and only if 


(3) 3b < o, 

in which case 

(+t) Aft;o) = [I cos bat. 
k 


Wintner has obtained on the one hand? Gaussian estimates of 1 — o(s} 
and o(— x) for large x > 0 in case of an arbitrary {b+} satisfying (3), and 
on the other hand ë almost Gaussian estimates of A(f;o) = A(— t;e) for 
large t > 0 in case {b} is suitably chosen (e. g., by = k#*, e> 0); he has 
also pointed out? the relation of these estimates to a conjecture of Wiener, 
proved by Hardy.‘ The object of this note is a precise investigation of this 
relation. 

2. First, if {b+} satisfies (3), then there exists a À > 0 such that? 
©) 1—0o(x) == 0 exp(— Ar?) and o(— x) == O exp(—Az*), as T— + œ. 
Actually, (5) holds for every fixed A. In order to see this, one merely has 
to combine the proof? for the existence of a sufficiently small A with a known 


device, which consists in replacing the sequence bj, b,,- - - by the sequence 
bys, bwin, © ©, Where N = N (A). | 


* Received March 24, 1039. a. 

1B. Jessen and A. “Vintner, “Distribution functions and the Riemann zeta- 
function,” Transactions of the American Mathematical Society, vol. 38 (1935), p. 6L ` 

* A, Wintner, “Gaussian distributions and convergent infinite convolutions,” Ameri- 
can Journal of Mathematics, vol. 57 (1935). 

SA. Wintner, “On analytic convolutions of Bernoulli distributions,” American 
Journal of Mathematics, vol. 56 (1934); “On symmetric Bernoulli convolutions,” ` 
Bulletin of the American Mathematical Society, @ol. 41 (1935). | 

`G. H. Hardy, ‘A theorem concerning Fourier transforms,” Journal of the London 
Mathematical Society, vol. 8 (1933). 
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In the particular case where k o) is so nail for. ‘large | ż | as to inil 
the existence of a continuous derivative o' (z), one has 
(6) : © d(t)=0exp(—ar), 2+0, 
for every fixed A. This follows from (5) by a known argument.° 

Now, (6) implies that there does not exist a convergent Bernoulli con- 
volution (2) whose Fourier-Siieltjes transform (1) is Oexp(— òt?) for a 
sufficiently small § > 0. 

In fact, if there existed a 8 > 0 for a suitable sequence {b+} one 
(3), then, on choosing A in (6) sufficiently large, one could conclude from 
the theorem of Hardy * that A(t; e) is of the form P(t)exp(— at”), where 
P(t) is a polynomial and « a constant. This involves a contradiction, since 
(4) has infinitely many (real) zeros and does not vanish identically. 


8. It will now be shown that the result of Section 2 cannot be essentially 
improved. In fact, it will be shown that there exists to every positive in- 
creasing function p(t), O<t< œ, which satisfies the condition 


(7) f PO Gee 

1 P. 
a convergent symmetric Bernoulli convolution (2) in such a way that 
(8) : ‘ A(t;o) = Oexp(—p(|t|)); as é— + œ. 


In the proof it may be assumed that p(t) tends with ¢ to + © in a 
monotonous way, since otherwise we could replace p(t) by p(t) + t. 


Now put, for t > 1,. ia 
q(t) = f P du, 
1 u 


Then clearly g(t) is increasing and, since 


OO uzn fR, 


we have 


(9) ~ pt)=o(#). 


Furthermore, since 


Peel L'Eau 


1 


10 =a fr BO) ay “x fo 0(u)du=0(2), 


e ` ° 
5 Cf. B. Jessen and A. Wintner, loc. oit., p. 67. 
° B, Jessen and A. Wintner, loc. cit’, p. 68. 


and 


we have 


7. 
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Now Si : 


t 

G: 9(@t) at — S “EC du = p(t) log 8 = p(t) +A 
= = . 

for t Z to, ris A == log oe walt) 75): 
Let r(#) denote the inverse function of g(t), and put ay —1/r(#). 

Then we can easily see that ¢(t) — 0. Since 


te (t) -707 — 16) oo —0(1), 


rt) - ' 
we have 
A g(t) dt = NG (N) — poe f” ta (t)(t) at 
—o(1) g(a) —2 [7 w E a 
| = o(1)— ##(1) +2 S nay ay 
Thus, 


if" $3(u)du < ©. 
Since ¢?(u) is monotone, it follows that (3) is satisfied by b, = p(nA). It 
will be shown that, for these b,, the function (4) satisfies (8). 
Lett > 0, The number of those n which satisfy bnt = c is [4a (J): 
for the inequality bnt = c is equivalent to (n4) = c/t, i. e., to r(ndA) S t/c 
gns Z q (+) Thus, the number of those n ‘which satisfy 1 > bnt 2 1/3 is 
[g (8t)/4] — [g (t)/4] & q(8t)/4 — q (t)/4 —1 


= (p(t) + 4)/4—1=p(t)/4, 
for t = t). Hence, i 


. | 
| A(t,c)| —|Ilcos(bst)| = IJ cos(bif) 
s= 1L>bt2% 
| Æ (cos 1/3)?(/4 = exp(— p(t)), for t= to. 


Since (4) is an even function, the proof of (8) is complete. 
Finally, I should like to express my hearty thanks to Professor A. Wintner 
for his invaluable criticism and advice. | 


TOHOKU UNIVERSITY, bd Le 
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T [w] represents the integral part of s. 


THE FOUR-VERTEX THEOREM FOR SPHERICAL CURVES.** — 
By S. B. Jackson. 


1. Introduction. The Four-Vertex Theorem or “ Vierscheitelsatz” 
states that every oval of class ©” in the plane possesses at least four extrema ` 
of the curvature, where an oval may be defined as a simple closed curve with 
non-vanishing curvature. This theorem has been extended to other classes 
of plane curves by Fog and Graustein® and to certain restricted classes of 
space curves by Süss, Takasu, and others.* , As regards the space curves, the 
results have been very fragmentary, and the curves considered have been 
principally those that are closely enough related to plane curves so that analo- 
gous proofs can be carried over. This is not surprising when one considers 
that the property of closure for a space curve puts a much lighter restriction 
on the. curvature than does the same condition for a plane curve, which is 
completely determined by the curvature as a function of the are. Accordingly, : 
it seems more reasonable to look for a.generalization of the theorem to 
spherical curves, with curvature replaced by geodesic curvature, since a curve 
on the sphere is completely determined by its geodesic curvature as a function 
of the arc length. Such a generalization is the object of the present paper. 

By a suitably chosen inversion, any spherical curve can be transformed 
into a plane curve. Under this transformation, it is found (§ 3) that the 
geodesic vertices of the spherical curve, that is, the extrema of the geodesic 
curvature, are transformed into the vertices of the plane curve. From the 
known results for plane curves* there follows at once the existence of at least 
four geodesic vertices on any simple closed spherical curve of class C””. .: 


* Received February 19, 1940. 

1 Presented to the Society, April 8, 1938. 

* First published apparently by Mukhopadhyaya, “New methods in the sense 
of a plane arc,” Bulletin of the Oaloutta Mathematical Society, vol. 1 (109R pp. 31-37, 
and since then appearing repeatedly in the literature. 

3D. Fog, “Über den Vierscheitelsatz und seine Tealing Sitzungs- 
berichte der Berlin Akademie der Wissenschaft (1933), pp.51-254; W. C. Graustein, 
“Extensions of the four-vertex theorem,” Transactions of the American Mathematical 
Society, vol. 41 (1937), pp. 9-23. 

tW. Süss, “ Ein Vierscheitelsatz bei EE Raumkurven,” Tôhoku Matke- 
matical Journal, vol. 29 (1928), pp. 359-362; T. Takasu, “ Vierscheitelsatz für Raum- 
kurven,” Tôhoku Mathematical Journal, vol. 39 (1934), pp. 292-298. Also a number 
of other papers. W. C. Graustein ang S. B. Jackson, “The four-vertex theorem for a 
certain type of space curves,” Bulletin of the American Mathematical Society, vol. 43 
(1937), pp. 737-741. 
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In pushing the results beyond the case of the simple closed curves, a study 


of certain spherical arcs is made, called arcs of type Q (§ 5) because of their -` 


shape. These are entirely analogous to Graustein’s arcs of type Q in the plane. . 
It turns out, in fact, that by a suitable inversion a spherical arc of type Q 
may be carried into a plane arc of type ©. Thereby the fundamental property 
of the plane arcs of type © is transferred at once to the spherical’ ares of type Q.: 
` This property states that there exists at least one non-negative minimum of 
geodesic curvature interior to any spherical arc of type Q; By means of’ it the 
‘Four-Vertex Theorem is extended to a ae class of non-simple spherical 
curves. 

. In a paper in 1936,5 Graustein strengthened the original Four-Vertex 
Theorem. A vertex is called primary if, at the vertex, the curvature is greater 
than or less than the average curvature according as it is a maximum or a 
minimum, and it is shown that the primary vertices outnumber: the: other 
(secondary) vertices by at least four for every plane oval. . We shall establish 
precisely analogous results for a certain class of spherical curves, namely those 
which ‘are tangent indicatrices of other spherical curves (§ 7). The question 
as to whether the strengthened theorem holds for a wider class of DER 
curves is left open.. 

À close relationship is exhibited between the re. vertices on the 
tangent indicatrix of a twisted space curve, and the dual vertices defined by 
Takasu * (§ 8). The relation of the geodesic vertices of a spherical curve to 
the ordinary vertices, that is, the extrema of the ordinary curvature, is also 
clarified (§ 9). It appears that every geodesic vertex is a’vertex, but not | 
conversely, whence any spherical curve has at least as many vertices as geo- 
desic vertices. | 
e 

2.. Transformation of curves by inversion. If C:s = æ(s) is a-regular 
twisted space curve of class C””, lying on a surface, $, the following well known 
equations are valid: © 


CR + in 


SW. C. Graustein, “A new form of the four-vertex theorem,” Monatshefte für 
Mathematik und Physik, Wirtinggr Festband (1936), pp. 381-384. 

° See, for example, W. C. PHRASE Differential Geometry, Macmillan (1936), 
pp. 163-165. 
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_ where 1/9, 1/r,‘and 1/r are, respectively, the geodesic curvature, the normal 
curvature, and the geodesic torsion of C on 3, and a, &, v are, respectively, 
the unit tangent vector to C, the unit normal vector to X, and the unit vector 
tangent to X and orthogonal to C such that (avg) = 1,7 

Let us seek the equations of transformation for the quantities 1/p, 1/r, 
1/r and the curvature 1/R, of C under an inversion in space. If the sphere 
of inversion has radius a and center O, the equation of the inversion in vector 
form is ne 
= az” 
rome | 
where « and 2 denote the vectors OP and OP’, respectively. From this 
equation it follows that the relation between the elements of arc, ds and ds’, 
of C and its image C”, respectively, is f 
ds _ (zl) _ œ 
df e |’) 

If 8 is an arbitrary unit vector localized at the point, P, and ¥ is the 
corresponding unit vector at the inverse point, P’, it is readily shown that 
(2.4) = ZER : 

In particular, the vectors a, v, £ of the.trihedral of:C on X transform into 


mu le), 


| A | 
(2.2) a! == Ge) or, inversely, s 





(2.3) 








| (la) 
(2. 5) i : eek ee En 
met (ejz) ` |. 


The vectors & and v’ may be viewed as the first two vectors of the trihedral — 
of the inverted curve, C’, on the inverted surface, 3’. Since inversion carries 
a right trihedral into a left trihedral and vice versa, it is necessary to take for 
the surface normal to ¥ not & but ¢”==— { in order to preserve the con- 
vention that the trihedral have the same disposition as the axes. The trihedral 
for C’ on 3 is, therefore, a’, v’, ¢” and equations (2.1) for-C’ become 

f e 


det wae 
. a or 
dv’ ` af 7 

ar wi, 

ds == y TF . 


where the primes denote quantities referred to C. 


TFor vector notation see Chapter I, loc. oit. 6. 
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Differentiating the first of (2.5) with respect to s” and substituting from’ ` 
(2.1), (2.3), and (2.6), we obtain the relation 








wv t (zx) 1 (xx) 1 (zla) __2(z|v) 1 
fg et ge eo ue ae à 
(2.7) Gl 1, 2,,4(@le), 
- a r a? a (ale) * 


The inner product of (2.7) with the second of formulas (2.5) yields the 
equation 
i 1 (ele) 1, 2(zle) 


(2. 8) P | a? p a? 





“which represents the desired transformation of the geodesic curvature of € 
into the geodesic curvature of O’. By differentiation of (2.8) with aa 
` to s and substitution from (2.1) and (2.3), we find 


an O-O- 


as the equation of transformation of the derivative of the geodesic curvature. 

In order to obtain the corresponding formulas of transformation for the 
normal curvature, it is only necessary to form the innet padas of (2.7) with 
£"=—#, The result is 





(2.10) —— Ge, 1 L ep 


and differentiation of this relation and use of (2. 1). and (2.8) yield the 
equation oe 
a(t ds? d (1) 2e) & 1 
@. 11) #(3)--(%) 2(2) a ds’ +" 
A similar procedure in the case of the geodesic torsion gives the following 
equation of transformation ' 


(2.12) Dr ae 
Since 1/R? = 1/p? + 1/r?, we have, on squaring and adding (2.8) and (2.10). 


Fem (#Y Les (ere) +46 O7, 


From (2.1) and the Frenet-Serret formulas, it follows that 


"p_i? È 
B D pe 


i i 
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where £ is the unit principal normal né for C. Maig use a this, together 
with the identity . ` 

| oe) = le) + (+ Calo? 
we find 


(2.13) rm) mt ec) UE 


as the equation of transformation for the curvature. 
Differentiation of (2.13) and application of the -Frenet-Serret formulas’ 
gives the equation of transformation for the derivative of 1/R, namely: 


oop eC) (0) ET m) 


where y is the unit binormal vector for C. 

It is to be observed that equations (2.18) and (2.14) are independent 
of the surface, 3, since they involve only intrinsic properties of the curve. 

As a consequence of the formulas developed above, the following theorem 
may be stated at once. 


THEOREM 2.1. If a surface, 3, is carried by inversion into a surface, X, . 


(a) the extrema of geodesic curvature, not at the center of inversion, ` 
on the lines of curvature of class O” of X are carried into the similar extrema 
of geodesic curvature on the corresponding lines of curvature of 3’, points of 
marimum (minimum) geodesic curvature going into points of maximum 
(minimum) geodesic curvature; $ ; i 


(b) the extrema of normal curvature, not at the center of inversion, on 
the lines of curvature of class O” of X are carried into the similar extrema gf 
normal curvature on the corresponding lines of curvature of 3’, points of 
. maximum (minimum) normal curvature going into poinis of minimum 
(maximum) normal curvature.® | 


The proof is immediate, for: the lines of curvature are da by’ 
the fact that 1/r—0. Since ds/ds’ 4 0, it follows by (2.9) that d(1/p)/ds 
and d(1/p’)/ds’ pass through zero together and in the same direction. This 
proves (a), since the extrema of geodesic curvature: are characterized by. the 
fact that at these points (or arcs) the derivative of the geodesic curvature 
Guanes sign,. and the direction. of passing FER Zero for the derivative 


* These statements as to exactly, “is the maximüm and minimum points are 
transformed into are valid only by virtue of our agreement regarding the relative 
orientations of z and 2. 
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determines whether it is a maximum or a minimum point.’ By use of (2.11), 
the proof of (b) follows in a similar manner, except that in this case d(1/r)/ds 
and d(1/r’)/ds’ pass through zero. in opposite directions, so that maximum 
points are carried into minimum points and vice versa. 


8. Geodesic vertices on spherical curves. An extremum of geodesic 
curvature will be called a geodesic vertex, and a point (or arc) where the 
geodesic curvature changes sign a geodesic inflection. The term vertex, alone; 
will be used to indicate an extremum of the curvature, 1/R. It is necessary 
to clarify this ambiguous term, curvature, however. For a twisted space curve, 
C, the curvature is defined as inherently non-negative. For a plane curve,.how- 
ever, we shall use the same word, curvature, to denote what is actually the 
geodesic curvature of the curve with respect to the plane. This curvature may 
be either positive or negative depending on the direction of rotation of the 

_ tangent with reference to the orientation of the plane. 

According to (2.9), geodesic vertices of a curve, C, are preserved under 
inversion provided 1/7 == 0, as was seen in the proof of Theorem 2.1. Special 
interest thus attaches to those surfaces for which 1/r= 0, i.e., for which all 
curves are lines of curvature. It is well known that the only such surfaces 
are the sphere and the plane. Henceforth we shall limit most of our attention 
to such curves. Part (b) of Theorem 2.1 becomes trivial for such curves, but 

- part (a) assumes the following form. 


THEOREM 3.1. The geodesic vertices, not at the center of inversion, 
‘on a plane or spherical curve. of class C” are carried by inversion into the 
similar geodesic vertices of the transformed curve, points of maximum (mint- 
mum) geodesic curvature being carried into points of maximum (minimum) 
geodesic curvature.’ 


A simple, closed spherical curve, C, of class O”, may be carried by a 
suitably chosen inversion into a simple closed plane curve, C, of class C7. 
Since every simple closed plane curve of class C”, not a circle, has at least 
four vertices ? we obtain at once the following theorem. | 


THEOREM 3.2. À simple closed spherical curve, of class 0” , not a circle, 
has at least four geodesic vertices. 


° This proof ‘holds only for isolated extrema, since the first derivative test may 
fail. for extrema which are limit points of other extrema. The theorem is valid for 
‘this type of extrema also, but the proof is omitted as of scant interest for the present 
paper. e eo 

‘10 This result is incorrectly stated by Fog, to. cit. 3 in that he states that vertices 
and geodesic vertices coincide on a spherical curve. This is incorrect. (See $9). 007 
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4, D-ares and D-curves. It is necessary to introduce at this point a 
series of lemmas dealing with certain types of spherical arcs and curves. The 
results and Fo parallel very closely certain work by Fenchel on Res 
ares." 

A simple wie arc of class O’ will be called a D-arc if (a) it consists 
of a finite succession of arcs of class (”” with geodesic curvatures continuous 
clear to their endpoints, and (b)-the geodesic curvature is non-negative when 
the arc is suitably directed. A simple closed spherical curve is called a D-curve 
if every sub-arc of it is a D-arc. Otherwise expressed, a D-curve is a D-arc 
that is closed. It follows at once from these definitions that on a D-arc or 
D-curve the geodesic curvature is continuous except for a finite number of 
points where one-sided limits exist. In the work that follows we shall consider 
the sphere as oriented by viewing it from the tip of the outward drawn normal. 


Lemma 4.1. In a sufficiently small neighborhood of any point on a 
D-arc, the arc lies on or to the left of the directed tangent great circle at thts 
point. 


At a point of continuity of 1/p' the lemma follows from the definition of 
non-negative geodesic curvature, while at a point of discontinuity of. 1/p the 
lemma holds for each of the two arcs class O” which meet at this point and 

thus holds here also. 


LEMMA 4.2. If a D-are joins two. non-diametral points, A and B, of a ` 
great circle and does not meet it elsewhere, the region (contained in a hemi- 
sphere) bounded -by the D-arc and the smaller great circle segment, AB, lies 
to the left of the D-arc. 


The proof given by’ Fenchel 1! for an arc of continuous non-vanishing .- 
geodesic curvature holds without alteration in the present case. It may be 
noted, however, that we have assumed A and B non-diametral; whereas Fenchel 
could prove it. 


Lemma 4.3. A D-arc, contained in a hemisphere, joining two diametral 
points, A and B, is a great semicircle. 


Consider the great semicircle APB directed frome4 to B, where Pi is any 
point of the D-arc. In case P coincides with A(B) we shall mean by APB 
the semicircle from A to B which is tangent to the D-arc at A(B). For some 
point (or points) P the D-arc lies entirely on or to the right of this great 


u W. Fenchel, “ Über Krummung a Windung "geschlossene Reumkurven,” Hathe- 
matische Annalen, vol. 10 (1929), pp. 238-262. 
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semicircle. The points common to the D-arc and this great semicircle APB 
are a closed set, and consist entirely of points of tangency, except perhaps 
for A and B. Moreover, in each ‘case, the tangency must be in the direction 
APB since otherwise the D-arc must either cross APB or cut itself, both of 
which are impossible. If the lemma is false there exists at least one such 
‘point of tangency in every neighborhood of which there are points of the 
D-arc to the right of APB. But this contradicts Lemma 4.1 and is therefore 
impossible. . 


LEMMA 4.4. A D-arc kig at most a finite number of crossings with any 
great circle. i 


By a crossing is meant any point or arc common to the D-arc and the 
great circle in every neighborhood of which lie points on both sides of the 
great circle. Fenchell ™ has proved this lemma for ares of continuous, non- 
vanishing, geodesic curvature, and this proof extends at once to D-arcs. It 
should be observed that by Lemma 4.1 a tangency cannot be a crossing. 

The remaining essential properties are most readily obtained by con- 
sidering first the case of D-arcs with non-vanishing geodesic curvature. At a 
point of discontinuity of 1/p we demand also that both the one-sided limits 
shall be different from zero. 


Lemma 4.5. The tangent great circle to a D-arc of non-vanishing geo- 
desic curvature at a point P has no further points of contact with the D-arc 
in a suffictently small neighborhood of P. 


In general 1/p == dp/ds where Ad is, to within infinitesimals of higher 
order; the angle between two neighboring tangent great circles. At a point at 
which 1/p is continuous, df > 0- and the arc is actually turning away from 
the tangent, while a point at which 1/p is discontinuous is the junction of 
two arcs, each of which is turning away from the tangent. 


LEMMA 4.6. The tangent great circle to a D-curve with non-vanishing 
geodesic curvature, at a point P, has no further points in common with the 
curve. | DOS 


The number of pari common to the circle and the curve is finite, for 
by Lemma 4. 4 the number of crossings is finite, and by Lemma 4. 5 the closed 
set-of tangencies consists only of isolated points and is therefore finite. Assume 
that there are common points, other than P, and let Q be the last such point 


before P. Since.1/p > 0; it follows from emma 4.3 that P and Q are not, , 


diametral. The tangency at P determines a directed great ‘eirele..arc, PQ. 
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Let R denote the region, contained in a hemisphere, bounded by the D-arc 
QP and the great circle arc PQ. R and the arc QP are on the same side of 
the great circle are PQ, and since, by Lemma 4. 5, QP lies to the left of PQ 
at P, È lies to the left of PQ. Since PQ and QP are similarly directed at P, 
R lies also to the left of the D-are QP. It follows readily from Lemma 4. 2 
‘that PQ is the shorter great circle arc joining P and Q. At P the D-curve 
actually passes into the interior of R, by, Lemma 4.5, and therefore, in order 
to return to Q, it must return to a point of the great circle are PQ. This is 
impossible, by Lemma 4.2 since the region bounded would be on the right. 
Thus we obtain a contradiction and the lemma is proved. i 

Fenchell* obtained the following result, restated here only for convenience. : 


Lemma 4.7. If Ais an arc of class C” on a surface of positive Gaussian 
Curvature, a similarly directed geodesic parallel to A contained in the field of 
geodesics perpendicular to A and lying to the left of A has greater (algebratc) 
geodesic curvature than A at corresponding points. 


Since by this lemma a geodesic parallel to a D-curve which lies sufficiently 
near it and to its left is surely a D-curve we are led at once to: 


LEMMA 4.8. The geodesic parallels to a D-curve that lie sufficiently near 
it and to tts left are D-curves of ARENA geodesic curvature. 


~ Leama 4.9. A tangent great circle to a D-curve cannot cross the curve. 


Since, by Lemma 4. 1, a tangency cannot: be a crossing, the curve and a 
great circle meet at some angle, not zero, at a crossing. Suppose there exists 
a tangent great circle that crosses the curve. If the D-curve is deformed to 
its left into an arbitrarily near geodesic parallel, the crossing points and tive 
tangent great circle deform continuously, and the geodesic parallel has a , 
crossing with a tangent great circle. Since this contradicts Lemma 4. 6, the 

. assumption is false and the lemma is proved. 

It is clear from this lemma that every D-curve is eontamad in a closed 

hemisphere, which leads to the following result. ; 


Lemma 4.10. A .D-curve containing two diantetral points is a great 
circle. i T s 

` Since the entire curve, and hencé each of the arcs into which the diametral 
points divide it, is contained in a hemisphere, it follows by Lemma 4.3 that 
each arc is a great semicircle. The conclusions then follows by the conmaaiy 
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It can be shown by further discussion that the D-curves have the character 
of ovals on the sphere. In particular, a D-curve, not.a great circle, has in 
common with any tangent great circle either a single point or a single arc, 
less than a semicircle. Since this property is not essential for our work, the. 
details of the discussion will be omitted. : 


5. Arcs of type Q. Graustein ® has developed a theory of certain plane 
` arcs which he has called arcs of type Q. We shall consider analogous spherical 
arcs which will also be designated as of type 0. 

A spherical arc, AB, of class 0%”, is said to be of type Q if (a) its geodesic 
curvature, when it is traced from À to B, is non-negative and is not identically 
zero; (b) the tangent great circles at A and B coincide; (c) the arc meets this 
common tangent only at A and B; and (d) it is simple except that B may 
coincide with A. Condition (c) is not actually necessary as a part of the 

. definition since it essentially follows from the other three conditions and the 
work of § 4. However, it is convenient and we shall retain it. 
_ ` An aré of type Q, which may be designated without, ambiguity by Q, is 
clearly a D-arc. Moreover, it is tangent to the common tangent great circle 
` in the same direction at 4 and B, since otherwise Lemma 4.1 would be vio- 
lated at one point or the other. By adjoining to Q the great circle arc BA, 
directed in the sense induced by Q, there arises a D-curve, ©, with discon- 
tinuities in the geodesic curvature at A and B. Since © is not a great circle 
arc, À is not a great circle, and by Lemma 4. 10 the arc BA is less than a semi- 
circle. From this discussion and Lemma 4.9 we have at once the following 
result. | 


Lexma 5.1. The closure, à, of an arc of type Q lies on one side of every. 
ingent great circle. The common tangent great circle at A and B has just 
the contact arc BA, less than a semicircle, in common with ©. 


Consider the point 17” diametrically opposite to a point M on the great 
"circle arc, BA, of ©. Let W’, which by Lemma 5. 1 does not lie on &, be chosen 
as center of stereographic projection. The great circle containing the arc B.A 
goes into a straight line, and Q goes into an arc ©’, lying on one side of this 
line and tangent to it at the projected points A’ and B’. Consider the great 
circle K tangent to © at any point other than A or B. The point M’ lies on 
one side of K, while M, and with it all of &, lies on the other side by Lemma 
5.1, since the circle K by hypothesis is not the common tangent great circle 
at A and B. Thus K projects into a circle z with Q/ in its interior. Since, 
at the point of tangency, 0’ must have at least as ‘great curvature a as a a has 
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non-vanishing curvature, except perhaps. at A’ and B’. It is seen at once that 
Q’ is a plane arc of type Q as defined by Graustein.? 

The direction on a plane (spherical) arc of type Q for which the (geo- — 
desic) curvature is non-negative will be called the positive direction., It is 
readily shown that the positive directions on Q and Q correspond. When Q 
is traced so that 1/p = 0, v is directed toward the interior of G, that is, toward 
the smaller of the two simply connected regions into which & divides the sphere. | 
Since A” is not in this interior, the interior of © projects into the interior of 
Gy, and v projects into the vector v’ directed toward the interior of Y. But 
for 0/ the first of formulas (2.6) becomes do’/ds’ == »'/R’. This shows that 
1/R’ = 0 when v’ is directed toward the interior of ©, and the positive 
directions on © and O correspond. We have therefore established 


LEMMA 5.2. By a suitable inversion, a spherical arc of type Q can be 
transformed into a plane arc of type Q, so that the positive directions on the 
two arcs correspond. 


From Lemma 5.2, Theorem 3.1, and Graustein’s theorem ? that the non- 
negative curvature of a plane arc of type Q has at least one minimum interior 
to the arc or is constant throughout the arc, we have at once the following 
theorem. l 


THEoREM 5.1. A spherical arc of type Q has a minimum of non-negative 
geodesic curvature interior to the arc, or has constant geodesic curvature 
throughout the arc. 


This leads readily to a second result, since an arc of type 2 with constant 
geodesic curvature is a circle. 


Turon 5.2. À closed spherical curve of class O” which has geodesic 
inflections and contains an arc of type Q, not a circle, has at least four geodesic 
vertices. 


If the curve is directed so that on the arc of typ Q, 1/p = 0, it follows 
from Theorem 5.1 that there exists at least one non-negative minimum of 1/p. 
Since there are geodesic inflections, 1/p becomes negative, and there must also 


12 This lemma is established only by virtue of the relative orientations of = and 2’ 
agreed on when we derived formulas €2.6). In the present case it implies that if the 
sphere is oriented by the outward drawn normals, the plane is oriented by the normals 

-directed away from the sphere. 
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‘exist a negative minimum. ` It follows that there must also be two maxima, 
and thus at least four vertices. i 


6. | Extrema of the torsion of spherical curves. The torsion of a curve 
C on a surface X is readily found to be 


142 (2)-22 (4) | 

r ds \p p dsir/,1. 

TAR 
ta 


T 


(6.1) ge 


_ by replacing the angle which enters into Bonnet’s formula, 1/T = dp/ds + 1/r, 
“by its value in terms of 1/r and 1/p. For a spherical curve on a sphere of . 
radius b, 1/r == — 1/b, d(1/r)/ds = 0, 1/7 = 0, and (6.1) reduces to 


1d i 
(62) e 
pt 


‘Since the geodesic vertices are the points where d(1/p) /ds changes sign, 
we obtain the following result directly from (6.2). 


Lemma 6.1. On a spherical curve of class C’” the geodesic vertices are 
precisely the points where the torsion changes sign. 


Let us call a point (or arc) where 1/T changes sign a transition of the 
torsion. It has been proved by Fenchel for a closed spherical curve of class 


Č” that f ds/T = 0.3 Hence, for a closed spherical curve, a transition of 


> the torsion is, in reality, a point at which the torsion crosses its average value. 
A maximum (minimum) point of the torsion on a closed spherical curve where . 
1/T > 0 (< 0) will be called a primary extremum. Other extrema will be 

termed. secondary." 


LEMAA 6.2. I f Cis any closed spherical curve of class O”, pr = 8r + g, 
‘where pr and Sp are, a einai the numbers of primary and secondary es- 


; 1: W, Fenchel, “ Über einen Jacobischen Satz der Kurventheorie,” Téhoku Mathe- 
matical Journal, vol. 39- (1934), pp. 95-97. This-result also follows by integrating (6.2). 

Compare W. C. Graustein, Poc. oit. 5. Algo W. C. Graustein and S. B. Jaokion; 
toc. cit. 4. 
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irema of the torsion, and g is the number of geodesic vertices on C. It ts 
understood that if Sr org is infinite the equafion merely implies that pr ts also 
_ infinite. 


By Lemma 6.1, g represents the number of transitions of 1/T on C. 
If 1/T =0, all three quantities are zero and the formula is trivially valid. 
In the contrary case there exist at least two transitions. Between two con- 
secutive transitions, the primary extrema outnumber the secondary by one or 
both are infinite, for on this arc the types of extrema alternate, and the first 
and last are primary. Finally if g is infinite then pr is also infinite, since 
between two transitions there is at least one primary extremum. Thus’ the 
lemma holds in every case. 

The following theorem is a direct consequence of Lemma 6.2 and 
Theorem 3. 2. 


THEOREM 6.1. The number of primary extrema of the torsion on a 
simple closed spherical curve of class C””, not a circle, exceeds the number of 
secondary extrema by at least four, or both are infinite 


7. Geodesic vertices on tangent indicatrices of spherical curves. If 
C is a closed spherical curve of length 7, a maximum (minimum) point of 1/p 
at which 1/p — 1/a > 0 (< 0) will be called a primary geodesic vertex, where 


1/a = (1/1) Í, ds/p; i.e. 1/a is the average value of 1/p taken over C. Al 


other geodesic vertices. will be termed secondary. A point (or arc) where 
1/p — 1/a changes sign will be called a transition of 1/p. Precisely as in 
Lemma 6. 2, it can be shown that NaS t 


(21) . p=s+i 


where p, s, and £ are respectively, the numbers of primary and secondary geo- 
desic vertices, and the number of transitions of 1/p on C. 
It has been shown by Fenchel ** that for a regular space curve Cy 


(7.2) pee ae e 
To Es P 

`- 18 The existence of at least four transitions of the torsion for any simple closed 
curva on an ovaloid was proved by H. Mohrmann, “Die Minimalzahl der stationären 
Ebenen eines riumlichen Ovals,’ Sitz. der Kôniglich Bayoerischen Akad. der Wissen- 
schaften, Math.-Phys. Klasse, Minchex® (1917), pp. 1-63. This might have been used 
in conjunction with Lemma 6.1 to establish Theorem 3.2 
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where 1/p is the geodesic curvature of the tangent indicatrix, or, equivalently, 
ds,/T, = ds/p, where s and s'are the arc lengths on C and its tangent 
indicatrix, C, respectiyely. If, in particular, C is itself a closed spherical 
curve, Í, ds/p = f dso/To == 0, as was noted in § 6, whence it follows that a 
os condition for a closed epherioal curve, C, to be the tangent indicatrix 


of some other closed spherical curve, Co, is that f ds/p == 0; i.e. 1/a == 0. 
-Je 


Since, for a spherical curve Co, 1/Eo 0, it follows that in this case 
1/p and 1/T have corresponding transitions, and ¢, in (7.1), equals the 
number of transitions of 1/7, which, by Lemma 6,2, equals the number of 
geodesie vertices on Co. Thus we have proved the following theorem. 


THEOREM 7.1. If Coisa a closed spherical curve of class C”, and C is tts 
- tangent indtcatriz, then 


p=sSt Jo 
where p and s are the numbers of primary and secondary geodesic vertices 


on O, respectively, and go is the number of geodesic vertices on Co. 


Let Co be a closed spherical curve of class C™? and consider the sequence 
of closed spherical curves C4, +==1,---,n, such that C4 is the tangent in- 
dicatrix of Cia. C4 will be called the i-th tangent indicatrix of Co. It may 
be noted that C, and C are the ordinary tangent and normal indicatrices of Co. 
The last theorem may now be generalized in the following way. 


, THEOREM 7.2. If Co is a closed spherical curve of class C"*? and C4 - 


is its i-th tangent indicatrir, i = 1,: : -,n, then 
+ 


r-t 
Pr = sr +? Ès Jo rSn 
=1 


where p; and si are, respectively, the numbers of primary and secondary geo- 
desic vertices on Cy, and go ts the number a geodesic vertices on Co. 


The proof is by induction on r. For r= 1 the above contention is pre- 
cisely Theorem 7.1. If FA theorem is true for r = t, then Pt == 8- À 2 84+ Jo 
and the number of geodesic vertices on O: is pi + 8: = 2 > & + go Since Crs 
is the tangent indicatrix of Cr it mies by one %. 1 that Pi = Stan 
-+2 > Si + Jo, and the induction? is complete. 
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COROLLARY 7.2.1. The k-th tangent indicatriz of a closed spherical 
curve, Co, contains at least as many geodesic vertices as Cy. The numbers are 
equal if and only if the first k tangent indicatrices have only primary vertices. 


. è i 
Rhe proof is immediate, for px + Su == 2 $, 85 + go = go, and the equality 
. ia 


sign holds if and only if s; == 0, i = 1,5- kn 


COROLLARY 7.2.2. The number of primary geodesic vertices on a tangent 
indicatriz of any order of a simple closed spherical curve, not a circle, exceeds 
the number of secondary geodesic vertices by at least four. 


k-i 
For pr — 8x = 2 X sı + go = go, and by Theorem 8.2, go = 4. It is clear, 
izt 


by this last corollary, that a necessary condition that a closed spherical curve, C, 
‘be a tangent indicatrix of a simple closed spherical curve, not a circle, is that 
p—s = } > 4; that is, the curve must contain at least four geodesic inflections. 
Thus a figure eight curve with only two geodesic inflections cannot possibly be a 
tangent indicatrix of a simple closed spherical curve. 

By suitably combining the relationships discussed here, it would be 
possible to state several interesting theorems. One illustration will suffice. 


THEOREM 7.3. If C, and C, are two mutually inverse spherical curves, 
with first tangent indicatrices Č, and Co, respectively, then Dı — 5, = fiz — 52, 
where Ð; and 5, are, respectively, the numbers of primary and secondary 
geodesic vertices on Ci, i == 1, 2. 


For ji—S$; = gi, where g; is the number of geodesic vertices on Gi, 
i= 1,2, and by Theorem 3.1, g1 = gs. 


8. Dual vertices. As a consequence of formula (7.2) there exists a 
very simple relationship between the geodesic vertices on the first tangent 
indicatrix of a closed regular space curve, C, and the dual vertices which are 


defined by Takasu 1° as the extrema of the dual curvature, 1/P = — T/R. 
By (7.2) 1/p == R/T, where 1/p, is the geodesic curvature of the first tangent 


indicatrix, whence it follows that 1/P—— pp If 1/py340, then 1/P is 
continuous and the dual vertices of C correspond exactly to the geodesic ver- 
tices of the tangent indicatrix. In the contrary case, however, 1/P becomes 
infinite. This occurs whenever 1/7 becomes zero. 

e e 


16T, Takasu, loc. cit. 4. 


8 
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| If, in perticular, C is a closed spherical curve, not a circle, it follows from 
(6.2) that 1/7 passes through zero at least twice, and 1/P cannot be con- 
tinuous. This fact is: apparently overlooked by Takasu in proving his dual 
_ four:vertex ‘theorem’ for spherical. ovals. “He makes use of the continuity of 
1/P to establish thé éxistence of at least two zeros for its derivative, + The 
proof in question is-thus invalid, but curiously the theorem itself à is true. äs 
follows at. once by Corollary 7.2.2 and the relation 1/P = — pg eo 
Takasu observes the relationship indicated above between a curve and its.. 
` tangent indicatrix, but states it‘incorrectly as a correspondence of dual vertices 
of the curve and’ vertices of the ‘tangent indicatrix instead of geodesic vertices. 
As we shall.see in the next paragraph, not all vertices are geodesic vertices. 
.As a matter of fact, the vertices of the tangent indicatrix which “are not also 
| geodesic. vertices are the points that correspond to the discontinuities of 1/P 
instead: ‘of its extrema 


9.” | Variis. of putes curves. In concluding this study of spherical 
curves, it is natural toi inquire what relationship exists between the vertices and 
. geodesic vertices of such a curve, and what can be said regarding the number 
_ of ‘vertices. If the curve, C, lies on a sphere of radius b, then 


` Differentiation of this equation yields 


` 1. d 1 afi 
ee . ale zal): 
The points of C at which d(1/R)/ds, 1/p, and d(1/p)/ds: change sign are, 
respectively, the vertices, the geodesic inflections, and the geodesic vertices. 
Since, for a spherical curve, 1/R £0, the left side of (9.1) changes sign 
precisely at the vertices. Similarly, since a geodesic inflection never coincides 
with a geodesic vertex, the right side of (9.1) changes sign precisely at the 
geodesic inflections and ghe geodesic vertices. ‘This leads at once to 


< Tronen 9.1. The vertices of a spherical curve of class C”” consist of 
z is geüdesio vertion and geodesic inflections. 


ar This theorem is true for curges of ‘clase Ce; but our argument, based on (9.1) 
assumes class C7”. 
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From this theorem it can readily be shown that vertices are not preserved 
by inversion, even for the special case of stereographic projection. Let a circle 
be tangent to an ellipse at a vertex and cut it in two other points. If a sphere 
is chosen on which the above circle projects stereographically into a great 
circle, .the projection of the ellipse has geodesic inflections, for otherwise it 
would be a D-curve, and by Lemma 4. 9 could not cross a tangent great circle. 
Thus the projection has at least six vertices, four geodesic vertices and at least 
two geodesic inflections, as compared with four vertices on the ellipse. 

The statements in the last paragraph are not contrary to (2.14) for 
although for a plane curve 1/7 ==0 and thus d(1/R’)/ds’ is a multiple 
of d(1/R)/ds, the other factor, (1/R)ds/ds + 2(2|B)/b°, may become 
zero, whence it follows that d(1/R’)/ds’ may change sign, even though . 
d(1/R) /ds «0. 


Theorem 9.1 may be conveniently restated in the following form. 


COROLLARY 9.1.1, If C is any spherical curve of class C””, 


v=g +i 


where v, g, and i are, respectively, the numbers of vertices, geodesic vertices, 
and geodesic inflections on C. 


From this follow readily several interesting corollaries. 


rr 


COROLLARY 9.1.2. A simple closed spherical curve of class C””, not a 


circle, has at least four vertices. 
For by Theorem 3. 2, g = 4. 
A D-curve with continuous geodesic curvature is called an oval. Using 


this definition, we state 


COROLLARY 9.1.3. A simple closed spherical curve of class ©”, not an 


oval, has at least six vertices. è 


Since, by hypothesis, there are geodesic inflections, and these necessarily 
occur in pairs, i = 2, while by Theorem 3.2, g = 4. 


COROLLARY 9.1.4. A closed gpherical curpe of class ©” which contains 
geodesic inflections has at least four vertices. 
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For by hypothesis i > Z and g = 2? on any closed curve of class C”.. 
CororLarx 9.1.5. A closed spherical curve of class O” which is a 
tangent indtcatric of any order of a closed spherical curve, not a circle, has 
at least four verlices. 


By hypothesis 1/9 Æ 0, and a necessary condition that a curve be a tan- 
gent'indicatrix of a closed spherical curve is that f ds/p = 0. Hence 1/p 
c 
changes sign and Corollary 9.1, 4 applies. 


COROLLARY 9.1.6. A closed spherical curve of class C’” which is a tan- 
gent indicatrit of any order of a simple closed spherical curve, not a circle, 
has at least six vertices. 


As in the last corollary + = 2, and by Corollary 7. 2. 2, g = 4. 
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A COMPLETE CHARACTERIZATION OF SECTIONAL FAMILIES 
OF CURVES.*? 


By ANNETTE VASSELL. 


. The object of this paper is to study the geometric character of a special 
type of family of plane curves, the secttonal family. A sectional family is 
obtained by projecting from a fixed point upon a fixed plane all the plane 
sections of an arbitrary surface. A set of six plane geometrical properties is 
found for these families and it is proved that they are characteristic. This 
problem was first considered by Kasner ? in 1908 and the solution is analogous 
to his differential-geometric characterization .of dynamical trajectories? Of 
the individual properties mentioned below, I is due to Kasner, as also II, III 
and IIT’ for the case of developable surfaces. Moreover IV and V were sug- 
gested by his properties V and VI for dynamical trajectories. 


By making use of a projective transformation in space which leaves every 
point of the fixed plane invariant and carries the center of projection to the 
point at infinity in the direction orthogonal to the plane, it is readily seen that — 
a given sectional family obtained by central projection from one surface can 
always be thought of as obtained by orthogonal projection from another surface 
projectively related to the first. Let the fixed plane be the z,y plane, let 
z == f(x,y) be the equation of a surface and let z == az + by -+ c be the equa- 
tion of a general cutting plane. Projecting the plane sections orthogonally 
upon the z,y plane we get as the equation of the resulting family of curves 


az + by +ce—f(z,y) = 0. R 


A sectional family is thus a certain kind of three-parameter system of plane 
curves. 

By differentiating and eliminating the constants from the last equation 
we find the differential equation of the system of curves to be — 


(1) (faz + 2feyy’ + fy?) y” | 
ae (feos +. 3fecyy’ + Bfewy? + Faw?) A 3(fa + Enyi”. 


* Received July 27, 1939; revised January 8, 1940. : 

1 Abstract in Bulletin of the American Mathematical Sooiety, vol. 45 (1939), p. 91. 

3 Abstracts ‘in Bulletin of the American Mathematical Society, vol. 14 (1908), p. 
356; vol. 36 (1930), p. 61. 

3“ The Trajectories of Dynamics,” Transactions of the American Mathematical 
Society, vol. 7 (1906), pp. 401-424. Also “Differential-geometric Aspects of Dynamics,” 
Princeton Colloquium Lectures on Mathematics (1913; new edition 1934). 
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This is of the es type “ 


(2) y” =G(a y yy” + Heyyy” 


Kasner proved that all triply infinite families‘ whose differential be is 
of the form (2) where G and H are any functions of x, y, y have the following 
~ geometrical property.” 


` Property I. If to each of the œt curves having a given lineal element in: 
common the osculating parabola i is drawn at that element, the foci will lie on 
a circle through the point of the element. 
And conversely, every system of curves possessing Property I is defined by 
- a differential equation of the form (2). As shown in the reference, the focal 
circle corresponding to a lineal element x, y, y has the equation 


(3) 2G(X* + F?) + (3(¥#—1) —y(y°+1)H}X 
+ ((¥?+ 1)H—6y}¥ —0, 


where X, Y denote current coördinates referred to axes drawn through ‘the 
given point as origin and parallel to the z- and y-axes respectively. . 
| . The special form of the coefficients G and H in the sectional case indicates 

that sectional families possess other properties besides HE I. We observe 
that H has the form 
(4) y — (w, + W2) /2 

aes (y — ws) 

where w, tòs are the roots of the equation fa» + 2fayy’ + fyyy — 0 and. are 
therefore the projections of the asymptotic directions on the surface. We have 


(5) w + 102 =. 2fov/fu» Wiw: — faa/fry- 


We shall show that if w, w: in (4) ‘are any two functions of x and y, 
` whether derived from a surfacé or not; the following property will hold and 
conversely that if this property holds then H must be of the form 1e This 
property was stated without proof by G. Comenetz.® 


Pr operty II. There exist for each point x, y of the plane two directions 
w,, Wz such that any direction and the reflection in y of the tangent to the 
focal circle “determined by x, y, y are‘pairs in the involution muse fixed 
directions are w, and ta. 


tE. Kasner, “ Dynamical Trajectories and the “cot Plane Sections of a Surface, a 
Proceedings of the National Academy of Sciences, vol. 17 (1931), pp. 370-376. 

5“ The Trajectories of Dynanfics,” p. 408. 

baa Curvature Trajectories,” American Journal of Mathematics, vol. 58 (1938), .p- 
225. 


ae x aa gon 
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The slope of the tangent to the focal circle (3) at the given point is 
E | 3(y?—1) —y (y? +1) 
ÿ—G+DA 
It is easily computed that the reflection in y of this slope is y” _3/H. The 
necessary and sufficient condition * for Property II is that . i 
y (y —3/H) — $ (w: + we) [y + (y —8/H)] + mw: = 0, 
‘and this equation is equivalent to (4). . 
When w, = w, (we then write them as w), the involution is singular so 


that the reflection inwy’ of the tangent to the focal circle is fixed and coincides 
with w. | | 


Next we note that with the use of (5), G of (1) may be written as 
| hy’ + my? + ny +k 
6 ae 
(6) 4 (y — w) (y — w) 
where 
(1) h = fyyy/fuvs m == 3hayy/fyy, n = Bfasy/ fuv k == fosz/fuv- 


Moreover, when w, = wz, it may be verified that the numerator of (6) has 
the factor (y — w), that is 





(8) hw} mu? + nw Hk 0. 


We shall prove that if h, m, n, k, wi, w, are any functions of x and y 
subject only to the condition that: if w, = we, (8) holds, the following property 
will be true of systems (2) with H given by (4) , and conversely that if this 
property holds then G must be of the form (6) subject to (8) when w, = we. 


Property III. In each direction through .a given point O there pasges 
one curve which has contact of third order with its circle of curvature. When 
the directions w,, w2 occurring in II are distinct, the locus of the centers of 
thé co! hyperosculating circles, obtained by varying the initial direction, is a 
cubie having a rectangular node at the given point O. The nodal tangents ` 
bisect è the-angles made by the directions w1, we. When w, = ws, the locus is 
a conic which passes through the given point in the direction w. ` | 

The condition of third order contact demands that y” of the differential 
equation of circles (1 + y*)y” = 3y'r/”* be the same as y” of the system (2). 
Equating the two expressions for y” and Hien. solving for y”, we find 


t Graustein, “ Introduction to Higher Geometry id (1930), P- 155, ex. 9. 

s When w, t, are the two isowopic directions, the cubic degenerates to three 
straight lines through the given”point (if also @ = 0, ‘it degenerates into he. whore 
plane), and there is no rectangular node or angle biseotion: E F 
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k ; G +y”) 
9 | ot Co 
= YO SHOE) +5 
Substituting for H from (4) and for-G from (6), we have 
(10) f= 2 (hy? + my? + ny +k)( + y") 
— 8[ (to, + we) — 2 (ww: —1)y — (w + w2)] ` 
This shows that to any x, y, y there is one y”. Hence to any lineal element 
there corresponds one curve which is hyperosculated by its circle of curvature. - 


. The codrdinates of the center of curvature for a curve at a point, with 
respect to that point as origin, and axes parallel to the codrdinate axes, are 


1+ y" 
Y=—;r-. 
y 





-~ y Zr Ut”) 
(11) A PF > 


Solving for y and y” in (11) and substituting the values in (10) we have 


(12) AX’ — mAPY + aXY? — LY’ 
— % (wy + we) X? — 3 (wwa —1) XY + Yo (w, + we)? == 0. 


This is a cubic with a node at the given point. We observe that when we set 
the quadratic terms equal to zero, the product of the roots F/X is — 1 and 
hence the tangents at the node are perpendicular to each other. It follows 
from the identity 


(w + w) (wiw) — (wiw: — 1) (w; + o) — (w, + 2.) = 0 


that w,, w, are harmonie conjugates with respect to the roots of the quadratic, 
hence the roots, that is the nodal tangents, are the angle bisectors ° of w,, We- 
When w, = w, an extraneous factor À + wY must be removed from (12) 
and we obtain a conic having the w direction at the given point. 
‘Conversely, we now ask for all systems (2) having Properties IT and III. 
The equation of a cubic having a rectangular node at the given point is 


(13) AX? + 84,X*¥ + 84,.X¥? + AY’ + A,X? + AX Y — AY? = 0 


where the coefficients À are functions of x, y. The center.of a hyperosculating 
circle is given by (11) where y” is defined by (9). If we substitute the center 
in (13), and apply Property II to the result by substituting for H from (4) 
and then solve for G, we find 


— 3 (4oy — 34y? + 84 — Aa) (w1 + re) yf? 
(14) Qm — À (ww: — 1 )y — (w, +) ] 
2(¥ — w) (y — we) (AY — Asy — 44) | 





? Graustein, p. 155, ex. 10 and p. 152, Theorem 1. ‘When the involution is circular, 
Wy w, are the isotropic directions and ‘there is no question of bisection. 
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This expression for G becomes simplified when we make use of the fact that the 
nodal tangents of the cubic bisect the angles formed by the lines F —w,X, 
Y = #24. The nodal tangents are defined by setting the quadratic terms of 
(13) equal to zero. As before we find that the condition for bisection is 


Ag/Ag = (w + 2) /2 (ww, — 1). 


Substituting in (14), and changing the notation somewhat we find that G 
simplifies to the form (6) where h, m, n, k are arbitrary in a, y. 

Similarly we deal with the case w, = we. 

We have now proved that a necessary and sufficient condition that a system 
of curves have an equation of the type (2) with H and G of the forms (4) 
and (6) respectively, that is, | | 


(15) (y — wi) (y — w:)y” | 
| = (ag + my? + ny + k)y” +3 (y- Eee ye 


where h, m, n, k, wW, and Wa are arbitrary functions of x,y (except for (8) 
when w, = w») is that it possess Properties I, 11, III. 

Property V of Kasner’s set for dynamical trajectories states a relation 
between the radii of curvature of the trajectory in the w direction at a given 
point which hyperosculates its circle of curvature, and of the line of force 
(the line y’ =w) passing through the point. This suggests a similar in- 
vestigation in the case of sectional families. 

We shall prove the following property. 


Properly IV. Let the directions w, and w. of Property II be distinct. 
Of the curves which pass through a given point in the direction w, there is 
one which has contact of third order with its circle of curvature; the radiys 
of curvature of this curve is 3/2 the radius of curvature of that one of the 
integral curves of the direction w, which passes through the point. A similar 
statement holds for the direction wa When w, == w., the above statement is 
replaced by the following: the curve in the w direction which has contact of 
third order with its circle of curvature has the curvature zero.” 

Let w, 54 we.’ To find the radius of curvature of the curve in the w, 
direction which hyperosculates its circle of curvatu$e we substitute in the ' 
formula for radius of curvature y” as determined by (10) and w, for y. 
We observe ‘that this radius of curvature is, by definition of the cubic (12), 


7°-'In this case the integral curve of the direction w also has curvature zero and 
thus the 3/2 ratio still holds, but it % not necessary ‘to mention this in order to have 
a characteristic set of properties. On the other hand merely to say that the 3/2 ratio 
holds is not sufficient for a characteristic set. 
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identical with the segment which the normal to the w, direction at O intercepts 
on the cubic. Let us-call the point of intersection of this normal with the 
cubic Ny. Then — | 

ae) ONS 2(hw,s + mae + nw + k) ` 


The radius of curvabure pı of the curve y = w, is 


a) garer, 


pı 
Wie + WiWiy 


The necessary and sufficient condition for ON, to be 3p1/2 is 
(18:) han + mw? + nw, + k= (w: — Wy) (Wie + WiWıy). 
Similarly the condition corresponding to the w, direction is 
(182) | hw + mio? + nw: + k = (w — We) (Wee + Wey). 


When w, = w., the denominator of (10) has the simple factor y — w. 
Tn order for y”, and hence the curvature corresponding to the w direction, to 
vanish when y — w the cubic in the numerator of (10) must have (y —w)* 
as a.factor. In view of (8), the necessary and sufficient condition for this is 


G9) 3 -: Shw? + 2mw + n = 0. 


It may be verified that sectional families which have w, 54 ws obey (18;) 

and (18:2) and those for which vi == W obey (19). Hence sectional, families 
have Property IV. 
We obtain the remaining aies by differentiating and ns in 
various ways the relations (5) and (7). - We omit the calculations and merely 
state the results, which can be verified from (5) and (7). The first relation 
found is 
120) l | he = b= (em) . 
WW ý 

We shall prove that (20) is the necessary and sufficient condition for the 

. next property. 7 


. Property V. When the point O is moved, the associated cubic referred to 
in III changes in the following manner. Take any two fixed perpendicular’ | 
directions for the x direction and the y direction; through O draw lines in 
these directions meeting the cubic again at A and B respectively. Also con- 
struct the normals to w, and #, at O. APA draw a line in the y direction 
meeting these normals in some points A’ and A”, and at B draw a line in the 
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x direction meeting the normals in some points B’ and B” respectively. The 
variation property referred to takes the form | 


a Lae T me Lap + |, A ele] Lo 


where AA’, AA”, BB’, BB” are signed distances and where 1, Wa denote 
the slopes of the directions referred to in II relative to the chosen z-direction. 
This is true for any pair of orthogonal directions, and therefore really expresses 
an intrinsic property of the system of curves. 

‘To establish the above statement, we substitute in (20) the values of h 
and k from 


(22) OA == LA Ce and OB= 34 Wi t > : 


these latter being the intercepts on the codrdinate axes of the cubic (12). On 
simplifying the result somewhat, we find 


Oa a son aon |: +. Ce ai 


Y 
Now if we carry out the construction as expressed in V we find from triangles 
AA'O and BB’O that — OA = w,4 A’ and — BB’ = w,0B, and from triangles l 
AA”O and BB”O that — OA = w,4A” and — BB” = w,0B. On sub- 
stituting these in the above equation, we obtain (21). 
The final property aaa on the following two relations. 


1s ia — 10112 ; 210, Wz (We) y 
(23) (UE + We y h+ ae Saale (ui + uw)? ? 


(24) (10,402) (10s + 202) h + 2k = 2 (ane) — (w103) (w: + 2) = 


If we now use the relations (22) in (23) and (24) we obtain | 








cs) A + Rte Le Ce) Ce 9h (an, + we) | are | mt 


1 1 We)? w w POST IA T l 
(26) CRD ER yg MEE 96 (ing) 36 (wits) (a + way = € 
Property VI. Let the intercepts OA and OB be constructed as in V. 
The variation of the directions w and w as the point O is moved-is related 
to these intercepts by the equations (25) and (26). 


Obviously (23) and (24) are the necessary and sufficient conditions for 
Property VI. 
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We shall now prove that Properties I-VI are sufficient for sectional fami- 
. lies. We have already seen that any system of curves having Properties I, II 
and IIT will have an equation of the form (15). Our problem is to show that 
if we apply -(18,), (18:) (or if w, = w», (8), (19)), (21), (25) and (26) toa 
system of curves (15), the curves will be the projections of the plane sections 
of a surface; that is, that w,, W, A, m, n, k will be of the forms (5) and (7), 
where f is some function of z and y. Instead of (21), (25) and (26) we may 
use (20), (23), and (24), which are equivalent to them. 

Equation (20) is an integrability condition; hence a function F(z, y) 
exists which satisfies the equations 


k — (ww); 
W We 


(27) fd F= 


If we place the expressions for h and k from (27) in nee ha multiply 
the result by e? we have 


(e¥w,ws)y = — $ {eF (w: + w) Je 
Therefore a function H exists such that ` 
(28) eww, A, and — $ef (w, + we) == Ay. 


Now if we substitute (27) and (28) in (24) so as to eliminate h, k, w, and we, 
and simplify, we find that H,, = (e*),. This equation means that a function 
g exists for which Hy == ge and ef == g. The first of these two equations 
further says that a function f exists for which H.= fs and g == fy. We have 
then | 

(29) = fy and Hmm fo. 


Cfnsequently w,,we obey (5). Next.we substitute (5) and (29) in the 
equations (18,), (182) (or if w, = wz, in (8), (19)) and (27) and solve 
them for À, m, n, k. We find that they satisfy (7). 

We have now proved that every sectional family possesses Properties I-VI 
and every family of curves possessing Properties I-VI is a sectional family. 


Developable surfaces. On a developable surface-the asymptotic directions 
coincide, hence w, = w9. Therefore a sectional family derived from a de- 
xelopable surface, which we may call a developable system, has Properties I-VI 
in the simpler form to which they reduce when w, = we. 

Conversely, every family of curves having Properties I-VI in the reduced 
form is a developable system.: Thus the reduced set of six properties is a 
characteristic set for developable systems. fn this case G in (2) is linear in 


y and H is 3/ (y — w). 
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Wher w; is set equal to ws, (21) in Property V becomes | 


a i (ar) -g r) tA 


and Property IE says that y’ bisects the angle between w and the reflection in 
y of the tangent to the focal circle. For developable systems one asymptote 
of the conic in Property ITE is perpendicular to the w direction. 

Now a given: type of family may be characterized by more than one set of 
properties. In. the case of developable systems it is possible to replace VI by 
the following more elegant pair of properties. 


(A), The integral curves of the direction w are straight lines. 


(B) Let a be the angle between the asymptotes to the conic of IIT, let 
K be the curvature of the conic at the point O, and let K, be the curvature at 
O of the orthogonal trajectories to the straight lines y = w. Then 


K sina + 2K, cos a = 0. 


Property (A) is well known although it has not been stated in this con- ‘ 
nection before. The corresponding analytic statement is ws + wwy = 0. The 
(tvz/w), in (30) then becomes — wy. 

A property of sectional families derived from non-developable surfaces 
which is analogous to (A) may be obtained from (5) by differentiating, and - 
eliminating f and its derivatives. The eliminant is found to be 
(wy — We) (Wizz F Wore + RW reg + WWW ey + Wi Wayy + W Wy) 


+ Qu Way — PW Wry A 4w Wes ey— LWW eWiy F 27222 — 2W7 12 == 0. 


This has the following. geometric significance. Let 8 be the angle between 
the directions w, and we, let y and T be the curvatures and let s and 8 be the 
lengths of arc along the curves y = w, and y’ — w; respectively; then 


` a9. ar \. do dô 
gin ĝ (2 — as)? (r—4) (r—2 2) 


ano (5 + Z) (+4) (40H). 


This relation is a new characterization of asymptotic nets." 
An alternative for Property III for developable systems is: 


** References to other characterizgtions of alg at nets may be found in Fubini 
and Cech, “ Introduction a la Géométrie Projective Différentielle des Surfaces” (1931), 
p. 191. 
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Properly III’. The locus of ihe centers of the et tes corresponding 
to the elements at a given point is a conic with that point, as focus. 
For a given developable surface the conic is 


(GE + ie 


The analogue of III’ for families derived from non-developable surfaces 7 
has not been worked out completely but it.is certam that the locus of the 
centers of the focal circles at a point is a cubic curve. The coefficients are 
very lorg expressions in terms of fes, foys* * `, fyyy and they are symmetrical 
in the subscripts and y. The constant term turns out to be M(L?— 4M)? 
where M is the Hessian and Z the Laplacian of the surface. 


) BET VE (= r+ yEy) 


Ruled surfaces. Ruled surfaces may be characterized analytically by the 
fact that the cubic in y in the numerator and the quadratic in the denominator 
of G (that is, the coefficients of y” and y” in (1)), have a linear factor in 
common. For the condition for the.existence of such a factor is exactly the 
differential equation of ruled surfaces found by Monge. 

In order to give a characteristic set of geometrical properties for sectional 
families derived from ruled surfaces we have only to add the sale property 

to the previous set. 


Property VII. One of the two families of integral curves of the directions 
Wı, #2 consists of straight lines. 
This is so because a ruled surface is a surface one of whose two families 
of asymptotic lines are straight lines. In this case the family of straight lines 
. is at the same time part of the sectional family since it is the projection of thé 
rujings on. the surface. These straight lines belong to the ea 
curves referred to in Property IV. 
The condition for VII is 


(Wia + WiW:y) (Wen + WaWay) — 0, 


where w,, w, are given by (5). Incidentally it follows that this condition is 
equivalent to Monge’s equation for ruled surfaces. 
° 
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EXACTLY (k, 1) TRANSFORMATIONS ON CONNECTED 
: ' LINEAR GRAPHS.* 


By O. G. Harrop, Jet 


1, Introduction. In a paper by G. T. Whyburn [1]? there is given, 
among other results, a detailed study of the behavior of interior transforma- 
tions on linear graphs. ‘The results suggest a connection between these trans- 
formations and transformations which are exactly (k,1).2 This paper gives 
a more or less detailed account of exactly (k,1) continuous transformations 
defined on a connected linear graph. Since we are considering a connected 
graph, this type of transformation includes all local homeomorphisms defined 
on the given set [2]. 

In 2. results are given concerning exactly (4,1) mappings defined on 
Peano spaces of varying degrees of complexity. It is hoped that several of 
these results will be of use in attacking the problem of determining precisely 
what topological structure a set must have in order that an exactly (k, 1) 
continuous mapping can-be defined on it for k > 1 [31.4 In 3. an example is 
given showing that an exactly (3,1) image of a graph need not be a graph. 
In 4. the case k == 2 is given special attention. It is shown that an exactly 
(2,1) image of a graph A is a graph B, and furthermore, there exist sub- 
divisions of A and B into finite complexes Ky and Kg, respectively, such that 
the transformation of K4 into Kg is simplicial. A formula is given which 
not only relates the structure of A to B but also actually limits the type of 
sets A on which an exactly (2,1) mapping can be defined. 


2. Exactly (k,1) transformations in general. 


2.1. Let f(A) == B, where A and B are arcs. If f is at most (k,1) on A, 
there is an open set U dense on A such that f ts topological on each com- 
ponent of U. 


* Received March 7, 1940. 

1 National Research Fellow. 

*The numerals in brackets refer to the bibliography at the end of the paper. 

3 All transformations considered in this paper are single valued and continuous. 
By an exactly (k,l) transformation is meant a continuous mapping such that each 
point of the image space has exactly k inverse points. By a (k,l) mapping is meant 
a continuous mapping such that every point of the image set has at most k inverses. 

“See also an abstract by J. H. Robêrts in the Bulletin of the American Mathematical 
Society, 45-11-433. 
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` Proof. It ie supposed that A and B are non-degenerate arcs. The asser- 
tion is true for k == 1. For k > 1 it will suffice to show that on any sub-arc 
of A there is an open set U on which f is a homeomorphism. . Denote the 
end-points of A by at and a? and those of B by bt and b°, It may be supposed 
that A is a sub-arc of the given arc A such that f (81) — a? and f> (b°) == a’. 
The inductive assertion is that the statement is true for k— 1. If every point 
in the interior of B has at most k — 1 inverses in A, the desired open sét U ` 
exists by our hypothesis. If x e B— (b* + b?) has k inverses in A, let the first 
and last of these in order from a’ to a? be æt and 2, respectively. There is 
an open interval V in B with v as an end-point such that for every ye V, 
f(y) (ata + 27a?) has at most 4 —2 points, otherwise, since f(æz?) is a 
non-degenerate arc in B, some point in B would have at least k + 1 inverses. 
For some z on ax! (or z*a?) the open interval W between z and zè (or s?) 
` .maps into a subset of V. By the choice of V; f must be (k—1,1) on W, 
hence by the inductive assumption, there is an open set U dense in W such 
that f is topological on ‘each component of U. This proves 2. 1. 


2.2 If fi exactly (k, ty: on the stably pagal curve A,’ B= f(A) is 
stably regular. 


_ Proof. First, B can contain 1 no non-degenerate continuum XY which con- 
tains no are. For this would imply that f*(X) is totally disconnected, which 
_ ig not possible for an exactly (k,1) mapping.* ‘Thus, assuming that B can 
. contain a non-degenerate continuum X such that X C B— X, we may take 
X to be an are. It will be shown that this denial that B'is stably regular 
leads to a contradiction. Since f +(X) is not totally disconnected, there is a 
non-degenerate continuum Y C f*(X). Since A is stably regular, we may 
tgke FY to be a free arc’? in A. Then f(Y)—X:CX. Hence we may apply 
5 1 and further restrict Y to be an are mapping topologically into some are 

in X. Clearly, X1C B— X*. Each point z in the interior of X* has 
oat k — 1 inverses in À —Y. Thns any arc Xj! wholly in the interior of 
X! has an inverse f*(X,!)(4 —Y) which is not totally disconnected, ‘since 
f is exactly (4 — 1,1) on the compact set f*(.¥,7)(4—Y). Hence there 
- is a free are Z C A — F such that f maps Z topologically into XC X1, As 
before, XY? C B — X*, è After a finite number of steps there are determined 











k free ares Y, Z,- + +, Win A such that each contains an are Ty, Tg: - ‘Tw po 


# À continuum M is said to be stably regular (beständig regular) provided that 
for no non-degenerate continuum T does TC M—T. 

ê See [3] and the references given therein. © è 

* The arc T is said to he free in X provided no interior point ‘of T is a limit point 
of X— T. It is essential to notice that a free are is a closed point set. 
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mapping topologically onto X°, where X° is a non-degenerate arè such that 
X° C B—X°. Let x be-any interior point of X°. Let Tnt, Ge B— ZX. 
Since f*(z,) and (Y + Z+: - -+ W) have no common points, this implies 
that the compact set A— (fy+T7z+----+ Ty) has an inverse to z, hence 
z has k + 1 inverses in all, which is not possible. This permits us to state also 





2.3. If f is exactly (k,1) on the stably regular curve A, to each non- 
degenerate arc G in B = f(A) there is a non-degenerate sub-arc G of G such 
that f*(G") consists of k arcs each mapping topologically onto G*. 
2. 4. If f is exactly (k,1) on the Peano space A and x is an end-point of 
B = f(A), each point of f(x) ts an end-point of A. . 

Proof. Denote the Urysohn-Menger order of z by o(s); Set f (2) 
=r H pH oha Let (Tt), j= 1,2, +--+, ni, be a set of m arcs in À 
each terminating at zt, but with no other common point (by pale): Suppose 


each set chosen so that ST;t- Ty” = 0,1 Ÿ. Then C == IL i f(T;*) con- 
tains an infinite à sequence of distinct points converging to x oe o(r) =1. 


Clearly, each point of C — has Èn, = ķ inverses, hence each ni — 1 and 
each o(a*) = 1. 

If Ve(z) denotes a region in B of dun less than e, and if V.(z) —x 
has o(r) components for e seu small, then s is an end-point of each 
such component. We have ` : 


2.5. If f ts exactly (k,1) on the Peano space À and z e B = f(A) is such 
that each suffictently small regin in B containing x is cut into o(z) com- 


ponents by the removal of z, then È o(zt) S o(s): k. á 
2.6.` Let f be ssh (k, 1) on the stably regular curve 5 À. If x is a whe ee arc 
in Be=f(A) containing no point of E == f (E°), where E° is the set of branch 
points plus end-points of A, then f*(X), has at mast 2k — 1 components. 


Proof. Let X be a free arc in B— E. Then f*(X) C A— E°. Since 
f is exactly (k; 1) on A, f*(X) is not totally disconnected. “Let J be a non- 
degenerate component of f*(X). By the choice of Æ, J is necessarily an are 
(a free arc). By the continuity of f, each end-point of J must map into an 
end-point of X (not necessarily the.same end-point). There remain at most 


*It is evident that J could be only a aione closed curve or arc, since all of its 
‘points are of order two. The first Possibility is fuled out by 2.7 (which does not 
depend on 2.6). ` > f 
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2k — 2 points.in f*(X) —J to be located which map into an end-point of X. 
Clearly, any isolated point of f*(X) must map, by continuity, into an end- 
point of X. Hence each component of f> (X) — J contains at least one point 
which is carried into an end-point. Thus f1(X y= — J can have at most 24 — 2 
components. 


2.7. If f ts exactly (k,1), k> 1, on the continuum A, B= f(4) is not 


. an are. 


Proof. Suppose, on the contrary, that B is an arc for some exactly (k, 1), 

k > 1, mapping defined on a continuum 4. Now it is known that if a regular 
curve (Menger) is obtained from a continuum A by an at most (k,1) con- 
` tinuous mapping, then À is likewise regular [4]. Thus À will be assumed to 
be regular (hereditary local connectedness is sufficient). We take B to be the 
unit interval 0 S y S i. First, there is a proper subcontinuum À: of A such 
‘that f is exactly (k,1) on At and f(A*) — Bt, where B: is the interval 
OS yS b <1. By 2.4, each point of f+ (1) =2'-+ 2*- - -4 a is an end- 
point of A. Let T*,T%,---,T* be k arcs in A containing qt,- - -,a*, 


k + 
respectively, and such that TITI = 0, t£ j. The set C= IT f(T*) is an arc 


in B containing y—1. Setting S'—T*f*(Q) and noting that f is (1,1) 
on 5%, St is an arc. In fact, since B is an arc, the arcs S* are free in A. Let 
the end-points of C bez and y = 1. For any z < b! < 1, the set E(0 Sy S b+) 
. a 

` is A minus & open free arcs (each containing an end-point of A) which is a 
continuum A! on which f is exactly (k, 1). 

Next, the property of being a continuum in A on which f is exactly (k, 1) 
is an inducible property, for if A? = It A‘, where each A‘ is a continuum in 

e i 

A on which f is exactly (k, 1) and A‘ C At, then z e A? implies f*f(z) C A’. 
It follows that there is a continuum A° in A which is irreducible with respect 
to the property of being a continuum containing f*(0) and on which f is 
. exactly (k,1). If f(4°) is non-degenerate, we get a contradiction, for, A° is 
a Peano space (A is hereditarily locally connected) and the first remark will 
apply, hence A? is not irreducible. If f(A°) == 0, we again get a contradiction, 
for A° is a continuum fontaining only gt + 2?- -+ æ = f1(0). 
2.8. If f ts exactly (k,1) on the compact, heredttartly locally connected 
space A and B==f(A) ts an arc, there is an arc Bt in B such that do 
` consists of k arcs each mapping topologically onto Bi. 


Proof. The statement is trivial if B “reduces to a single point. Since 
f is exactly (4,1), there exists a non-degenerate continuum A*C A. Since 


s 
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A is hereditarily locally connected, At may be taken to be an are. pe é 
by 2.1, there is an arc A,’ mapped topologically by f into Bj C B. Since 

f is exactly (k— 1,1) on the half-compact space À — À;:, there is a non- 
degenerate continuum A? f1(B,1) - (4— 4,1). As before, we take A? to 
be an are, and applying 2. 1, there is an are 4, C A? on which f is topological. 
Set f(A?) = Bt. Then A,? and A,’ each contain an arc mapping topo- 
logically onto B;', After k such steps, we obtain arcs A,*, 423%, : © -, À; each 
of which contain an arc Ct == A;tf (Bat), t= 1,2, --,k, mapping topo- 
HE onto Bt. 


2:9. If f ts exactly (k,1) on thi continuum A and B= f(A) ts du 
regular, so also is À. 


Proof. Case 1. The continuum A is hé locally connected. 
Suppose T is a continuum of condensation of A. Since A is hereditarily locally 
connected, we may take T to be an arc. Let S—f(T). Let Y be a free arc 
in 8. Since F is free, f*(Y) can have only a finite number of components 
and is thus hereditarily locally connected. Applying 2.8 to f*(Y), there 
exists a set of & arcs 7", T?,- - -,7* (mutually separated by pairs) in A such 
that each T! maps homeomorphically onto X C F. Since the sum of k 
mutually separated arcs cannot contain a continuum of condensation, there are 


£ 
points in any neighborhood of a point v e T which belong to A — (T + 2 T+). 
Let z be any interior point of the arc X. Les f(T) DX, Pars 0. Let 
PeT: f(z). Let In D, En e Å — ( Š Ti+ T). „Then by continuity, 


f(t.) > f(x). But f(z) is an interior paint of a free arc X all of whose — 


inverses have been located in 37", which is a contradiction. à 


Case 2. The continuum À contains a continuum of convergence. The 
transformation f being at most (k,1) and A ASE (in the sense of 
Menger), B is likewise [4]. 


3. The original set À is a graph. If A is a connected linear graph, 
the image set B = f(A) under an exactly (k,1) mapping is a stably regular 
curve which has at most a finite number of end-pointse The following example 
shows that an exactly (3,1) continuous mapping defined on a graph need 
not give a graph. 


ExawmPze. Two basié mappings of an arc into an arc will be defined. 
Let C bé the interval OS ¢1., Let D be the interval OS y 5&1. Let the 
closed interval of C(D) between (n +1) and (n) be denoted by C;(D,). 
Define f as follows: Map C, topologically into D, with f(1) —1. Map C: 


1 


828 . 0. G. HARROLD, JR 


(topologically) into D, with f(1/2) ==1/2. Map Cs; into D; + D, with 
f(1/3) = 1. In general, Con is mapped onto D, such that as ¢ decreases 
y increases and Con:, is mapped onto D,: + D, such that as ¢ decreases 
y decreases. Finally, f(0) —0. Each point in the interior of D has three 
inverses in C, while y == 0 has one and y == 1 has two inverses. This will be 
called a mapping of type («). By demanding that the point which generates 
D. oscillate in the same manner near y == 1 as it does near y == 0, a continuous 
mapping of C on D is effected which has the same properties as the one above 
except that both y = 1 and y = 0 now have precisely one inverse. This will 
be referred to as a mapping of type (8). 

Let Xt, X? and X° be the intervals OS z S 1, y = 1,2, 3, respectively. 
Let J denote z == 0, 1 Sy S3. Set Am J -+ XIE X°4 X38. Let X,* be 
the sub-interval of X+ between z= 1/n and s= 1/(n+ 1). For a fixed n 
define on Xant, i = 1,2,3 a mapping of type (8). Then identify the end- 
points of the image arcs corresponding to s = 1/n and identify the end-points 
of the image arcs corresponding to s = 1/(n +1). Thus f (Xa! + Xn? -+ 7,5) 
= Ÿ, is a theta curve (i.e. of the form of the letter 8). Every point of F, 
has exactly three inverses on X»! + Xa? + Xn’. Repeating this for each 
n= 1,2,3,- < + and setting Y = 3Y,, we obtain a continuous exactly (3,1) 
mapping of X! + X? + X? onto Y. Evidently Y is merely the enclosure of a 
sequence of theta curves converging down to a single point such that Fn’ Yunis 
isa single point. Now on J define a mapping similar to type (a) such that 
the points of JX’, JX? and JX? are identified. Thus f(J) ==J* is a topo- 
logical circle. Setting B = f(A), we have an exactly (3,1) continuous 
mapping of the triod À onto B. The image B is clearly no graph, since it 
contains an infinite sequence of simple closed curves. 

e A continuous transformation f is said to be locally interior at the point « 
provided that f(z) is not a boundary point of the transform of any open set U 
containing z. The above mappings of type (a) and (8) fail to have the 
property of being locally interior at the points t= 1/n, hence the above 
exactly (3,1) mapping is not locally interior at infinitely many points. It 
follows from 2.3 that any exactly (k, 1) mapping defined on a stably regular 
continuum is locally interior except perhaps for a closed set of dimension zero. 


3.1. If f is an exactly (k,1), k>1, te defined on the graph A, 
B = f(A) contains a simple closed curve. 


Proof. It is to be shown that B is not a dendrite. From 2.4 and the 
fact that A is a graph it follows that B has at most a finite number of end- 
points. Let B have n end-points. The assertion has been proved for n == 2 
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(2.7). Assuming the statement true for a dendritic graph with n end-points, 
it will be shown to be true for n+-1. Let x be an end-point of B. Denote 
the maximal free arc containing z by X. Set C—B— YX. Set D—f#(C). 
The property of being a continuum in A which contains D and on which f is 
exactly (£,1) is inducible. Hence there is a continuum 4° in A which is 
irreducible with respect to this property. If f(A°) contains C as a proper . 
subset, by precisely the same reduction as was made in the proof of 2.7 we 
can find a subcontinuum H in 4° such that C C f(H), f(A) is a proper subset 
of f(4°) and f is exactly (k,1) on H. But this denies the irreducibility 
property of A’. Hence f(A°) —C. But C has one less end-point than B, 
hence, by the inductive hypothesis, f is not exactly (k, 1) on £’. 

The preceding results, in so far as they apply to the case in which A is a 
graph, may be summarized as follows. 





3.2. Let f be a continuous exactly (k,1) transformation defined on the 
connected linear graph A. .The image B = f(A) is a stably regular curve. 
The curve B is never a dendrite, and for k > 2 need not be a graph. The 
function f is locally interior at all points of A except possibly for a closed set 
of dimension 0. There ts a closed set D of dimension 0 in B such that each 
component of B— D is an open free arc whose inverse is precisely k open 
free arcs in A each mapping topologically onto the common image. Each 
free arc X in B containing no point of E =f (H°), where E° ts the set of 
end-points plus branch points of A, is such that f*(X) has at most 2k—1 
components, | | 
While the above theorem fails to give an exact statement of what the 
image B will be in terms of the properties of A, it does show that an exactly 
(k,1) mapping on a graph has some of the characteristics of an interior _ 
mapping. For instance, it produces only a slightly more complicated cuse. . 
than. the original set. This is meant only in a relative sense, of course. It is 
known that an at most (3,1) mapping on an arc can increase dimensionality, 
while a (2,1) mapping on an arc can produce a curve containing a continuum 
of convergence. ` (For interior transformations the property of being a graph : 
is preserved). . 
4, Exactly (2,1) transformations on a connected linear graph. The 
results in this case are much more precise, as would be expected, of course. 
‘The underlying reason for this, actually, seems to be that 2.1 can be 
strengthened to read i 
4.1. Iff maps the arc A into the arc B and is at most (2,1) on À, then f ts 
topological on A provided it preserves end-poimis® ,— 


* See [8], Lemma A. 
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As intermediate conclusions to the main results of this section we show - 


4,2. If f ts exactly (2,1) on the graph A, then (i) there exists at most a 
finite number of points x in B—f(A) such that x is the verter of a triod 
in B containing free arcs xy and wz; (ii) all but a finite number of the maximal 
free arcs X in B are such that f*(X) has exactly two inverse components; 
(iii) no point x in B is the vertex of infinitely many free arcs. 

Proof. (i) Since the set E° of end-points and branch points in A is 
finite, f(#°) is a finite set. Hence if there were infinitely many points x in B 
with the asserted property, there would be one, say x, such that szef (E°). Let 


=: T be the enclosure of a region in B containing a triod with x as vertex and 


such that two of the ares of this triod, say zy and zz, are free arcs. It is : 
supposed further that Tf(E°) —0. Set f(r) —a'+2*. The points 2 and 
q? are in the interior of free arcs in A. Let XY, i, j — 1,2 be free arcs in A 
having only the point xt, j = 1,2 in common and such that f(X#) C T. It 
may be supposed that (XY! + X12) (X*1-4 X) ==0. Denote three components 
of T— x by xy, zz and W. Suppose f(xy): X40. Let peft(acy) : Xs 
Then p-<zx. Denote the subarc of Y" from p to z+ by pat. Let the last 
point on pz" in f-tf(p) be p'. Then f is topological on ptz! by 4.1. Since 
f is exactly (2,1), f (sy): X} (say) £0. Similarly, there is an arc pr! 
in X on which f is topological. Hence there must be four arcs p's, pz, 
qa? and gx? on which f is topological and such that f(p'zt + Pr + qi? 
+ qv?) C sy 4- cz. Further, the sum of these four arcs contains an open 

set containing zt + s”. The continuity of f, however, implies that x have at 
least one more inverse, since W has x as a limit point. This denies the (2, 1) 
_ property. The property (iii) follows from an analogous argument. 

e (ii). The maximal free arcs are uniquely determined. Suppose there are 
infinitely many of them. -There is one, call it K, such that each point of 
FF(E) is of Urysohn-Menger order two and contains no topological circle of A. 
Since f is exactly (2, 1), we have by 2. 6 that f*(K) has at most 3 components. 
Since f is exactly (2,1), at least one of these components must be non- 
‘ degenerate. By the choice of K it must be an arc. Let T be such a non- 
degenerate component of f*(K). Since K is a free arc, end-points of T must 
map into end-points of W. Case 1. If the end-points of K are contained in 
the image set of the end-points of T, then by 4.1, f must be topological on T. 
Since f is (1,1) on the set f*(K)—T, which has at most 2 components, 
f*(K) —T consists of a single component J? which maps topologically onto 
K. Case 2, If one end-point of K contains the image set of the end-points 
of T, we. distinguish two casés*a) and b);°according as f(T) contains the 
other end-point of K or not. In the first mentioned possibility it is easy to 
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show that T is the sum of two arcs T* and T? having only a common end-point 
and such that each f(T*) — K. Hence all points of K have two inverses on T 
except one end-point. Thus f*(K)—T' consists of a single point, and 
f> (K) has two components: In the second mentioned possibility, f(T) = 

is a subarc of K. In this case T contains two inverses to all points of K! 
except the end-point of K+ in the interior of K. Set K*—K—K'*. Since 
K? is free, f*(K?) — T° can only have a single non-degenerate component, 
which is seen to be T° itself. Each point in the interior of K* has two inverses 
in T°. The point K!- K? has one inverse in T and one in T°. Thus any arc 
K in B of the type we have described has exactly two inverse components, 
i.e. f*(K) has two components, 


4.3. Let f be exactly (2, 1) on the connected linear graph A. Then B = f(A) 
is a graph. There exists subdivisions of A and B into finite complexes Ka and 
Ex such that the transformation of K1 into Kp ts simplicial. 


Proof. First, B is a graph. To this end an upper semi-continuous 
decomposition of B is effected. From 2.2, there are free arcs in every open 
set in B. Any free arc is contained in either a maximal free arc or in a simple 
closed curve having just one point in common with the rest of B. (If B is 
a simple closed curve, our conclusion is already attained). To each free are T 
in B there is a connected set T containing T which is a sum of such simple 
closed curves (of the type just mentioned) and maximal free arcs and which 
is maximal in this regard. By 4.2 (i), (iii), T contains only a finite number 
of simple closed curves or maximal free arcs. Hence T is a graph: Now the 
elements of the decomposition are to be (1) the. graphs T which contain a 
simple closed curve or two maximal free arcs, (2) the maximal free arcs in B 
(not already in (1)) which have more than two inverse components, afid, 
(3) the points in B not in an element of type (1) or (2). It will be shown 
that there is only one element in this decomposition. If there is only one 
element, clearly it must be of type (1). Next, the above definitions do. give 

‘an upper semi-continuous decomposition. First, the elements are disjoint. 
From 4.2 (i), the elements of type (1) and hence all are closed. Also, by 
4. ?, there are only a finite number of elements of type (1) or (2) so clearly 
this gives an upper semi-continuous decomposition: Let C be an image space 
of a corresponding continuous transformation g defined on B, g(B) =C. 

. Evidently, C can have no continuum of condensation, this C is stably regular. 

If C contains more than a single point, the inverse of a point in C with a 

finite number of exceptions is a ingle point.. Denote this exceptional set by 

ECC. By the manner of selection of the elements of the decomposition, 
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no two maximal free arcs in C can intersect. Also, C can contain no simple 
closed: curve having only one point in common with the rest of C. The curve 


C has only a finite number of end-points since B has only a finite number. : 


(The function g is topological on B — g*(E)). “Hence we are in a position 
. to define another upper semi-continuous decomposition, this one taking place 
on C. The elements of this decomposition are defined to be the maximal free 
arcs in C and the points in no free arc. It is known that this decomposition 


defined on such a curve C gives a hyperspace D containing no free are [3]. 


Setting A(C) = D, D = hgf(A). The continuous transformation of A into 
D can be factored into a monotone transformation f, followed by a light 
transformation fs, where f,(A4) == À! and f,(A*) =D. This factorization 
` may be so accomplished that. the ‘ points’ in At are the components of inverse 
sets to the mapping hgf(A) — D [5]. Since fı is monotone and À is a graph, 
A’ is a graph. The set h(E) contains only a finite number of points. Let 
æ be any point of D— h(E). It is readily seen that either f >g th (x) con- 
sists of exactly two arcs (one of which may be degenerate) or two points in A 
(according as h(x) is an arc or a point), hence f:1(x) consists of exactly 
two points. Thus the transformation fz carrying the graph A? into D is exactly 
(2,1) except for the points of h(E). Since D contains no free arc, there is 
a non-degenerate arc T in D—h(E) such that TC D— T. Now precisely 
- - as in the proof of 2.2 (taking A = A+), this leads to a contradiction. Hence 
C is a single point, i. e. B is a graph. 

It will now be shown that there exist subdivisions of A and B into finite 
complexes Ka and Kn, respectively, such that the transformation of K4 into Ks 
is simplicial. Let F be a finite set in B such that it contains all points of B 
of order =~ 2 and such that each component of B—Æ has an enclosure which 
is an arc uniquely determined by its end-points. Set H°—f1(Æ). Add to 
E° a finite set #—f1f(F) such that each component of A — (E° + F) is 
an open free arc (whose enclosure is an arc) uniquely determined by its end- 
points. Consider any component U of B— (E -+f(F)). Since each non- 
degenerate component of f*(U) contains only points of order Æ 2 and contains 
no simple closed curve, the reasoning in the proof of 4.2 (ii) can be applied, 
hence f-:(Ü). has two components. If Ü —K gives rise to Case 1, f1(Ù) 
consists of two disjoint? arcs which are mapped topologically onto U. By 
definition of F and F, these arcs are edges of the complex introduced into A 
by the points F + E°. If U gives Case 2a, f*(U) has a single non-degenerate 
inverse component which is the sum of two arcs, each mapping topologically 


` Jw Bee an abstract by W. T. Buckett and È Watson, Bulletin of the American 
Hathematioal Society, 43-3-182. 
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onto U. Again these are already edges of the complex on A. If Ọ gives 
Case 2b, the are Ü is subdivided by the insertion of a vertex at the point 
K*- K*. Denoting the augmented set of vertices in B by G, each component 
of B— G is such that its inverse under f in A is precisely two open arcs each 
mapping topologically onto the component in B— G. Denote the complex 
induced on B by G by Kx and-the complex induced on A by f7(G@) by Ka, 
then the transformation f carries Ba of K4 into edges of Kz and in topo- 
logical fashion. . 


4.4. Let f be exactly. (2, oe on PA connectéd linear graph À. For each 
point zeB = f(A) the relation o(z) = 1/2 [o(#*) + o(x°)] holds, where 
f(a) = gh Haut 


Proof. Suppose À and B have been subdivided into the finite re 
Kı and Kz of the last paragraph. Since f is exactly (2,1), each simplex in 
K3 is the topological image of two and only. two of the simplexes of K4. The 
asserted relation is a direct result of enumeration. - . 
This formula shows that any exactly (2,1) mapping defined on an are 
(or circle) can contain no point of order three, hence, after showing that the 
- are and circle cannot be exactly (2,1) images of:an arc, we have another 
proof that it is impossible to define an exactly (2,1) continuous transforma- 
tion on an are [8]. This relation also shows that any exactly (2,1) image 
of a simple closed curve is a simple closed curve. By fitting together two 
simple arcs to form a simple closed curve and defining mappings similar to 
those of type (a) and (£), it is easy to show that an exactly (k,1) mapping 
on a simple closed curve nged not give a simple closed curve for k > 2. Since 
this relation also implies that an exactly (2,1) transformation cannot be 
defined on any dendritic graph," it is of interest to give all of the possibilities 
(topologically) when A is a simple closed ctirve. ` 


4.5. Let f be exactly (2,1) on the simple closed curve A. Then B= f(A) 
is a simple closed curve and f is topologically equivalent to either (a) w = z? 
on |z|==1, or (b) w= 2 on | z | = 1 for A(z) = 0, and i = 2 on [ea] 


for A(2) & 0. 


Proof. The transformation f(A) = B is said te be topologically aari 
lent to g(A*) == Bt provided there exists a.pair of homeomorphisms h and h* 
such that h (4A) == 41 and h (B+) = B and f=high (or g= (M+) fh) [6]. 


1 A, D. Wallace pointed out that this implies 24 (B) = y (A}, where y is the Euler 
-characteristic. Hence if À is a den@ritic graph, ne exactly (2,1) transformation can 
“be defined on A. This result has been announced elsewhere. See.P. W. Gilbert, abstract 
45-11-420, Bulletin of the American Mathemattoal Society. 
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Two cases are distinguished according as f is interior or not. If f is interior 
on the simple closed curve A and is (2,1), and, if it is known further that 
the image is a simple closed curve, then f is topologically equivalent to w — 2? 
on |-z |== 1 [1]. If f is not interior on A, there is a point 2*« A and an open 
set U D zt such that z = f(z) is a boundary point of f(U). Let z? be the 
other inverse of z. Let h be a homeomorphism carrying À into the unit circle 
A? in the complex z plane such that A(2*) = + 1 and h(x) =— 1. Let 
h be a homeomorphism carrying the unit circle Bt of the complex w plane 
into B such that h? (1) =. Since f is not interior at zt (or æ*), it follows 
that each of the ares of A determined by z* and 2? map onto the whole of B 
under f. Denote by C and D the semi-circles of At, à (2) 20 and A (z) £ 0, 
respectively. Set g(z) == (h*)“fh*(z). Suppose as C is described from right 
to left that B* is described by g(+) in counterclockwise fashion. Then on 
A*+(C) the function f is topologically equivalent to w= 2? on |z2|[—1, 
A(z) 20. Since z= f(x) is a boundary point of the transform of some 
- open set containing 7t in A, B! must be described in opposite fashion as D is 
described from left to right. Hence on 4*(D) the function f is topologically 
equivalent to # — 2? on |z| =1, A(z) S0. 
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THE CHARACTERIZATION OF PSEUDO-SPHERICAL SETS.* + 


By Leonard M. BLUMENTHAL and GEORGE R. THURMAN. 





1. Introduction. We give in this paper the solution of a fundamental 
problem in the distance geometry of the n-dimensional sphere (surface) 
proposed some years ago by Karl Menger. 

Defining a semimetric space as a set of abstract elements (points), to 
each pair p,q of which there is attached a non-negative real number pg 
(distance) such that pg = qp and pq =Q if and only if p — q, the problem 
of characterizing metrically (i. e., in terms of the distance function) particular 
semimettic spaces among the whole class of such spaces naturally arises. For 
some of the more important spaces (e.g., euclidean, hyperbolic, spherical) 
the existence of a function mapping an arbitrary semimetric space congruently 
(i.e., with preservation of distances) upon the space follows from the con- 
gruent embedding in the space of each set of k points of the semimetric space. 
A space with this property is said to have congruence order k with respect to 
semimetric spaces.? It has been shown that the n-dimensional euclidean, 
hyperbolic, and spherical spaces have (minimum) congruence order n -+ 3, 
while, on the other hand, Hilbert space does not have any (finite) congruence . 
order with respect to semimetric spaces. ` 

The problem of determining necessary and sufficient conditions for the 
congruent embedding of any semimetric space in a given space is thus reducible 
to a “finite” problem in the case of those spaces possessing a congruence 
order; for if g has congruence order # with respect to semimetric spaces, then 
any such space is congruent with a subset of S provided each set of k points 
of the space is congruently embeddable in 8. Now the class of semimetric 
spaces with minimum congruence order m + 1 contains a subclass (spaces with 
quasi congruence order m) for each member of which a further reduction in the 
characterization problem is possible. A space S has quasi congruence order m 
with respect to semimetric spaces provided any semimetric space containing 
more than m + 1 ‘points is congruent with a subset of § whenever each m-tuple 


* Received September 30, 1939. . 

1 Presented to the Society, December 28, 1938. A brief summary of results appeared 
in the Proceedings of the National Academy of Soiences, vol. 24 (1938), pp. 557-558. 

#n this paper the class of comparison spaces is invariably the class of semimetric 
spaces, and hence the phrase “ with respect to semimetric spaces ” is.frequently. omitted. 
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of its points is congruently embeddablé in §. The n-dimensional euclidean 
and hyperbolic spaces belong to such a subclass of the class of semimetric 
spaces having minimum congruence order n + 3, since each of these spaces has 
quasi congruence order n+ 2. On the other hand, the n-dimensional spherical 
space a,r (the “ surface ” of a sphere of radius r in a euclidean space of n + 1 
~ dimensions, with geodesic (shorter arc) distance), though it has, as remarked 
above, minimum congruence order n -+- 3, is not a member of this subclass since 
. the 8,,, does not have quasi congruence order n + 2. 

“That this is the case is immediately verified upon noting that the Spr 
contains an equilateral set of n + 2 points (i.e., a set of n+ 2 points with 
all of the $(n+1)(n-+2) mutual distances equal) but does not contain an 
equilateral (n + 3)-tuple. Hence, if P is a space of arbitrary power exceeding 
n + 3, such that pg =r: cos (— 1/(n + 1)), (the “side” of an equilateral 
.(n + 2)-tuple of Sy,-) for p =q, and pq —0 when p= q, (p,qeP), then 
P is a semimetric space, containing more than n + 3 points, which is not 
congruent with a subset of S,,, though each set of n + 2 points of P is con- 
gruent with n + 2 points of S,,,. A semimetric space which is not congruent 
with a subset of the S,,-, though each n + 2 of its points may be embedded 
congruently in the Sar, is called a pseudo-Sn,- set. As illustrated by the set P 
defined above, pseudo-S,, sets may be of arbitrary power exceeding n +- 2. 
This is in marked contrast to the analogous pseudo-euclidean sets, for a 
pseudo-#, set is restricted to consist of exactly n + 3 points, due to the quasi 
congruence order n + 2 property of the Ey. The metric structure of pseudo-F, 
sets is readily described.’ 

The characterization of pseudo-S,,, sets was Pore by Menger in 1931.4 
The equilateral set P is a pseudo-S,,, set, but are all pseudo-S,,, sets of this 
simple structure? The principal result of this paper permits us to say that 
if a pseudo-Sa,r set contains more than n+ 3 points, and no two of the points 
have a distance equal to d = mr, then this query ts to ‘be answered essentially 
in the affirmative. ‘The meaning of;the qualification of the above statement 
given by the word “ essentially ” will become clear later. 

Pseudo-S,,r sets of exactly n + 3 points— it is proved (Theorem 6) that 
no diametral pair of points (i. e., two points with distance d) can occur in such 

a set — have a more vawied structure. These sets are described by use of the 

spherical analogue of the isogonal conjugate transformation of the plane. The 
‘case n ==2 of the ordinary sphere illustrates all of the essential features. 


>See L. M. Blumenthal, “ Distance pone University of. Missouri Studies, 
vol. 13 (1938), pp. 63-64. e 

“For n= 1 the term pseudo d-cyclic is ee Concerning the characterization of 
pseudo d-cyclic and pseudo-S,, sets see .“ Distance geometries,” pp. 74-81. 


.THE CHARACTERIZATION OF PSEUDO-SPHERICAL SETS. 837 


It is easily seen that a semimetric set of five points Pis Po, *** , Ps forms a 
pseudo-8,,, quintuple if and only if the sphere contains five points 81, S2,°** , Ss 
such that (1) ss is equidistant (with distance R£ss;) from the three 
reflected images s4/, s41, s of the point s, in the great circles O (sz, 83), 
C (81, 83), O (81, $2), respectively, determined by the independent (i. e., not on 
a great circle). points 51,82, Se and (2) the mutual distances of the points 
Pas Pa; ©: ,ps equal the corresponding distances of the points 81, 82,° - -,8s 
except for the distance pips, which equals R instead of s,s,.5 Thus, the four 
distances 71s, PoPs, Pas, and Paps == R are functions of the six mutual dis- 
tances of the four points Pı, pe, Ps, Ppa- The actual expressions for these four 
distances have not been computed (nor has the analogous computation for 
pseudo-plane sets been made).- One should note here a complication that arises 
in the spherical analogue of the isogonal conjugate transformation which is 
not present in the plane. A point sẹ of the sz,- is not uniquely determined by 
being equidistant from the three points sy’, 5411, s, — a pair of diametral 
points satisfies this condition. Thus, the transformation on the sphere is not 
one-to-one, as it is (apart from certain exceptional points) in the plane, but 
is one-to-two, or rather, two-to-two, since a diametral pair is transformed into 
a diametral pair. . 

The process sketched for pseudo-8,, quintuples is representative of the 
procedure for pseudo-S,,- (n + 3)-tuples. The determination of their mètric 
structure is, then, so closely related to the analogous problem for pseudo-En 
(n + 8)-tuples as to present nothing essentially new. On the other hand, 
one may surely expect quite different results when pseudo-S,,, sets of more 
than n + 3 points are considered, for their euclidean analogues (pseudo-F, 
sets of more than n + 3 points) do not exist. Furthermore, the complication 
due to the one-to-two character of the transformation described above makes 
itself felt only for pseudo-S,,,, sets of more than n + 3 points. Such sets may 
contain diametral points unless the contrary is explicitly assumed. 


` 2. Basic and derived properties of the Sa, and its subspaces. Many 
properties of the S,,- and its k-dimensional subspaces Skr, OS k= n, (the 
sections of the Sar made by (k + 1)-dimensional hyperplanes through the 
center of the sphere in En whose “surface” is the $,,-) are needed for the 
development of this paper. Looking towards the “ abstraction ” of our prob- 
lem made in Section 4, we isolate here the five basic properties of the Snr that 


5 Throughout this paper the poigts of pseudo-, , sets are denoted by the letters 
p and q, while the points of 8,, are symbolized by the letters 8 and t. 
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suffice to demonstrate all the additional properties of the Sa,r that we need for 
this investigation.® 


I. The determinant 


Anse (815 825° Lao > 8ns2) LE | cos (s:8;/7) |; ; 
(i, j = 1, 2,- ` s n+ 2), 


vanishes for each set-of n + 2 points 81, 52,° * *, Sara Of Snr 


II. There exists at least one set of n + 1 points of Sar whose determi- 
nant Any does not vanish. 


Ill. Each finite subset of Snr has a non-negative determinant A. 


Remark. It is easy to show that the dependence (independence) of a 
finite set s1, 82, : : ,s& of k points of 9,, is equivalent to the vanishing (non- 
vanishing) of the determinant As of the k points. We call k points of a 
semimetric space independent (dependent) if they are congruent with k 
independent (dependent) points of S,,r. 


IV. If 81, 82,° * +, Seer Z tuta’ * +, teas? are two congruent sets of 
k + 1 points (not necessarily pairwise distinct) of two (coincident or distinct) 
k-dimensional subspaces of Snr, kn, then to each point s of the subspaca 
containing 81, 825" °°, Skri there corresponds at least one point t of the sub- 
space containing ist" °°, trn such that - 


81, 82,° ° ‘> Skis 8 ty, te, ° $ +, bear, de 


V. If Susa, 8, are k independent points of Sr, (k =1,2,---,1), 
then corresponding to each point s of Sir independent of 8,82, ° +, 8, (1e. 
Aky (81, 82° = , 8%, 8) does not varnish) ,-there is at least one point ¥ of Ser 
such that 3 5&8 and 81,82," * * 3 8%, $ © 81, 82," * *, 82, #. | 


We list now for convenience the derived’ properties of the Su, to which 
reference will be made in the next section: 


(a). A semimetrig set of k + 2 points is congruent with k + 2 points 
of Sur, k n, if and only if each k +1 of the points are congruent with 
k + 1 points of the S+,- nnd the determinant Ax.: of the k + 2 points vanishes. 


SThe derivation of these additional properties from the five basic ones follows 
closely the methods of an earlier paper (L. M. RJumenthal, “The geometry of a class 
of semimetrie spaces,” Tohoku Mathematical Journal, vol. 43, (1937), pp. 206-224). 

“This notation signifies that 8,8, = tity (i,1=1,2,. ..,k +1). 
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(b). A semimetric set of k + 3 points is congruent with k -+ 3 points - 
of Skr, k<n, if and only if each k + 2 of the points are congruent with. 
k + 2 points of the 84, and the determinant Azs of the k + 3 points vanishes.® 


(e). If 81,83," *", Skin Zh, ta, © *, tu are two congruent sets of k +1 
independent points of two (coincident or distinct) k-dimensional subspaces, 
k Sn, and if s,s aro points of the subspace containing s1, 82, * - , Sr While 
t, ť are points of the subspace containing tı, t2,' +, tis, such that 


81) 82,° °° 5 Sky, S © th, ta, tty testy t, 
z 81, 8a * > Seer, © tay da” 3 * bea Ÿ, 
then ss’ = tt. 
Remark 1. Tf 81,83," * +, Star © listes" * * , eu are two congruent sets 


of k + 1 independent points of two (coincident or distinct) k-dimensional 
subspaces, k= n, then to each point s of the subspace containing the first 
set of k -+ 1 points there corresponds exactly one point ¢ of the subspace 
containing the second set of k -+ 1 points such that 


Si, Sa" ° "y Skit, ST lie, £ -s Ér É 


` Remark 2. There is at most one point of 84 with prescribed distances 
from k -+ 1 independent points of Szr, k= n. 


(d). Any subset of an independent set of points is an independent set 
of points. | 

(e). If 8, 82,° ++, & are k independent points of Skr, (k =1,2, n), 
then corresponding to each point s of 8x, which is independent of them there 
is exactly one point # of Srr such that s 4s and ` 


81 823" °° Sky S © Su Say? * à hs Se ba 


(£). Let 1, 82," °° , 3+ be k independent points of Sir, (k = 1, 2, son), 
and let s,s and t,t be two pairs of points of Se, such that ¢ is either 
dependent on 81, 82° ` *,8 or distinct from &, and ? is either dependent on 
815 S2,° * *, Se OF distinct from #, while 


$1, 89," ' ' 5 8k, 8 © 81, 82,7 ° LL 
81) Sas" °° Sky 8 TT Si, 825 E Le 


Then 85 = tř. 


8 Properties (a), (b) are well-known theorems in the distance geometry of the 
Sar (see “ Distance, geometries,” p. 73). That they may be obtained by using merely 
Properties I-V has not been recorded ‘heretofore. * 

° The point s has a single image s’ £8 when “reflected” in the (k—1)-dimen- 
sional subspace determined by the k independent points 8,,8,,- - -, Sy 
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-© (g). Let 81, 83,- > Sn be k+1 independent points of Sir, k= n, 
and let s be a point of Suse not common to any two of the k + 1 subspaces” 
. Szar determined by the k points 


81, 825 * es Bars Stats > 8h41) (t==1,2,° i sk +1). 


‘Denote by s a point of Snr such that 


81, 82» ves s Si- g(t), Siti)’ ‘> Skat = Sis Sas © 5 Bis 8, Stet,’ * * 5 Sr 
| (t= 1,2, --,k+1), 
where 3) 48 if s is independent of 31, 82° °°, 844 Sin,’ °°, Seu Then 


there are not two independent points #,# of Sx, such that 
8/302) gs. + + =o’) and pa — pen mes > vm Pg), 


Remark. There are at most two points of Sz, satisfying the conditions 
of property (g). 

(h). The an k Sn, has minimum congruence order k + 3.1° 

(i). If s, $2, eu are k + 1 independent points of S:,, k S n, 
and s is a point of Sz, such that for each integer 4, (1—1,2,: > <, k) the 
points 81, 82," "‘;84-1, 8, Sisu”. ‘> Sky Ska re dependent, then the points S; Sky 
are dependent. | 

(j). Ifs,t,u are any three distinct points of Ser, k<n, such that 
st — tu — su, and if 8,82, * `, S% are k points of S>, such that 


818 == sit == Siti, (i=1,2,: ? ‘” k), 
then the points 81, 8,° ` `, Sk are dependent. 


(k). Let 8,¢ be any two distinct points of Ser, (k =1,2,: ::,n), 
‘and let 8:,8,°°* *,84 be k independent points of Sk,- such that- sys == sit, 
(i= 1,2,:--,k). The (k— 1)-dimensional subspace Sx-ı,r determined by 
815 82, * * , 8x is the locus of points of Si, equidistant from s and t. 


(1). Two distinct k- dimensional subspaces Sk,r, k< “Sn, can have at most 
k independent points inecommon. 


3. The characterization of pseudo-S;, sets, k<n. We deduce first 
some preliminary theorems concerning pseudo-S;,, sets, kn, that point the 
way to the desired characterization theorems. 

e , 


10 “ Distance geometries,” p. 78., 
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THEOREM 1. A pseudo-Sy,- (k + 3)-tuple pr, Pos’ ` `, Pers contains at 
least one independent set of k + 1 points. 


Proof. Since. pi, Pa, * * , fs is à pseudo-S4, set, each (k -+ 2)-tuple — 
contained in these points is congruent with k + 2 points of Spr. It follows 
(property (a)) that the determinant Az: of each (k + 2)-tuple vanishes. 
Suppose, now, that each set of k + 1 of the k + 3 points is a dependent set. 
Then the determinant Amis (P1, D2)‘ ** , Pare) has all principal minors of orders 
k + 1 and k + 2 equal to zero, and consequently vanishes. It follows (property 
(b)) that the &-+ 8 points are congruent with a subset of the Sz,r, which 
contradicts the hypothesis that they form a pseudo-S;, set, and establishes 
the theorem. 


THEOREM 2. The determinant Ars(pi, Pa ‘+; Pes) of a pseudo-Sir 
(k + 3)-tuple is negative. 


Proof. As seen in the proof of Theorem 1, the determinant 


Axis (Pi Pos’ * t; Disa) 


does not vanish. Let the points be so labelled that p,, Pay" * * > Pen is an 
independent (k + 1)-tuple. Since 


| Agen (Pas Pas * 5 Prz) = 0, 
we have 1? | 
— [k + 2,k +37 


Asis (Po Pa °° > Peas) = Arn (Diy Pas” * * 3 Past) : 


where [k + 2, k + 3] denotes the co-factor of the element in the (k + 2)-nd 
row and (k—+3)-rd column of Axis(Pis Pos °°, Pes). The poifts 
Pi, Pas © °> Pex being independent and congruent to k+1 points of Sxr 
implies Ara(Pu Pa’ *, Pen) > 0, (property III), and the theorem is 
proved. - 


THEOREM 3. If py, Po, * * , Piss form a pseudo-S;,- set, with the points 


41 We shall suppose the index k to assume the values k 2 0, 1,2,. - -, n except when 
it is stated otherwise. 

21 The expansion of a (k + 3)-rd order symmetric determinant 
k42 k+2 k+2k+3 
kK4+3,44+2 k+3k+3 
(a special case of Jacobi’s theoremẹ, is particully useful whenever, as frequently 
happens in this paper, one of the co-factors [k + 2, k + 21, [k + 3, k + 3] vanishes. 


10 


|e |= (Ik +2, k +2]. [k +3,% +3) Th +2 k +81] 
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Pis Pas 5 Pam independent, then the Sr r Mains k + 8 points 81, 82, °°", Sina 
such that . 

Po Po’ * * 5 Ben Pira © 81, 83° “à Skit) Sde2, 

Pry Poy” * * 5 Peris Pias © 81, Sip” * * p Shely Ska) 


ANd PrraPirs 7E Skra8irs- 


Proof. Since py, Ps," `, Puss form a pseudo-Ss,. set, there exist two sets 
81, 82," © “y Stary Stag ON G tat * * , teas, fers Of k -+ 2 points of Ser such that 


Pi Pz’ : y Pass Paie © S1, 823 ° ae s Sktl 8x+2 
Pis Pas * * > Peery Derg © tay ba,’ * + besa, us. 


The points pı, Po,” ", Pes being independent, then {s;} and {ti}, (t==1,2,---, 
k+ 1), are two congruent sets of k+ 1 independent points of Sy, and 
hence (Remark 1, property (c)) the Sur contains exactly one point Skis ` 
such that 

ty, tac t t, bess} Des © 815 825° °° > Si Sas 
Then we have 


Pas Pay” * * y Pris Phra © 81, 82, ° * * 5 Sear, Stes, 


and since the k + 3 points P, p2,--*, Pra form a pseudo-S,, set, the dis- 
tance PirsPers does not equal the distance sxsSeis. 

© The k + 8 points 81,8», ° °, Sks of Sær are said to be “almost 
congruent” to the pseudo-Sx,r set 1, Pas * * * s Prase 


Taxorem 4. Let Dry Pay’ °°» Pass form a pseudo-Ss,, (k + 8)- ua with 
the independent (k + 1)-tuple Pu pa, °°, Den. Then the k +41 points 
Pas Day’ ‘> Base are independent. ` 


_ © Proof. Let sı, 82,- +, Exs be k +3 points of ue almost net to ` 
Pas Pas "Ps; Le, 
Pi, Pas * ` p Peu Pria © 81; 825" * * 5 Skats Ska, 


Pis Pay” ° ‘> Piris Pres = Sis S237 ° 7 5 Shit, Sera, 


and PessPhes FE Ski28er8 Since each & + 2 points of the set pi, Pa,” °°, Pris 
are congruent with k +,2 points of S:,r, we have 


Pas Pas" © * 5 Diez, Deis © te, bey * "y Urey bees, 
and hence 
ta, ts, LR testy tras = 82, Sa» °° 5 Sily re) 


with the points 8:,8:,° - *, Skp being indeSendent since they belong to the 
independent (k + 1)-tuple s1, 82," : *, Se (property (d)). 
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Property IV applied to the congruence 
A EEE er 
entails the existence of two points s,s’ of Sz, such that 


ta, ts, SRE Eu tea, ters © Sa, 83,° °°, 8415 8, #. 
Then | 


Say 8857 °°» Skit, 8 D late, t t ? İkin De © S2 83,° °°, Sk+1) Sk+2+ 


Suppose, now, that the points po, Pa; * `, Pri, Desa are dependent. Then 
8,83," * * > Be, Sa are dependent, and applying Remark 2, property (c) to 
the Sk-ı,r determined by the k independent points s2, 3,- * *, Si, the above 
congruence gives 8 == +, and hence tz, la," - <, Ér bere, fre are congruent 


with 82,88," * *, Ska, 842, S in the usual order. 
Now ' 
825 S87 * Ty Skity Skis ~ Paos Pas’ © > Pirrs Pias X ts, tes > Éran Class 


which gives Ss, Sas" °°» Sri Stun FS Say Sas * “y Sooty Se TE Seas is not dependent 
OD. 82) 88" °°, 81, then by property (e) the point # may be distinct from 
8ms. But then the last congruence together with the congruence 


"Say 83° °° Stars Sera Say Say ° °°» Siriy Skea 
shows that Sp.cdk+s == 82,8, according to property (f). If, on the other hand, 
Sig 18 dependent on 82, 83,° * *, Ss, then the congruence 
82, 8a," * "> ke Stes FZ 82,83," °° Serr, 8 


evidently implies s’ = 845, ANd S:128%18 == Sees as before. Hence, in any case, 


Seaska = Sue = bios = Pk+2Dh+3, 


+ 


which gives the desired contradiction and establishes the theorem. 


Remark. It is clear that the method of the preceding theorem may be 
used to show that each of the (k + 1)-tuples 


Pis Pas 5 Dt-ty Piris "> Piris Pass; Po Pass Pio Piris’? > Derry Pass, 
(i =1,2, -k +1), 
1s independent. . 


THEOoRR™M 5. Let Diy Pos’ O , Pra, Prans Form a pseudo-Sy, set with the 
independent (k -+-1)-tuples pi, Pa,” * o Pry Prans 
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Po Pay" > Pass Dies" > Pasty Phra) Po Pa't Pass Pia" Prei Diss, 
(i = 1,2,- k+ 1). 


Then the (k + 1)-tuple Ps, Pa’ ``, Dress Pris 18 independent. 


, Proof. The proof of this theorem follows the lines of the proof of the pre- 

` ceding theorem, with the independent (k + 1)-tuple Pi, Da, Pas **" > Piris Pres 
in the rôle of the (k + 1)-tuple pı, p2,- * +, Deu. Thus; the Se, contains 
k + 8 points Sı, 8,° * *, Sere, 3x3 such that — 


Pi Pes Pe "ty Deets Parks Pias = Sis 83, 84, ° | T 5 Skit, Shay Skis, 
Pry Pas Pas © * > Piris Peas P2 ~ Sis Sas S4y° * * y Sri Ska, 82, 


with poPx.s > 825¢42, and the same procedure as in Theorem 4 leads to the 
desired result. | | 


Remark. In a similar manner, tt is seen that the (k + 1)-tuples 


Pry Pas > Pias Pisis? °° s Pas Pins "5 Piris Pass Pts, 
(i j = 1,2,- Eee Ls ij), ' 
are all independent (k + 1)-tuples. 
We have thus proved the following useful theorem : 


THEOREM 6. If Pi Pss’ >`, Pass Pras form a pseudo-Sz, set then each 
of the 4(k +2) (k +3) sets of k-+1 points contained in this (k + 3)-tuple 
is an independent set. | 


Let 1, Pas": , Pias form a pseudo-83,. (k + 8)-tuple, and let 81, 82, °°° , Sas 
be k + 8 points of Sx,- almost congruent to them. Consider, now, the points 


e : Pis Pas” ce, Pi Pins" © * 5 Pias (i= 1,2; -k +1). 


The Sx r contains k + 2 points ti, ta, ++, ta, fis +, 8 such that for each 
t= 1,2,- í ,k+1, 


Pi, Pas” s Dis Dias" “y Plus lite, “> » thay bis” +s Ube. 
Then l 


S137 ° * y Sieis Sts1s > Skis Size 
is congruent with the points | 
oo iala isiu et; tes tae 
with each of itis sets of k + 1 points 
. 


81, 8a,° °°, S41, Stet,” ” ` a Skri Ska, | (= 1,8,: g ,k+1), i 
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` being independent (Theorem 6). Hence, by ae 1, property (c), Hie Irr 
contains exactly one point s() such that 


Sas l t y Story Siris © * 5 Skee, som ~ th, rit tia, tea pene ty tesa, Tess, 
(i= 1,2,- k+ 1), 
and hence 
Pou’ os Pt-ty Phi’ * * > Pin Prs X 8y; St-1y Sir © T * y Skay ay > 
; | (t=12,:":,k+1). 
Since 
Sp» t > 84-15 Stary" * > Skeets so © Put"; Pis Pus tot "5 Piris Plus, 


FZ 8157 * “5 84-15 Stary” © ` y Shery Skany 


the point s(# has the same distances from the points 

815 82 WA s 84-15 81415 ts Sk 
as the point Srs, but s Ade = Ss for 
S28 w = Dine Dkrs FE Sk+28k+8. 
We have | | 
f (1) mes S8po8 (D) aes + + mm 8p 9g (FH) 

RE Se) 2 

for each of these distances equals prefers. Now a point r+: is not determined 


uniquely by these inequalities, but by property (g) and its accompanying 
Remark there is at most one other point 8"i42 equidistant from the k-+1, 


points 8), 8{2),°- » 8730) and Sp+28% 2 = d. 
In 2 kal Sher it is seen that the Skr contains k y 1 points 3%), 
{2} . . (&+1) 
Bkr oe such that | 
s ° 
Pis "9" 5 Piei Piris’ © © y Pres Pra = Sig’ $ y Sgety Sirig” © T y Skis sue 


(t= 1, 2,- É *,k+1), 


with each of the distances ss, (i= 1,2,::-,k—+ 1), equal to PrsaDisae 
Thus ses is equidistant from the k + 1 points 80), 80,- + +, S9, and there 
are at most two points (diametral) satisfying this requirement. | 


THEOREM 7. Let P= (pi, Da" : +) Pra) and Q= (qu qot © * > Tess) 
be two pseudo-Si, (k + 3)-tuples such that 


Pas Po’ * y Pee ~ G15 Jar" °°» Gire 


Then either pipers = Geis, (i= 1,2,- - : ,& +2), and the two (k +38)- 
tuples are congruent, or f 
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cos (papass/r) + 608 (q6Ge1a/1') = 0, (1,2, +, k 42) 
p Proof. From the hypothesis of the theorem we may write the following 
congruences : | 
Pis Pas > Piri Perz © Sis 823° > Sets Ses 


Pis Past * y Piris Pias ~ 81, So, © T y Sears Skis, 
Qas Q23 * ‘> kris Tera FZ 815 82,° 7 ‘3 Stats Shea 
qis qa> NF kis dks = Siy 82; ENS Skis Us, 


where the points on the right-hand side of these congruences are in 8. and 


Parois 7E Sk128k+8, kraka m Skratka. 
_ From the first two congruences follow, as has been seen, the existence of 
points a(t), (t—1,2,--+,4-+1), of Skr such that 
PeraPisa = 8 Dee = Sos mE Bee 
Similarly, from the last two congruences, we may write 


= g (1) me: g (2) mm’ + = g (Kt) 
Qk+2Qk+s Ste 8 bees re bise 


It follows (property (g)) that either seg = tis OF Strt = d. In the first 
case, clearly pipes = igis, (t= 1,2,-. -,k +2), and PÆQ. In the 
second case, Sir, and tz being diametral implies (property III) that the 
determinant As (Si, 8x8, firs) == 0, and hence (upon expanding) 


COB (848x18/7) + cos (8ites/r) == 0, (é=1,9,-..,k+1). 
Then, by the soie congruences; | 
COS (PiPurs/T) F C08 (Gsqars/1) = 0, (t= 1,2,-+°+,k-+ 1), 
Finally, sss == d implies that As (s2), Skra, tesa) vanishes; i. e., | 


cos (82)84,3/7) + cos (8 tess/1) = 0, 


Then 
COS (PirsPese/T) + COB (Guizgers/T) = 0, 


and the theorem is proved. 


Ha 
Lemma. If a pseudo-Ss, set P of k + 4 points Pi, Pas” °°, Pers, has no 
. pair of its points diametral, then the set contains at least three pseudo-Sir 
(k + 3)-tuples. | 


13 With a view to the “abstraftion” of the Problem treated in Section 4, we write 
cos (p,P,,,/F) + co8(4,93,./r) =0 rather than pp, + 9,%,, = 4. 
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Proof. Since, by property (h), the Sx, has congruence order k- 3, 
the set P contains at least one pseudo-&, (k + 3)-tuple. The labelling may 
be assumed so that pi, p2,°--, Pass is a pseudo-Sr set. In case P does not 
contain at least: two (k + 3)-tuples congruent with k + 8 points of Sr, the 
lemma is surely valid. In the contrary case, let 1, De, : °°, Dest, Pass, Peu and 
Pis Pos’ ‘5 Prr Piss, Pera be congruent with two (k + 3)-tuples of eats 
Since pı, Pas’ ` *, Piss is a pseudo-Sy, set, we have 


Pr» Pz’ er Peu Piva © 81, 82," © T y Neris Ska, 
Pry Pz TONG Preis Pias ~ Si 82, AE g Sat, Skta 


with PriPkis FE Ski2sk+sy ànd it follows that 


(I) Pip Pas * > Piris Piens Disa ~ $1, 825° * "5 CRP Skta Sk+4, 
Pis Pas °° 5 Pass Pkass Pira © 815 Soy" © * y Seriy Sass Skry 


and the point sx, of Sz, is uniquely determined since its distances from the 
k + 1 independent points 81, 8,:-*, Sea Of Sz, are fixed. Now, by hypothesis, 
each pair of points of P is independent (i. e., no two points of P are coincident 
or diametral) and hence the points Sw, and 8, (¢—=1,2,---,%+8), are 
independent. It follows that at least two of the k + 1 sets 


Sis 82, ° * “> Stay Stay” °°» Shots Stray (t= 1, 2,: ` *,k+1), 


are independent (k + 1)-tuples, for in the contrary case, the k dependent 
(k + 1)-tuples have, in addition to the point &,, one of the points 81, 82,°*", 8m1 
in common. This -point is then, by property (4), diametral to (or coincident 
with) the point 8,4, in contradiction to the preceding remark. 

Let, then, 82, 83,° * * ; Suns Ska ANG 81, 83,°-* *; Sky Stra be independent 
(k +-1)-tuples. We show that the two (k + 3)-tuples ° 


Pas Psy * `, Piris Prras Pass Pire 
and ‘ 


Pis Pas To Pers, Prins Pts Pru 


are pseudo-S;, sets. (A similar procedure gives the desired result in case two 
other (k +-1)-tuples are independent.) - e . 

We make the assumption that Ps, Ps, * * , Desi, Piz) Detar Deve ATO CONgruent 
with k + 3 points of Sir and show that this assumption leads to a contra- 
diction. (The other (k + 3)-tuple is treated in the same manner.) 

Suppose, then, that 

e 


Des Pay’ ‘; Pins Pier Press Pis = te, te pa es dias tesa tess, tase, | 
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of Sir. : Then (using congruences (1)) it follows that 


Sas 883° °," à Skat, Ska) Ska — te, ts, OT tert) rss bas 


82, 533° °° 5 Skeet, Ske, Skri = te, ts, Sy tina) teas, ae 


with 52, 59)* * -,Sku,6ma an independent (k -+ 1)-tuple. It follows that 
Skrat = ralas = Pass (the first equality resulting from property (c)) 
gives the desired contradiction, and proves that the (k + 3)-tuple 


Das Dar" °° > Piris Pera, Pass, Divas 
is a pseudo-$}, set. 


Turorsm 8. If a pseudo-S,. (k + 4)-tuple P has no pair of tts points 
diametral, then each (k + 3)-tuple contained in P is a pseudo-Sx,r set. 


Proof. From the Lemma, P contains at least three reas r (k + 3)- 
tuples, say 
` Po Das y Pres Pien Pies) Pry Pas * * s Pris Pin, Pk; 
Pis Peas ` * > Pieri; Pars, Pree 


Applying Theorem 7 to the first and second of these pseudo-S,,, sets, and 
then to the first and, third of these sets, we see that the distances determined 
by the points of P satisfy one of the following four sets of relations: 


Case I. Piro Pkra = Phra Pers = Pers Per4, i 
f Pipers = PiPrs = Pi Pero (= 2,-::,k+1). 


Case II. 008 (PraPria/T) = — 008 (PessPars/T) = — cos (PrroDirs/T), 
COS (PiPris/T) == CO8 (PsPers/T) = — C08 (Papena r), | 
Gre Ges Ed). 


' Case III. cos a = — 008 (Pris Prr/T) == CO8 (Prisfins/T), 
| COB (Pipire/T) = — COB (PiPris/T) == — CO8 (Pifers/T); 
(i=1,2,: à ,k +1). 


Case IV. cos FT Tes — cos (Prsa Praf) == — CO8 (PrroPrrs/T); 
| COB (Pi Pera/) = — COS (pipmis/T) = CO8 (PePese/t), 
° (i= 1,2, > -,k +1). 


To show, for example, that fe, Da, * * *, Piaty Prins Piers Pira is a pseudo-Si,r 
(k -+ 3)-tuple, assume the contrary and let 82, 8s,' * *, Skry Skin, Stray Skrg De - 
k + 8 points of Ser congruent to them. Then the distances determined by 
these k + 3 points of the Sy, satisfy one o$ t the above four sets of relations 
(with the index i taking on the values 2, 3o‘ k 41). 
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Case I. Applying property (j) to 8,83,° - +, St, it is seen that 
82,83," * *, Sei are dependent. Then the k points po, Ps,'." *, Pr, congruent 
to them are dependent, which is not possible (property (d)) since these k 
points are contained in the k + 1 points p,, pz,--- , Pru Which are independent 
{Theorem 6) since they belong to the pseudo-S,,r (k + 3)-tuple p1, pa, °°, Prase 


Case II. The points 82, Ss,- - , SA are each equidistant from the points 
$i42, Ss and since these k points are independent it follows from property (k) | 
that the (k—1)-dimensional subspace 9*%1,- determined by them is the 
locus of points of Sxr equidistant from sg. and Sms Now the points 
Se) Sa" * Stu: ate also contained in the locus of points s of Sx, such that 
cos (8842/1) + cos (582:4/7) == 0. Since Sxs2Sts4 — Priz Prr the points Siro, Ses 
are distinct and not diametral. We prove now the following ascension: 


Assertion. The locus of points t of Sur such that 


COS (t8x12/7) + cos (ésxs4/r) = 0 
is the (k — 1)-dimensional subspace of Sy, determined by 82,53," * °, Sat 


To prove this assertion, it is shown first that if t, t2,:--, te. are k +1 
pairwise distinct points of ‘Sz, such that cos (ti8ui2/r) + cos (tiSeue/T) — 0, 
then the determinant Ans(t:, t2,° * * , tea) of these points vanishes, and hence 
the points are in an S,.,-. For suppose this determinant does not vanish, 
and consider Azs (ti, tay * * * ; teas, Seez, Sera), Which clearly equals zero, Adding 
the elements of the last row (column) to the corresponding elements of the . 
preceding row (column), and using the expansion ** employed in Theorem 2, 
we obtain, after some obvious reductions, 


[1 + cos (Sxi28e10/7) | Arn (ts, do, °°, tear) = 0. 


Since Sista 7E d, it follows that Azn (tu tz, °°, tx) vanishes, contrary to 
our supposition. Hence each set of k + 1 points of the locus is a dependent 
set, and the locus is at most (k — 1)-dimensional. But the locus contains the 
k independent.points Sa, 8, * * * , %1 Of Sir. It follows that the locus is exactly 
(k — 1)-dimensional. | 

Finally, let s&s; (1==2, 3, - *,k+1) be any element of the (k-~1)- 
dimensional subspace determined i 825 83,° © °$. Now the determinant 
Aras (Say 88," © * ; Stat) Stray See) 8) is zero, and every principal minor of order 
k + 2 vanishes. It follows that every (k + 2)-nd order minor of the deter- 
minant is zero. Adding the (k + 2)-nd row (column) to the preceding row 


14 See footnote 12. 
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(column) —a transformation of the determinant which leaves the rank 
unaltered — and expanding, as above, the vanishing (k + 2)-nd order minor 
[k + 2, k + 2] of the resulting determinant, we obtain, | 


RTL + cos (Seas8eua/7)] : Agr (82, 835° * * y Sin 8) E 
— [cos (s8:2/r) + cos (s82:4/r) |? > An (825° * * , Sen) = 0. 


Since Ap (Sa, S37 ° t, Sey 8) == 0 and Ag(Se, 83," * >, Sr) does not vanish, 
we have cos (88.2/1) + cos (s8x,4/r) — 0, and sis a point of the locus. 
Hence the assertion is proved, and the locus.in question tdentified with the 
(k —1)-dimenstonal subspace S*y,, determined by the k independent points 
Sg, 8g," * * 5 Ski. 

-But this is impossible, for since cos sa (eects + COS (SksSeiu/7) = 0, 
the point ss belongs to the above locus, though it surely does -not belong to 
8"vr, for sis is not equidistant from the points seis, Sus. | 

This contradiction shows that the distances determined by the points 
8a) 389° © * » Skat, Stray Sta, Siva Go not satisfy the relations of Case II. 

Interchanging Sra With St in Case ITI, and Srs with Sw in Case IV 
reduces these cases to Case IT, and hence the assumption that | 


Pay Pos’ ` ` p Phris Diva, Pere, Prr 


_ is not a pseudo-Sk,r set leads to the distances determined, by these points not 
satisfying the relations of any one of the above four cases. This contradiction 
proves the k + 3 points from a pseudo-S3, set. A similar procedure is used 
to show that the k remaining (k + 3)-tuples of the set P are each pei r 
sets, and the theorem is established. 


COBRÒLLARY. If Pisa pretido-Se set containing more than k + 3 points, 
ne pair of which is diametral, then every set of m=k+3 points of P is à 
pseudo-%, r set. 


Proof. Since P is a pseudo-S,- set there is at least one pseudo-Sz,r 

(k + 3)-tuple qi, 92," ‘ *, qos contained in P. (property (h)). Clearly, any 

-subset of P that contains these k + 3 points is a pseudo-S,, set. Suppose, 
now, that pi, Pa, ° `, Piss,’ * `s Pm ÀS a set of m=k+ 8 points of P not 

containing any of the Points qa qa 5 Ques) Then Qu, a,‘ Geiss Pi is 8 

pseudo-Sx,r (k + 4)-tuple without diametral points and hence, by the pre- 

ceding Aren each set of k + 8 of these points (in particular, the set 

ga Js)" * > Qauss Pi) is a pseudo-S;, set. Then ga, qs," * `, Gare, Pos Pa is 8 

+ i (k + 4)-tuple without diametral points and hence, as before, 
Gas Jas * ** » Yessy Pas Pa 18 8 pseudo-Sy,-(% + 3)-tuple and qs, Ga, * *" Yrs» Pis Pars 
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is a pseudo-S;,, (k + 4)-tuple without diametral points. It is clear that this 
process may be continued ‘until the points gı; QE isa are replaced by the 
points fi, Pz- ``, Peis forming a pseudo-S;, (k + 3)-tuple. Then the points 
Pas Des * t, Press © * > Pn Surely form a pseudo-S;, set. 

Finally, if t 554+ 2 of the points gi, Qe,” °°, qra occur among the 
m-tuple of points Pı, Ps,’ Pass, °°» Pan, We have, with convenient labelling, 
Pi = 4), (J= 1,2,--+,4), and the above process, starting with the pseudo-Sx,r 
(k + 3)-tuple Jiris Quxss F7 t fems Po Po’ © ta Pa 18 applied as before to 
complete the proof of the corollary. ` i 


Lemma. Let P= (Pı, Pas’ `, Pass, Pera) be a pseudo-Sr,r (k + 4)-tuple 
without diametral points. Then pipm = PjPm or 


COS (P4Pm/T) + cos (PsPm/r) | = 0, 
(iim=1,2,.-.,k+4; PTE 


Proof. If i= j the lemma is trivial. Suppan; then, i +4 j, and consider 
the two (k + 8)-tuples i 


Po Pas "> Pi- Phs t * o Pina; Po Pas Ts Di-ry Purn’ ©’ s Pir 


obtained from P by omitting, in turn, the points p; and p;, respectively. | 
According to Theorem 8 these two (k + 3)-faples are pseudo-Sx,r sets, and 
since the (k + 2)-tuple 


Pry Pa’ * * » Pt-1y Pins’ ` * > Das Phs’ © © > Pias 


is common to both sets, an application of Theorem 7 gives at once the desired 
result. 


Turorem 9. Let P= (Pi Pas ee be a pseudo-S;,r a) -tuple 
without diametral points. Then pipm = PiPn or °. 


cos (pipm/r) + cos (pipa/T) — 0, © (im; jn); 


for each pair pip, PiPn Of ane $(k + 3) (k + 4) distances determined by the. 
points of P. j 


Proof. If one of the indices t, m equals one of the indices j, the theorem 
reduces to the preceding lemma. Suppose this® is not the case. Ac- 
cording to the lemma, pipm==pifm OT cos (pipm/T) + COS (P;Pm/T) = 0, 
and PjPm = Pipa OT CO8 (Pjpm/T) + cos (pjpx/r) == 0. A consideration of the 
four possibilities thus presented leads at once to the theorem. 

We may now ‘establish a chpracterization theorem for pseudo-S;, sets of 
k + 4 points, no two cae being diametral 
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THROREM 10. If P is a pseudo-Si, set of k + 4 points, Po Pay" ** > Piris 
‘no two of which are diametral, then for every pair of distinct points p, p; of P, 
cos (paps/r) =+ 1/(k +1). The plus and minus signs are “ determinantally 
distributed” ; 4. e., the signs occur in such a manner that the determinant 


Arn (P15 Pas i ` y Disa) = | cos (pips/r)|; (i,j—1, 8," ‘,k +44), 


may, upon multiplication of appropriate rows and the same numbered columns 
by — 1, be transformed into a determinant with each element outside the 
principal diagonal equal to —1/(k +1). 


In a recent paper * it is shown in detail that this theorem follows from 
Theorems 6, 7, 8, and 9 of this section, and the argument need not be 
‘repeated here. 


FIRST CHARACTERIZATION THROREM. Lei F P be a pseudo-Sy,,'set of more 
than k + 8 points, containing no diametral points. If p,q are any two distinct 
points of P, then cos (pq/r) = = 1/(k +1). | 


Proof. If P consists -of exactly k + 4 points, the conclusion is warranted 
by Theorem 10. Suppose that P contains at least k + 5 points and select. 
any k + 4 points of P containing the points p and q. By the Corollary to ` 
Theorem 8, these k + 4 points form a pseudo-&;%, set and hénce 


cos (pg/r) = + 1/(k + 1). 


LEMMA. If pi, Pa: +, py and quqa’ * *,q1 are two pseudo-Sr,r sets 
without diametral points, and 


Po Poy" t t’ Pia © Qu I t? ‘> Qi 
then either 2 ; ; 
> " Pips = ui (= 1, 8: “x? 1); 

or | , 
cos (pip;/r) + cos (gigs/r) —0, (t==1,2,---,j7—1). 
Proof. Since the two sets are pseudo-S,. sets, then j =k +3. The 
two (k + 8)-tuples pjr-2, Pipes > Pinus Py ONG Gye-2, Yj-eay***y ia, Ys ATE, 
. by the Corollary to Theorem 8, pseudo-§;,, sets. It follows from the hypothesis 
of the ‘lemma that e | 


i 


Pis Djok-1,° * Dia © Yj-e-2) Qj-k-13° “> Us, 


187, M. Blumenthal, “ Metric methods in determinant theory,” American Journal 
of Mathematics, vol. 61 (1939), pp. 912-922. The paper referred to uses, as noted, 
Theorems 6, 7, 8, 9 of the present “article, but merely atates these theorems without 
offering any proof. 3 
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and hence (Theorem 7) we have 
(A). PsP) = qi (i= j—k—2,j—k—1,- . ET) 
or ‘ a " i | 


(B,). cos (papy/r) + cos (qigs/T) = 0, 
(t= j—k—2,j7—k—l,:- Sg ah) 


‘Applying the same reasoning to the two (k + 3)-tuples 


Pie Pikas" °°, Pis, py and Gite VESTE "os Qi Vs 
we obtain | | 
(Aa). Pipik- = QiQiass DIDI = Gien > PPa = Yli- 
or : 
l © COB (PiPj-x-3/T) + 008 (gjgj-xs/r) = 0, 
(B2). cos (p;Pje1/T) + COS (qyqja1/T) = 0, 
cos (pypj-1/7) + cos (gig3-1/r) — 0. 


It is now easily seen that if the alternative (A,) subsists, then the alterna- 
tive (Az) holds, while the validity of (Bı) implies that of (B,). Thus the 
alternatives (A,), (Bı) have been extended from t= j j —k— 2, j—k—1,- 
j—1 to t—j—k—8, j—k—2, j—k—1,---,f—1. tie a 
this manner, the index i is made to recede to 1, and the lemma is established. 


SEooND CHARACTERIZATION THEOREM. Let P be a pseudo-&;,r set of 
arbitrary power exceeding k + 3 and containing no diametral points. Then 
for every integer t > 1, the determinant A; formed for each set of i points 
(pairwise distinct) of P has, upon multiplication of appropriate rows and the 
same numbered columns by —1, all elements outside the principal diagonal 
equal to —1/(k +1). 


e 
Proof. Let Pı, pz'' +, pi be any set of iœ 1 pairwise distinct. points 
of P. Ifi— k + 4, the conclusion follows from Theorem 10, 


Case 1. If i < k + 4 then the i points pı, po, - `, pı form part of a set 
of k + 4 points which, by the Corollary to Theorem 8, is a pseudo-Sy,r set. . 
` By Theorem 10, the determinant Ax, of these k + 4 points has, upon multi- 
plication of appropriate rows and the same numbesed columns by — 1, all 
elements outside the principal diagonal equal to —1/(k+1). The de- 
terminant A;(p;, Pas * °, pı) being a principal minor of this determinant, 
is then transformed by these elementary operations to the form specified in ` 


the theorem. 
e. 


Case 2. If i>k-+4 then by using the preceding lemma in the same 
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manner that Theorem 7 was applied to the proof of Theorem 10, the method 
utilized to prove the latter theorem may be adopted without change to establish 
the present theorem in the case under consideration. | 

It is. noted that requiring a pseudo-84. set to be free of diametral point- 
pairs (a condition that enters into all of the lemmas and theorems following 
Theorem 7) rules out pseudo-S,,- sets from consideration since evidently each 
pair of points of such a set has distance d = mr. It is obvious, however, that 
even for these sets ‘the conclusions of the above two characterization theorems 
_. are valid. | 


4, Spheroidal and pseudo-spherical spaces. Characterization theorems. 
A semimetric space of finite diameter d and positive space constant (parameter) 
p is called an n-dimensional spheroidal space S¢ provided Properties I-V 
of Section 2 are satisfied when: the cosine function involved in these statements 
is replaced by any monotonic decreasing function $(pq/p), defined for each 
pair of elements p,q of the space, with (0) —1, (d/p) =— 1. The Sr: 
as well as the same point set metrized with euclidean (chord) distance are 
examples of spheroidal spaces. It may be observed that spheroidal spaces 
arise as simple metric transforms of subsets of the 54. 

Since the derived properties (a)-(1) of Section 2 may be deduced from 
Properties I-V (and those properties of ¢ given above) they are valid in any 
spheroidal space. Thus each k-dimensional space Sg. has minimum congruence 
order k + 3 with respect to semimetric spaces. The characterization of pseudo 
sets of more than k + 3 points and free of diametral point-pairs is given by the 
two characterization theorems of the preceding section upon replacing the 
cosine function by ¢. Thus, in particular, pseudo sets for the sphere with 
j chord metric are characterized by these theorems. 
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A GEOMETRY ASSOCIATED WITH CREMONA’S EQUATIONS.* 
By Gurarp B. Hore. 


Introduction. In the geometry of planar Cremona transformations there 
are two important problems associated with the forms 


(cz) = ay? + 2,3 a < F Tp? — To? 
‘and (la) = t, + t +: ‘+m — Bay 


If a complete and regular linear system X,4 of ater curves of dimension d 
with the generic curve having genus p is of order æ, and has multiplicities ` 
Tı, T2," * *, To at a set of p prescribed base points, then T= {2o; 2, T2," , Lp} 
is called the characteristic of 3,,4 and satisfies the Cremona equations: * 


(st) —1—d—p, (le) =—1—d+p. 


In 1934, Coble? gave a method of determining every ordered solution of these 
equations for a given p, d, and p. However, a solution of the Cremona equa- 
tions may not determine any system %,,4 and there has not yet been discovered 
any general criterion for distinguishing between proper, degenerate, and virtual 
solutions. (A solution® is defined to be proper, degenerate, or virtual ac- 
cording as the generic curve of the system is (a) existent and irreducible, 
(b) existent and reducible, or (c) non-existent.) Coble gave criteria for 
p= 9,10 and certain p, d but a general criterion is still lacking. 

The second important problem in connection with (lz) and (az) arises 


from the fact that a linear transformation, è 
L'o == NTo — Pat — Taz —' * ` — TpTp 
0. T. L'an 81% — O11 = Zits —" — Sipp 
T p mm SpTo — Ap Ly — apti TT QppTp, 


which gives the effect on x of a Cremona transformation with F-points at the | 
base points of Fat m must leave (zx) and (ls) ab®lutely invariant. Once 


* Received September 13, 1939. : 
1A. B. Coble, “ Algebraic geometry and theta functions,” American Mathematical 
Sooiety Colloquium Publications, vol. 10 (1929), New York City. 
*A. B. Coble, “Cremona’s diopantine equatjons,” American Journal of Mathe- 
. matios, vol. 56 (1934), pp. 469-489. 
* Loo. oit., p. 461. $ 
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again, the converse is not true. There are linear transformations of this form, 
leaving (lz) and (zz) absolutely invariant, which do not represent any C.'T.4 
| For p S 8, the number of these linear transformations is finite, and they 
all represent Cremona transformations with 8 or less F-points. For p= 9, 
the number is infinite but in 1932 Dr. Taylor * ‘showed that there are a finite 
number of types, each expressible in terms of certain parameters, and that 
they are all geometric; i. e. they all represent Cremona transformations. These 
results were later put in better form by Barber.* | - 

For p > 9, there is still no simple way of distinguishing a geometric linear 
transformation. At one time it was thought that it was sufficient that the 
numbers n, 14, Sj, ay be positive or zero, but examples have been devised which 
show that this is not true. : 

In this paper the problem is studied by considering’ (æt) ==0 and 
. (lr) —0 as.loci in a projective space Sp. The most interesting result is the 
` appearance of linear transformations of infinite period and simple algebraic 
properties. These transformations give a simple tool for obtaining results 
already known and also provide the answers to questions that have been raised 
in the literature. 

In the work the C-, P-, and D-characteristics (i.e. dose of 
(zz), (lv) = —1,—3; 1, —1; and 2,0 respectively) play their usual im- 
portant role. Also elliptic characteristics of d = 0, p == { and d==1, p==2 
enter in the work for the first time, The invariant characteristic {3, 1°} will 
be designated by 7 and the fundamental P-characteristic {0;. 0t — 1} by 8. 

The group of linear transformations leaving (ac), (Jz) invariant will be 
denoted by G(R)p,2, G(I)p2, or G(C)p.2 according as the elements have 
rational coefficients, integer coefficients, or represent Cremona transformations. 
my set of p points at which % is defined will be designated by Pos 


1. Harmonic Perspectivities. “The harmonic perspectivity in any point 
y not on (vr) — 0 and its polar Sp.: (yx) = 0 has the equations: 


(1) De (yy)z — 2 (yx) y. 


tQ, B. Huf, “A note on Cremona transformations,” Proceedings of the National ` 
Academy of Sciences, vol. 20 (1934), pp. 428-430. 

M. E. Taylor, “A determination of the types of planar Cremona transformations 
with not more than nine F-points,”’ American Journal of MORE vol. 54 (1932), 
pp. 123-128. - 

*S. F. Barber, “ Planar Cremona transformgtions,” Amerioan Journal of Mathe- 
matics, vol. 56 (1984), pp. 109-121." 
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It will, of course, send points on (xx) — 0 into points on (az) == 0. If it is 
to do the same with points on (ls) — 0, then either y = l or (ly) = 0. 
"In the first case the equations of the involution in the point J are: 


(2) af = (p— 9) — 2? (ls). 


It is readily shown that if this substitution is to 3 leave (zx), (x) absolutely: 
invariant, it must be written in the form 


(3) T = — 2 + BC) l 
p—3. 
For any value of p Æ 9 this gives an involution which is an invariant member 
of G(R),,s. Some of these have been noticed in the literature. p= 7, 8, 10, 11 
give elements of q(I )p Which are in G(C)p,2 for p=~7, 8. The involutions 
for p = 3, 6, 12, 15 have integer values for n, Ti, 8; and have been studied also. 
If y is in (Iz) —0, the equations of the involution must be written in 
the form 


4) 0 na P 


- to insure invariance of (zxr), (x). For (yy) —2, these are all elements of 
G(T)p,2 and are the involutions in D-conditions which have been studied in- 
tensely. : If y is a geometric D-condition, the involution is in Gd Vea | 

For (yy) —— 2, we have members of G(I)p,a which are never in G(C)p,2 
since n = 1 — yo? is never a positive integer for Yọ 0. The first integer y 
such that (yy) = — 2 occurs for p == 11 and gives an interesting element of 
G (I)e . 


2. Pencils of Characteristics on a line. Related due We : 
will say that two characteristics.x, y are of the same sort if (Iz) — (ly) and 
(zz) = (yy). This means that the associated linear systems, tf any, will have 
the same genus p and dimension d. We yay ask ourselves: under what 
conditions is | | 

(5) | | z= AD Fuy 


of the same sort as both x and y? Substitution yields? 


(6) z=Aàt + py is of the same sort as both x, i and only if (cy) = (x) 
— (yy) and At p= 1. 


If two characteristics of the game sort satisfy the condition a = (wy) ; 
= (zy) we will say that they are related.  * 


11 
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(7) Ifa z and y are two related M all characteristics in the form 


RCE dot (1—A)y=A(o—y) +9 | 
or (L—p)a+ py ==n(y—2) +2 


_ are of the same sort as z, y. 


Linear pencils of this type have been studied in detail in the literature. 
All D-conditions on Pe,» are included in 120 such pencils. No related pairs 
of geometric P-curves or C-nets exist. Indeed, the condition that two be 
‘related is invariant under G (C)p,: and it is evident that {0; 091} and {1; 0°} 
are not related to any geometric characteristics of the same sorts. Non- 
geometric pencils of related P-curves have been exhibited.” 

If z and y of the same sort are related, the line joining æ,y is tangent 
to (zz) = 0 at a point in (lz) 0. Hence the result givens in (7) may be 
put in the slightly different form : 


(8) Characteristics of the form ka + y are of the same sort as y for all 
values of k'if and only tf (aa) == (la) = (ay) = 0.. 


The pencils determined in (8) are the same as those given in (7). The 
difference is that in (7) we think of the pencil as determined by two char- 
acteristics of the same sort; and in (8) the pencil is determined by an elliptic 
characteristic &'and a point y in the tangent Sp_, of (wr) = 0 at a. 


8. Systems of characteristics which lie on a plane conic. The pencils 
of characteristics obtained in 2 contained characteristics lying on a line de- 
termined- by two points. We can obtain systems lying on a conics by 
seeking the coton on a and b that 


å i | ka + kb +a 
shall be of the same sort as x for all values of k. The equations 
(l, ka + kb + z) = (iz) 


(ka + kb + x, k'a + kb + x) = (zx) 
regarded as identities in & yield: 


(a) (Ia) & (1b) = 0, (aa) = (ab) = 0, 
and | i | 
(8) 2(az) + (bb) 0, (br) —0. 
If & and b are any two characteristics which satisfy (a), then ‘ 
` a e . r + 


7 See reference ?, p. 480. 
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; HR bn 
will also satisfy On | Substituting these in (8) gives 
2a (de) + (08) 0, (ae) + v(bz) = 0. 

From the second we must have l L 

pmo(be); volts). 
Substituting in the first gives 

` RA(GT) + o” (äs)? (bb) = 0. 

If (dx) = 0, this is identically ‘satisfied, v — 0, and we have a pencil of the 
type in (8). For (adv) 40, we must have A==-—4$(bb)o*(Gz). (It is 
readily verified that if b is an integer characteristic then 4(bb) is an integer:) 
Thus, . ’ 


(9) If a and b satisfy (aa) —(la)—(1b)—(ab)== 0 then all characteristics 
a! = — $ (bb) (ok)?(ax)a + ok[ (ax)b — (bx)a] + z 

are of the same sort as x. Moreover, every system of characteristics of the 

same sort and of the form 7 


ka+kb+z 
can be obtained in this way. If (ax) = 0, the system lies on the line ka + x. 


This means that any elliptic point a and a point b on its tangent plane 
and (ic) —0 determine for any characteristic x a system of characteristics 
of the same sort as z. If (az) 540, this system lies on a conic in the plane: 
determined by a, b, and z. In the next paragraph we will find intereséing’ 
properties of these systems. 


4. The linear substitution S*,,. In the preceding section it was found 
that any elliptic characteristic aand a characteristic b satisfying (ab) — (1b) 
== 0 determine with any characteristic z a system of characteristics of the 
same sort as z and lying in the plane determined by a, b, æ. This led to 
the equation: e | 


(10) | f= 00) (asja + k[ (ax)b — (bz)a] + x. 





For a, 6 and k given, this is a linear substitution which sends any char- 
acteristic z into a characteristie æ of the same sort. It leaves (xz), (Iz) 
invariant. For rational a, b and k it is always an element of G(R)pe; for 
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integer a, b, and k it is in G(I)p,s and is sometimes an element of G(C)p,2. 
We will designate such a-substitution by Si and study its properties. 
The following properties are readily verified : 


. Sap = Sag = Son == Se al, 
(11) S¥o,y — Sra, b = S'a, ko 
Su, pa = Shi; g Sh - Ia — S'obatsbse 


The parameter k plays the role of an exponent and gives a simple law of 
multiplication for. substitutions S*,,» defined for the same elliptic characteristic a. 
‘The’ simple algebraic form of (10) leads to the following theorems: 


.(12) All lines through a which are tangent to (xx) == 0 are left invariant 
by St for all b, k. : 


(13) Sov == S35 if and only if a = Aã and Ab = ua +b. That is, two sub- 
stitutions are the same only if they are defined at the same point a and.b, d 
- lie on a line through a. 


(14) If x,y are two characteristics of the same sort, and an elliptic chàr- 
acteristic a exists such that ` 


(az) = (ay) = k #0, 


then b= (y—x)/k satisfies (ab) = 0 and defines an San which sends x 
into y. 


A`given elliptic characteristic a and a suitable b determine a linear sub- 
stitution Sa,» and its inverse Sa,- which generate an infinite “cyclic” group. 
On the other hand, a particular elliptic characteristic a defines an aggregate 
‘of gements Sa,» for all characteristics b satisfying 


(ab) = (b) —0. 


From the laws (1) it is evident that this aggregate constitutes an infinite ` 
“abelian group. We will designate it by {Si}. 


5. The condition that Ila shall be a substitution Sa». To relate trans- 
formations Sa,» to known, elements of @(C)p,2 we investigate the conditions | 
under which Zola, the product of two harmonic perspectivities in D-conditions, 
may be such a transformation. As an algebraic tool we use the theorem.® 

The necessary and sufficient condition that a square matric M can have 


' Given in a paper by the auth®r read to th® Texas Section of the Association, ` 
May, 1938. 


A us ASSOCIATED WITH OREMONA’S EQUATIONS. _ 861 


its k-th power a matrix whose elements: are séhoniate of dagta nin k ws 
that (M — I)" =0. , 


Obviously, the k-th power of the matrix of any transformation Sox has 
elements which are quadratics in k. Then if el is to be such a transforma- 
tion, its matrix must satisfy (M —I)* == 0. Applying this condition, we find 
that we must have (cd)?==4 which means that the line joining c and d is 
tangent to (zz) — 0 at the elliptic point c— d. That the condition is suffi- 
cient is shown by verifying that So-a,e = Iela if (cd) == 2. 


(15) If c,d are two distinct D-conditions, then IIa is a linear substitution 
Sev tf and only tf (cd)?—=—4. If sane are chosen so that c,d are related 
(i.e. (ed) = 2), then | 

. Lola = Sea, 


and Iela and Ial, generate an infinite cyclic group. 


In 1933 Dr. Barber,® using purely algebraic methods, obtained a set of 
necessary and sufficient conditions that IIa and Isla be permutable. From 
the present geometrical point of view it is clear, that permutability is possible 
if either: 


(a) the line éd is in the polar Sp-s of the line cd with respect to (rr) == 0; 
or E . | 
(b) the lines cd and Cd are tangent to (xx) — 0 at the same point. 


Indeed, in the second case Iela and 1:13 ave transformations Se,» defined 
at the same elliptic point and permutable by (11). The algebraic conditions 
for these two cases are equivalent to Barber’s conditions and hence are neces- « 

_ sary as well as sufficient. Thus | ° 


(16) The do and. sufficient conditions that Ifa and I ae be permutable 
as that either: 

the lines cd and éd be conjugate with respect to tan = 0, 
or the lines cd and éd be tangent to (xx) — 0 at the same point. 


A simple algebraic form is: we 


(17) The necessary and sufficient condition ‘that Iols and Isla be permutable 
ts that either 


° (ed) =2 is equivalent to (cf)? = 4 since e— o determines the same harmonie 
perspectivity and the‘same D-condition. as c. d 


1 
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(cc) = (cd) = = (dé) == (dd) = 0 
or (cd)? = (Gd)? = 4 and PRET) 


where signs are chosen so that (cd) = (Ed) = 2. 


These conditions are simpler than those given by Barber. However, all 
conditions given there are necessary in a technical sense, because they are all 
consequences of these. 


6. Results in the case p—9. For p—9 the Ss, (lc) ==0 is tangent 
to (ar) = 0 at the elliptic characteristic 1. Any D-condition is on (lz) —0 
and by (8) defines a pencil of characteristics kl + d. These lines all meet 
Ty == 0 in points which are D-conditions on Pas. Thus the lines of D-condi- 
tions determined by the D-conditions for which 2 == 0 contain ‘all D-conditions. 
There are 120 of these. 


(18) All D-conditions for Po» lie on the 120 tangent lines 
ki+d 


where d is any one of the 120 D-conditions for which x = 0. 


The one elliptic characteristic l? defines a group {81}. Any element is 
determined by a characteristic b satisfying (1b) — 0. If one choice b defines 
an element, then by (13) all 6 — ki + define the same element. In par- 
ticular, for k = — bs, there is a b in s= 0 which defines the element and 
only one of this sort. Every element of {S:} is determined once and only once 
by all characteristics b satisfying (lb) — bo 0, Indeed, it is the infinite 
abelian sub-group dp. 


+ 
(19) {Si} ts the infinite abelian sub-group as and each element is given 
once and only once by Sir, where (1b) = bo == 0. 


Any two P-characteristics y, z satisfy (ly) = (lz) = — 1, and hence by 
(14) (z —y)/— 1 = y —2 is a b such that Sı» sends y into z. 


(20) AU P-characteristics on Po, are geometric and any pair defines a unique 
element of ay which senels one into the other. The images of {0; 08— 1} 
under dy include all P-characteristics once and only once. 


& is the integer group defined at 1. If we allow rational values of b 
then we have a subgroup of G(R)52. Under this group all C-characteristics are 
conjugate. Indeed, two C-characteristics y, g satisfy (ly) — (lz) — — 3 and 
by (14) (y—-z)/3 is a rational b such that Si. sends y into z. Ordinarily 
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this would give an element with rational ciel. It may happen, however, 
that they turn out to be integers. | 


To state the conditions under which this occurs we note that b— 4 (kl 
+y— z) yields the same substitution as (y—z)/8 and is integral if 
y — z= kl, mod 3. On the other hand, if z is the image of y under an element ` 
of ay, it can be shown (by writing down the explicit form of a general element 
of a) that y—z=kl, mod 3. 


(21) The necessary and suficient condition that two C-characteristics y, z 
shall be conjugate under a is that 


y — z= kl (mod 3). 


Now for every integer C-characteristic on nine Dont there is one on 
eight points which satisfies the condition of (21). A C-characteristic on nine 
points is geometric or not according as it is related under (21) to a geometric 
or non-geometric characteristic on Ps. This can be used to obtain the 
criterion obtained by Coble.? | 

The substitutions with rational coefficients sometimes ‘have a surprising 
form. For b == {8; 3°0}, S*.,» has the form: | 


Do == (86K? + 1)70 — (1247 + on + Da - - —(12k? — 8k) To 
di = (12k? — k) to — (4k? — 1) 2, — 4h?ta — ta atja 
pe a a - « (4h? — 3k) x 


Ta == (12k? — k) ay — 4h%a, — 4kie —t + -—(4k? — 3k) To 
Do = (12k? + 8k) ay — (4h? + 3h) 2, — (4h? + 3h) t_—- - »—(4h? — 1) to. | 
For k = — 1/3 this gives an element of G(R)s2: whose cube is in G(C}o. 
The characteristic {n;1;} = {5; 1°4} is integral and indeed is geometric. 
But {n; sj} == {5; 548 — 44} is a rational C-characteristic. This answers the 
conjecture :° that integral {n; 74} must require integral {n; sy}. 

7. Results for the case p= 10. For Pros the characteristic 1 again 
plays a particular role. In this case —/ satisfies the equations 


Gaia (aja 


‘and defines a virtual P-characteristic. Moreover, any P-characteristic p satis- - 
fies (lp) ==—1 or (— Ip) —1 and is related to —l. Hence for.p == 10 
all P-characteristics lie on the tangent cone from —J1 to (zz) —0. .They 
may be obtained by joining all elliptic points a to — 1. E “as à 


1° See reference *, p. 489. 
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` (22). All integer P-characteristics on Poe are given by 
p= tka-—1 

where k is any integer and a is any elliptic characteristic. 


Now Coble showed ° that any one of these could be reduced under G(C) 10: 
to an irreducible P-characteristic of the form k(!+ 5) —1, where 8 is the 
fundamental P-characteristic {0; 0°—1} and (14 8) = {8; 1°0} is the 
earliest geometric elliptic characteristic. It follows then that all elliptic char- 
acteristics are conjugate under G(C)104 to 2+ 8 or a multiple of 1+ 8. In 
particular, 


(23) All elliptic ‘characteristics of positive order and G.C.D=1 are geo- 
metric and reducible under G(C)102 to 1+ 8 = {3; 1°0}. 


Combining this with (22) yields the result 


(24). All geometric P-characteristics are given by 
pHa—l, 


. where, a runs over all geometric elliptic characteristics. 
Ly 


That is, on each line of the cone of all P-characteristics there lies one 
and only one geometric characteristic. That (24) gives a definite determina- 
tion of all geometric P-characteristics is clear when we recall that = elliptic 
characteristics are easily obtained by Coble’s method. 

: At the particular elliptic characteristic a — l -+ à there is defined a group 
{Sa} which is simply isomorphic to a. At any other elliptic characteristic & 
there is defined a group {Sz}, which is simply a transform of Aes) under 
G(C) 10,2, for a and ā are conjugate under this group. 


(25) Every infinite abelian sub-group of type (8) generated by pairs of 
involutions in related D-conditions is geometric and indeed is: simply iso- 
morphic to a Such a sub-group is defined for each elliptic characteristic and 
all such sub-groups are obtainable in that way. | 

e 


Dr. Barber obtained by experiment one of these defined at {4; 2715}. 
(26) gives a complete classification of sub-groups of this sort. 

F plays a particular role in another sense. By (3), 1, thé harmonic 
perspectivity defined by l and. , (tz) is a ‘member of G(Z)1o2 and has the 
equations 

Tio: r == — 2+ al(ix). 
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Coble discovered this involution in another way and named it To. For Tic, ` 
{n, ri} = {n; sj} = {— 19; — 6%} and the P-characteristics are of the form. 
—?R(I+ 8) +8. That is, all its P-characteristics are irreducible This 
naturally raises the question: is T; the only element of G(Z)102 with this — 
property. Coble’s simple algebraic form for irreducible P-characteristics pro- . 
vides an easy answer, since two P-characteristics must satisfy (pp) = 0. If 


(8k; k, —1,k} and (34; WW —1} 


are to satisfy this, then kk’ + k+ k =0 or (k+1)(k +1) 1, which is 
true for integers only when k = K = 0,— 2. Hence, 


(26) Any aenant of G(T):0,2 such that all its P-characteristics are irre- 
ducible under G(C)10, 2 is either To or Tro multiplied : by an element of r. 


An important corollary is: 
(27) G(Z)10,2 ts generated by m, Ass, and Ty. 


Another result is interesting to the writer in that it enables him to answer 
a question that has been in his mind for some: time. It is known that 
n, 84, Tj, j == 0 is not a sufficient condition that an element of G(Z)p,2 be in 
G(C) 2, the first example being devised for p— 11. However, the condition 
is sufficient for p< 9. The only case in doubt has been p = 10. From (26) . 
we see that any element in G(Z)10,2 but not in G(C):02 can be written in the . 
form ET, where # is an element of G(C)io2. By the algebraic form of 
To it can be shown that the n of such an élement must be negative. 


(28) For p10 the elements of G(I)p.2 which have n, Ti, 3j, we are 
all in G(Cio For p È= 11, this ts no longer true. 


8. -The case p— 11. For p= 11, l defines an element of. G(Z) 11,2, 
z Tu: g= — zs + (x) | 


for which {n; sj} = {— 10; — 3}. It sends any C or P-characteristic of., 
positive order into one of negative order. Also, this is the first place a 
transformation JI, is defined for (ly) == 0, (yy) = + 2. The re case occurs 
for v = {4; 2 119} and -has the equations 


I: x =g + (ve). 


Since (lv) = 0, 71, and Ty are germutable and TuTyis an involution. Indeed, 
it is the de Jonquieres involution defined by the geometric net {6;51%}. This 
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nes is interesting since it shows two elements of @(Z).:,2 which are virtual 
and such that their product is geometric. 

Another new type of- involution is Tni the involution in the first non- 
geometric D-condition {3; 1° — 1}. The product of Tı, and T, is abelian 
and TauTa = To This means that the group generated: by all transforma- 
tions Jy; (yy) = + 2, must include Tio and Ti. Or, Tio and T, seems to 
give the new types of elements of G(Z)11,2; so that weight is given to the con- 
jecture that G(I)1:,2 is generated by Aizs, 7, Tio and Ti. 

Further, on P::? the group {Sa} defined at a == {3; 1°07} has as andi 
tions on b: ; 


bi +H b+: Ehei or | bio + Dia = 0 
Vitre + Do + bio + bii = 3D bi + be +: + ++ by = 8bo- 


An element is determined then by b= {0; 0°1— 1} and its k- th power has 


the form: 
To = (9k? + 1) To — 3h*2, —3kt. . —++:—8k2ty — 8kayy + 8hayy 
ds me 3k’ To —(k?—1)2,— ka. ; — ka, — ko + hay 
Tz = 3k? — kr, ' — (k? — 1) t1 — — kit, — Ktio + kay 
Bad)” . be tah “Sere oe nes a ea saute Beg A de 

Co — 3k, - — hay — kx, (kK? — 1) to — kay, + hay 

Tio = — 8k + ka, +hte . +++ + kat, + To 

d'u = Bko — hay, — Ete —  —k% + a. 


à l 
. This infinite cyclic group furnishes whole families of irreducible P-`and - 
C-characteristics, including all irreducible P-characteristics for P62. That is, 
` these characteristics which are “ irreducible” under G{(C'u,e are no longer | 
irreducible under G(Z),;,2. Sab is the product IIa where c is the geometric 
D-condition {0; 0°1,—1} and d is the irreducible sion-geomettic D-condition 

{3; 1” — 1}. 

At the elliptic characteristic {4; 27150} a satisfactory b is {1; 12051} which 
defines a cyclie group of elements in G(I*)n, 2 None of these are geometric 
nor are any of the P or C-characteristics geometric, for this group is the trans- 
form of the one above under an element of G(C)11,2- 


Conclusion. The interesting point in this work is its unity and the 

' directness permitted by the geometrical point of view. The invariant in- 
volutions (3) which are defined for G(Z)p.2, p = 7, 8, 10, 11 and the trans- 
formations (4) for “(yy) ==— 2, (which areealways virtual) appear in the 
same way as the thoroughly studied involutions in D-conditions. Attention 


A GEOMETRY ASSOCIATED WITH CREMONA’S EQUATIONS. 867 


is drawn to pencils of elliptic curves of genus 2, which define these virtual 
involutions. Could it be that these virtual involutions may have some geo- 
metrical meaning? 

The study of the systems of characteristics lying on a line and on a plane 
conic is important in that it leads to the linear substitution Sz. The writer 
feels that the exceedingly simple laws which these satisfy should throw con- 
siderable light.on the structure of the groups for p11. In particular, 
Theorem (14) furnishes a simple sufficient condition that two characteristics 
be conjugate under G(I)p,2 and for p==9 gives very simple results. The 
. unity of the work would be increased if a simple geometrical definition of 
these transformations could be given. | 

All geometrical P-characteristics for Po,» occur once and only once among 
the P-curves of the sub-group {81} for p==9. From examples studied for 
larger p it seems possible that the aggregate of all geometric subgroups {Sq}, 
defined at all geometric elliptic characteristics a might have this property. 
An affirmative answer to this conjecture would make the study of G(C)p,2 
dependent only on the nature of the elliptic characteristics defined for that 
value of p. Thus elliptic characteristics may play as important a role in the 
general theory as C, P, and D-characteristics. 
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POLYNOMIALS WHOSE REAL PART IS BOUNDED ON À GIVEN 
CURVE IN THE COMPLEX PLANE.* 


By A. C. Somarrrer and G. Szzad. 


` Introduction. 1. In what follows we denote a rational polynomial of 
the complex variable z by =, if the degree of this polynomial is n, 
As a simple consequence of the theorem of S. Bernstein on trigonometric 
polynomials, the following holds: + ' 


A. Let f(z) be a mm and | f(z)|S1 in (¿| S1. Then |P) |En ` 
in |z| & 1, with the equality only if FC) — ez", | e| =1. Da 


This theorem has been generalized by Szegö in two different directions. 
First, the unit circle may be replaced by a Jordan curve subject to certain © 
restrictions : ? . 


B. Let T be an open or a closed Jordan curve consisting of a finite 
number of analytic arcs which join so that the exterior angle? is greater than 
zero. If f(z) ts a my satisfying | f(z)|S1 on T, then at any point à of T 


| F (40) | S Ans 


Here A is-a constant which depends only on T and 2, and ar ts the exterior 
angle of T at 2. The order of this bound as n becomes infinite can not be 
improved. 


On the other hand, the condition | f(z)|S1 in Theorem A can be 
` replaced by | Rf(z)| 5 1, so the following is true: * 


C. Let f(z) be a mn and | Rf(2)| Sl in |2| S1- Then POIES 
in |z| 1, with the equality only if f(z) = er, |e| —1. 


* Received April 22, 1940. 

- 1M. Riesz, “ Eine trigonometrische Tnterpolationsformel und einige Ungleichungen 
für Polynome,” Jahresbericht der Deutschen Mathematiker-Vereinigung, vol. 23 (1914), 
pp. 354-368. See also, O. Swdsz, “ Korlátos hatványsorokról,” Mathematikai és Termé 
sgettudományi Értesité, vol. 43 (1926), pp. 504-520. 

3G. Szegë, “Uber einen Satz von A. Markoff,” HORS Zoitsohrift, vol, 23 
(1925), pp. 45-61. 

"In case T is a closed Jordan curve, the exterior angle at any point of T is defined 
as usual. If I is an epen curve, the exterior angle is defined as in loc. cit.2, pp. 48-49, 

+G. Szegd, “Über einen Satz des Herrn Ser& Bernstein,” Königsberger Gelehrte 
Gesellschaft, Naturwissenschaftliohe Klasse, 1928, pp. 59-70. Also, S. Bernstein, “Sur 
un théorème de M. Szegi,” Prace Matematycono-Fizyozne, vol. 44 (1937), pp. 9-14. | 
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2. The main result of the present note is: 


THEOREM 1. Let T be a closed Jordan curve consisting of a finite number 
of analytic arcs which join in such a way that the exterior angle is always 
greater than zero and less than 2m. Let f(z) be aan satisfying 


(1) IMGISL zer 
Then at-an arbitrary point Z of T 
(2). | f’(0)| S Ans; 


here Awa constant which depends only on T and 2, and ax is the exterior 
angle of T at Zo. The order: of this. bound as n becomes infinite can not be 
improved. 


This is a generalization of Theorem C, at least so far as the order of the. 
bound of | f (zo) | is concerned; for if T is a circle, the exterior angle at every 
point is r so that a == 1. incidentally: i in this special « case our general method. 
used in §2, furnishes the inequality 


jf @)|<6en, Jaan 


For closed Jordan curves, Theorem 1 is an obvious extension of Theorem 
B which was obtained under the more restrictive hypothesis | f(z)| 1 on T.. 
Theorem B, however, holds even if T is an open arc, while our Theorem 1 . 
does not. Indeed let T be the real segment —1<2< -F 1 and consider the 
polynomial f(z) 1&2, z =s -+ iy, K real. In this case Rf(z) —0 on T, 
but |f(1)| can be arbitrarily large. More generally, we can take for T a 
Jordan are along which the real part of a certain given polynomial is constant. . 

Our proof of Theorem 1 makes use of the theory of conformal mapping 
and in particular of the theorems of Osgood-Taylor® concerning the behavfor ` 
of the map-function near the boundary. (See however, the last remark in 8 2.) 


3. Under the conditions.of Theorem 1 we may ask for proper bounds 
for the “ oscillation” of f(z) in T, that is for the maximum of | f(z)—f(z2) | 
if z, and zə describe, independently of each other, the closed interior of T. 


THEOREM 2. Let T have the same meaning ag in Theorem 1 and let 
f(z) satisfy the same conditions as there. Then for two arbitrary points Z 
and 2 in the closed interior of T, 


G) | #(%) —f(2)| <Alogn, n>1; 


t e . 
*W. F. Osgood and E. H. Taylor, “ Conformaf transformations on the boundaries 
_of their regions of definition,” Transactions of the American Mathematical Sootety, 
vol. 14 (1913), pp. 227-228. 
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here A depends only on T. The order of this bound as n becomes infinite can 
not be improved. 


Let F(z) be real at a certain (not nn fixed) point in T; then 
from (3)' 
(3) |Sf(2)|<Alogn,  zeT, 
follows. ; 
Theorem 2 is well-known for the case in which T is the unit circle. The 
much discussed example 


F(z) = (3/2) (2/1 + 2/2 +: + r/n) * 


‘shows that logn is the true rate of growth of the bound’ in (3) or (3’) 
[f(0) = 0] in case T is the unit circle. | 
Theorem 2 has a more elementary character than Theorem 1 ; therefore 

‘we found it convenient to bring its proof first. Having proved both inequali- 
ties, we discuss the precision of our estimates as n becomes infinite. Obviously 
Theorem 2 combined with Theorem B furnishes the less informative bound 

` Anslog n instead of the bound in (2). . | 

In the proofs of both theorems we use the following 


Lemara 1. LetT satisfy the conditions of Theorem B and let f(z) be a 
mn satisfying | f(z)| 1, zeT. We denote by (2) a function which maps 
the exterior of T onto.the exterior of the unit circla in such a way that the : 
points at infinity correspond. Then at a point g outside T - 


(4) AALS |y)" 
Here |y(#)| —R > 1. 


This is a well-known consequence of the maximum principle. 
e <3 


1. Proof of Theorem 2. 1. LetT satisfy the conditions of Theorem 2, 
and let 8 > 0 be the smallest interior angle at which two arcs.of T join. If 
Z is any point on T, we draw through z two line segments DL, and La with 
the following properties: 


(a) Zo is one end-point of i and of fe 

(b) the other end-points of L, and L, are also on T, whereas all die 
‘points of L, and Lz are in the open interior of T; 

(c) at Zo Lı and Le intersect T (or one of the arcs of IT if z is a vertex) 
with an angle of 8/4. 


_ The distance of any point Z on ZA or Lyfrom T (that is, from any point 
z on T) is at least sin (8/8) times the distance from Z to z provided Z is 
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sufficiently near to z. We determine the lurgest segments LU’, and F on Ly : 
and L, respectively, having % as one end-point, for whose points Z the condition 


(5) : |Z—z|2|Z—z | sin (8/8), eT, 
is satisfied. In what follows, let L= L(z) denote the larger of the segments 


I’, and I’; (or either of them if they are equal).’ The length 1(2)) of, this 
L(æ) has a positive minimum, say lo; as à runs over T. 


2.. The following statements are essentially known: 


Lemma 2. If the real part of an analytic function F(z) ts bounded by 1 
an the open interior of a circle of radius r > 0, then at the center 7 of this 
circle l 


(6) o JEZ) Ser. 


Lemma 3. If the real part of an Fou function F0) is bounded by 1 
in the unit circle | z| < 1, then 


(7) i | F(z) — F(0)| £ 2 log = 


Lemma 4 Let S$ be a segment of length s. If f(z) is a m satisfying 
| f(2)| S 1 on S, then | f(z)| SK provided z lies within a distance n? of 
either end-point of S. Here K is a constant which depends on s but is 
independent of n. 


Inequality (6) may be obtained by diftereitiating Poisson’ s integral which 
for a. circle of radius p, p < r, with center at the origin is 


7 1 r À 4$ 1 
F(a) =i SRO) + 5 [7 


Inequality (7) may be obtained from (6) by integrating along a radius from 
0 to z: 


“RLF (pe) }dg, | z] < p. 


|F(2)—F(0)| = | f P@als fr dt 


which is (7). Lemma 4 follows from Lemma 1 of the Introduction where #(z) 
is a function which maps the exterior of S onto the exterior of the unit circle . 
with the points at infinity corresponding. In case § és the segment (— 1, +1) 
of the real axis we need only note that | y (2) |*= |.z-+.(22—1)* |" is bounded 
if |z—1|Sn* or |z+1[/ Sn. . 


‘8. Now we proceed to the proof of Theorem 2. Let fo be a fixed int 
point of T and F(z).an analytic function (not necessarily a polynomial) - 
_ satisfying the condition | RF(z)| S1, zer. Then, according to Lemma 8, 


+ (8) | F(z) —F(to)| S 2 log 
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zer. 


oi 


he w = (z) is a function which maps the interior of T onto the as 


|w] <1 such that $(f) — 0. -If 5 is a small positive number, let As be 


_the set of points inside T which lie at a distance 3 or greater from F. Let 


ô == la sin (8/8); then the segments L drawn through each point zo of T 
according to the former constriction, extend into 43. Furthermore, from (8), 


(9) | F@) Fo) SB, zeda; 


here B is a constant which depends only on T and čo Now let z be a point 
on P and let z% be the end-point of L different from %; then me As. If ¢ is 
any point of L, (5) and (6) imply that — 


| EE) | S 2{| €— zo | sin (8/8) }>. 3 


This, together with (9), shows that if z is any point on L, 


Go) [PEFEA] S POLS C 


where Ọ is again a positive constant which depends only on T 'and o. 

So far the polynomial character of our function has not been used. Now 
let F(z) =f (2) be the 7, of Theorem 2. In the portion of L which lies at a 
distance greater than n° from z, (10) shows that 


| f(#) — f(t) | £ C log (Cn°). 


Since the length of L is greater than a fixed positive number > Lemma 4 
implies that 
| f(z) — F (ĉo) | S KC log (Cn*) 


- where K depends only on I. This completes the proof of Theorem 2 because- 


| Fe) =F SS | Fe) Fo) + LC) — Fo) [= 


2. Proof of Theorem 1. 1. Let z be a point of T at which two arcs. 
yı and y: intersect with an exterior angle ar, 0 <a< 2. It is no loss of 
generality to suppose that a= 0 and that the tangents to yı and yz at Zo = 0: 


‘intersect the real axis at ‘angles of ar/2 and’—an/2, respectively; also we 


may assume that a neighborhood of the negative real axis near the origin lies. 
inside r. With p a small positive number draw. two circles of radius p, the 


` first with center at pexp {t(a+1)7/2} and the second with center at 


p exp {— (a + 1)x/2}. These tyo circles will intersect at the origin, where 
they are tangent to y, and ys, respectively, and at the point 
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z= 2p cos { (a + 1)x/2} 


on the negative real axis. The arc of the first circle which lies above and on 
the real axis, and’ the arc of the second circle which lies beneath and on the 
real axis, together form the boundary of a region R which is closed and simply 
connected and whose boundary touches T at &—0. All other points of R 
will lie inside T if p is small enough, and the exterior angle of R at z is am. 

If a = 1 the two arcs which form the boundary of R are arcs of the same 
circle, and R is the circle |z+p|Sp 

If § is a small positive number, let Rẹ be the region obtained by trans- 
lating R a distance 8 to the left; that means ze Ra if and only if (z + d)eR. 
Now we show the following. If p is small but fixed, then for all sufficiently 
small § the region Rs will lie entirely inside D and at a distance at least Ad 
from T where A is a positive constant independent of 8. Indeed let us repeat 
the previous construction of Æ replacing p by 3p; the resulting region S is 
bounded by two arcs of circles of radius 3p with centers at 


3p exp {+ i(a +.1)2/2}; 
choose p so small that § lies entirely in the closed interior of T. Now fix p; for 
0<8<p| cos {(a -+ 1)7/2} | 


one shows by direct calculation that Ra lies inside § and at a distance greater 
than 
25 | cos { (a + 1)x/2} | 


from the boundary of S, and so inside T and at least this distance from T. 


2 Let f(z) be a polynomial satisfying the conditions of he 1. 
We conclude from (6) that if z is any point of Rs. ‘ 


(11) |F (2)| £ 2/8). 
Let y(z) be a function which maps the exterior of R onto the exterior of the 
unit circle with the points at infinity mutually corresponding. Then ẹ(z -+ 8) 
maps the exterior of Rs onto the exterior of the unit circle, and we obtain 
from (11) by application of Lemma 1 [cf. (4)] + 

[FOIE {2/ (A8) } | ¥(8) |. 


But from a theorem òf Osgood-Taylor mentioned in the Introduction [see 5] 
we conclude that near the nas point z = 0 of R theemap-function ¥(z) 
must be of the form s 


ÿ(z) =y (0) + ^p (2) 
12 


874 A. C. SCHAEFFER AND G. SZEGO. 


where | ¥(0)| = 1 and p(z) approaches a finite limit not zero as z approaches 
zero. Then if | p(z)| <A for small | z|, we obtain 


| (0) | SS (2/a8)} (1 + 4/77. 
Placing § == n4 (permissible for large n) this gives 
| #°(0)| < 2-A: et ne 
which proves the theorem. 
We notice that the map-function y(z) of the region R may be calculated 


in terms of elementary functions. This makes it possible to avoid the use of 
the Osgood-Taylor theorem. 


3. Discussion of the precise order. 1. The bound An* in Theorem 1 
is of the precise order as n — œ ; this follows from the corresponding fact in 
Theorem B. 

We show that the bound A logn in Theorem 2 is also the precise one. 
More exactly, let I, be a closed region in the open interior of T, z a fixed 
point on I’ and z; arbitrary in Tẹ; we construct a sequence {g,(z)} such that 
gn(Z) 18 a mn, n'= 1, and 
| Ryn (z) | = l; ze, 
| gn (4) — ga(z)| > A’ log n; 


here A’ > 0 is independent of n. 


2. By use of the polynomials D 2/v this construction is rather easy in 
1 


case a circle through z, exists containing T. The following method holds 
generally. The principal tool is Faber’s polynomials f,(2), n = 1, associated 
with T. They are defined as follows. Let 


w = ÿ(2) == 02 + co + ag? +: 7, 


(12) z= y (w) = iw +: my c > 0, 


be the conformal mapping of the exterior of T onto the exterior of the unit 
circle | w | > 1, uniquely determined by the condition c > 0. Then f,(z) is 
defined as the “principal part” of {#(z})}", that is ° 

{y¥(Z) }° 1 WT CO | 
(13) În(z) = cer Pt à dZ = D: (M) — aw. 
Here the integration is extended over a curve C enclosing T ea over the 
corresponding curve in the w-plane), and z is'in the interior of C. For the 
construction mentigned we need the following expansion (slightly different 
from the expansion in ?, p. 54, €17)): s 


° See ?, p. 53. 
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Yi(W)—z ; 1 & falz) 
49) oD log ee eo 772 sn Wir” 


Here the determination of the logarithms is obvious; and z is arbitrary. But 
| W|>1 if z is in the interior of I, and (FI > 146) if z is in the 
exterior of T, 
Expansion (14) is clear for W = œ. The differentiated expansion 


GAM) opas mi 
(15) yW) —z =W TAMON | 
follows from (13). - ; 

8. Let z be arbitrary in the closed interior of T; and let | W|>1. 


Then the imaginary part of (14) is uniformly bounded (see ?, p. 54) so we 
have for the Cesàro means of first order 


Le m \ fm(2) 
a) [sÈ(-2)50 | 
where Q depends only on T. Also, the function (14) is bounded for | W| 21 


and for a fixed z in the open interior of T (uniformly if z is restricted to a 
closed region T, entirely in the open interior of T) ; that is 





<Q, aeT,|W| 21, : 


| 8 f. . m\ fala) | L ~ 
ay [| 3G-8)ER | se, entries 
where Q depends only on T and Ty. 


4 Letz, bean arbitrary point on T with the exterior angle ar, 0 < a < 2, 
and let w, == y (21), | w, |= 1. Assuming for a moment that T is a closed 
polygon, we find by use of the Schwarz-Christoffel formula 


(18) DT (W) — y> (w) = CREER) ° 


where F(t) is analytic around t—0, and FC) #0. This furnishes, if . 
| W—w, | is sufficiently small, 


PEL) RE I ee 
0) FW) Fw) Wo + Le 











Now, : 
.{__W2OM)} + a: 
(20) fau) = aS j tr — yt (w) W—w, ; S 
Wwe 
+ — De Wa aw. 


For the line of integration we ehoose two gres c, and Ce; & connects two 
points w and w” of the unit circle (on opposite sides of w,) and runs entirely 
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in the exterior | W | > 1 of the unit circle; the'other arc cs is the “large” 
are ww” of the unit circle | W | = 1. We choose w’, w”, c, so that the func- - 
tion F(W —-w,) is regular and 0 in the domain bounded by the “small” 
are ww” of the unit circle and by c. 
In the first integral of (20) we can replace c, + ce by the unit circle 
'| W | =1; the resulting integral approaches 0 as n=> 00, according to 
Riemann’s lemma. The second integral is aw,", so 


. (21) fala) = ow+to(1), too. 
5. We define the required polynomials by 


i ERA ELIO! 
| m= £3 (1-2) Be, 
According to (16) and (17) 


(22). | Rfgn(e)} | S1, zer; | ga(z)| < g, z é To. 

But according to (21), 

(23) gr (4) -= į 3 È (1- 2) + 0 (log) 
eee PEO n— ©. 


Q 
This shows that the bound A log n in Theorem 2 is of the right order as n—> œ, 


6. Finally we remove the condition that T is a polygon. If z is given, 
we construct a polygon I” with the following property: 


` (a) Y contains T; 
, ® 2, is on I’; 
* (c) the FR enoE angle of IY at z, is an, o< ea 
` Obviously there is no difficulty in constructing such a polygon I” so long as 
a <a. 

Repeating the previous consideration for I’, we obtain a | sequence of a. 
satisfying conditions (22) ; instead of (23) we have — 


gn (41) siZ logn + 0 (ogni ARS 


which suffices for our purpose. 
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NEUER BEWEIS EINES SATZES VON G. H. HARDY UND 
S. RAMANUJAN UBER DAS ASYMPTOTISCHE 
VERHALTEN DER ZERFALLUNGS- 
KOEFFIZIENTEN.* 


von VOJISLAV G. AVAKUMOVIG. 


Wird mit p(n) die Anzahl der verschiedenen Zerlegungen von n in gleiche 
oder ungleiche positive ganzzahlige Summanden bezeichnet, so ergibt bekannt- 
lich die Hardy-Ramanujansche asymptotische Entwicklung von p(n) in erster 
Annäherung die Formel 


1) p(n ~ aR exp [2 ee n|, n= es 


Einen Beweis dieser Formel habe ich auf Grund allgemeiner Tauberscher 
Sätze funktionentheoretischer Art im Sections- Vortrag “Uber das Verhalten 
Laplacescher Integrale an der Konvergenzgrenze u.s. w.” /2. Congr. Inter- 
balkan. des Math. Bucarest, 12-IX-1937. Bull. Math. Soc. Roum. Soi. 40, 
Nr. 1/2 1938, 8. 101-106/ gegeben.* 

Im folgenden möchte ich mit der im Prinzip gleichen Methode die Formel 
I) auf möglichst kurzem Wege beweisen. 

1) Für R(s) > 0 ist 


oo 1 œ 
g(s) = Ily zam t E pin), 
R=1 = n=1 





also, ; 
(1) A(u) = p(n) fir nSu<n+1,,. (n == 0,1,2,° : ya 
gesetzt, 
oo 1 — g 
(2) f A (u)du = ( > J6). 
Sei 


0 für 0 S'u < 1/24 
(u) -4: für 1/24 S u < 25/24 


0 für 25/24 € u è 
* Received April 29, 1940. 


1 Q. H. Hardy and S. Ramanujan, “ Asymptotic formulae in combinatory analysis,” 
Proceedings of the London Mathematical Society (2), vol. 17 (1918), pp. 75-115. 


2Den Beweis eines Specialfalles dieser SAtze habe ich in des Note: “Théorémes | 


relatifs aux intégrales de Laplace su? leur frontiéme de convergence,” O. R. de l’Acad. 
des Soi. Paris, vol. 204 (1937), pp. 224-226 skizziert. 
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. und ; ; 
= Bu) Nes Ba f° (6) È 6) 57 ET a 
Dann ist nn 
f * eB (u) du 


wr 3/2 


< 1/ Vz. f “eg (u)du fren S ev TGF DIO 178 % 
-( E a et Vs/2r (exp[r*/65] — 1), 





was zusammen mit (2) 

f “ee*{A(u) —B (u) }du 
(=) si - (55) VTT Fe (expl/65] —1) 
_ =J (8) = 


-ergibt. Auf Grund der für die elliptische Modulfunktion g(s) gültigen 
Funktionalgleichung 


g (8) — Vs/2m exp[—s/24 + n*/68]g (4x*/s) 





sieht, man, dass 


JL +), i= V—1 
bei festem a fiir jedes e > 0 eine im Intervall (— œ, + œ) gleichmassig in e 
beschränkte und absolut integrable Funktion darstellt. Also ist . 
e po o | 
a f eat] (e + t? + Zait) di 
-0 
= eu A (u) — B(u))du f expl— tu + 2ai(e—u)t]at 
0 -00 
ar? 
— avr f7 {A(u) —B(u)} exp ere 


woraus 

(3) af” {A(u) _ exp [- ad =a EH auva | aa 
—a/Va Ae m tait] (13 + Zait) dt 

folgt, du im Integral rechts wegen lim sup | A (u)— B(u) | exp[— du] < const. 


(fir jedes 8 > 0) der DA ‘> 0 erlaubt ist. Wegen | 
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| l S à pers (U— 1724)» — (u — 25/24) 13 
SARI a (r°/6) T+ DTO 1/2) 0—18) 


EF ap [2 2 Eu T u—> © 
ist 
a f Bw) exp [=e Es 


Tap manri z |, T—> ©, 
so dass aus (3) schliesslich - 


(4) af” Orel a Java 


VE exp [+ Wes |; T—> co 





F (u= 25/24) | 











folgt. 
2) Mit y == z — Vr/a ist 


1+ o(1) = exp[— 1/240" (e ap [2 aN #21) 
x anol fer] fe fon SN 
also 


(5) lim sup A(y)4V3 y exp ae] 
< Vz exp[x?/24a* — Vx*6a] 

fe edu 
-Va 3 


3) Sei N — M(t) die kleinste nicht-negative, nach (5) stets vorhandene © 
Funktion, für die. 


. (1 #2) ae] —A(t)=0- 
ist. Wird zur FR 





== W, (a) —1, a œ. 


Qa = Min Q(t) e 


z— Vr/a<t<z 


gesetzt, so folgt wegen (4) und Q— 0, ad 
r TR CP a : j e 

- > Bei w œ strebt das in (3) Sechts stehende Integral als Fourierconstante einer 
absolut integtabieh Funktion gegen ð. 
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= [—2 fa ]a f° { apa) 
X exp [2 Vee] — A(u) exp [— a? C2] du/Vu 
>s exp [—2 Ve | Gta), f exp Cou a? Ga! du/us/? 
À VS : 


— T exp E [Hee du/ Vu, 





æ-Vo/a 


also | 
(6) lim inf A(2)4V3 z exp [ —2 Ve] | 
[et u/a — u?]du 

DS ENG oa a 


0 
f _etdu 
-Va 


4) Kis (5) und (6) folgt I). 


= W: (a) — 1, a> ©. 
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AN ALGEBRAIC PROBLEM INVOLVING THE INVOLUTORY 
INTEGRALS OF LINEAR DYNAMICAL SYSTEMS.* 


© By JOEN WILLIAMSON. 


Introduction. In what follows f = f(x), g == g(x) etc. are scalar func- 
tions with continuous second derivatives and are not constant in the z-domain 
under consideration. The point T == (2;,%2, - *, 2m) is a point of the 2n- 
‘dimensional phase space 


Ti = Pi, Enyi == (i; (t= 1,2,- k s‘ n), 


where, with the usual notation of Lo the q; denote codrdinates and the 
pi denote momenta. . 

Two functions f(x) and g(x) are said to be in involution, if the Poisson 
parenthesis,* 


& (af 3g _ ôf 2) č (2 og : Of se) 
1 me oo) oe Eae ana n 
( ) (f, 9) => ( oq: ôq; Ops, 2 0%; ni Ont 0x4 z 


vanishes identically. On denoting by G the skew symmetrie matrix, whose 
square is the identity, 
_ f0 +E 
S ‘ee 0 | 


where Æ is the unit matrix of order n, (1) can be written in the more 
compact. form 


(2) | (fa) aga = 0. 


In (2) fe and gs denote the gradients of f and g respectively and (fe) “the 
transposed of the column vector fs. A set of m forms fi, f2,: ©, fm is called 
an involutory system, if any two of them are in involution, i. e., if (fa, fs) =9, 
4,j=1,2,: : :,m. One can readily verify that m is less than or equal to n, 
if the involutory system consists of independent functions,? independent in the 


*-Received March 18, 1940. 

*Cf. e.g. E. T. Whittaker, Analytical Dynamics (Cambridge University Press 
(1904), page 288. 

l 2 Let A’GA = 0, where G is non-singular of order 2n and a is of rank r. Then, if 


E 0 
PAQ = B = A o pner 4H, is the unit matrix of order r, B’SB = 0, where G = P'AP. 
e 


Hence, if 8= (835), 4 ij Li, 2,...,®n, ayy =0, ig =1,2,.--,r. Since G and there- 
fore & is non-singular, r is less than or equal to n. 
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ordinary function sense, and that the maximum value n of m may be obtained 
for suitably chosen independent functions. 

If h= h(x) is the Hamiltonian function of a conservative dynamical 
system, with the above notation we may write ° 
(3) Gi ho, 
where & = dx/dt. 

If f is a conservative integral of (3), (fz2)’& is identically zero in x or, 
since G == — G1, by (3) 
(4) | (fa) Gha = 0 


Conversely, if (4) is satisfied, a F(z) is a conservative integral of (8). 
Hence the m functions 


f= h, fo: g Fr 


are mm conservative integrals in involution, if, and only if, 
(5) | (fi) fs, = 0, (,j—1, 2,° . m). 


It is known that m = n, but not more than n, conservative integrals in in- 
volution may pe chosen to be independent in the functional sense mentioned 
above. 

There remains the question: what becomes of these analytical facts in 
case the dynamical system is linear, i.e. if h = h(g) is the quadratic form 
$z’ Hr, where H is an arbitrary, but not zero, 2n-rowed symmetric matrix, 
representing the Hessian of h. In this case the Hamiltonian system appears 
in the simplified form 
(6) | Gt = Hz. 


Fugther the quadratic form f = $2’Fz is by (4) an kiei of ©, if, and 
only if, 
(7) z'FGHz= 0. 


Equation (7) is however equivalent to 
FGH + H’@’F’ = 0, 
or, since F and H are symmetric and G skew symmetric, to 
(8) 7 FGH — HGF. 
Similarly the m quadratic forms, which belong to the symmetrie matrices 


e . 
_* Aurel Wintner, “On the lingar conservateve dynamical systems,” Annali di 
matematica pura ed applicata, ser. 4, tomo 13 (1935-36), pp. 105-112. 
. ` 
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F, = H, Fo, © <, Fm are m quadratic’ integrals of the system (6), forming 
an involutory system, if, and only it,*- 


(9) PAGE; = BGK, : (j=l, 2go m). 


. It is understood that all the matrices F; are distinct from zero but are not 
necessarily non-singular. 

By the general theorem, mentioned for non-linear sete: there always 
exist m—n independent integrals in involution for the linear system (6). 
The conjecture was made by Professor Wintner that, in the case of a linear. 
system, these mn integrals may be chosen to be quadratic forms. The 
main purpose of this paper is to show that this conjecture is correct—that for 
every 2n-rowed non-zero symmetric matrix H, there exist n symmetric matrices 
F, =H, Fa: - +, Fx, which are independent and satisfy the involutory con- 
dition (9). It is understood that independence is now meant in the algebraic 
‘sense, i. e., that the corresponding quadratic forms are functionally independent. 
| By the general theory there always exist 2n — 1 integrals, which are 

independent, and n— 1 may be obtained, by a theorem of Liouville from the 
n independent integrals in involution by means of quadratures and elimina- 
tions. In the linear case it is possible that some of these n — 1 integrals may 
also be quadratic; in fact, if the minimal equation of HG is of degree 2m, . 
this number is l= n—m and, if the degree of the minimal equation is 
2m — 1, the number is 1=n—m--1. The remaining n—?—1 must 
then be determined by local elimination processes, which seem to lie outside 
the scope of an algebraic treatment. l 

It was found ‘advisable first to determine the linearly independent quad- 
ratic integrals, a. comparatively aimple process; and then, from them to de- 
termine the quadratic integrals independent in the more general sense. This 
was accomplished by the extensive use of linear differential operators, similar 
to the Aronhold operators of classical invariant theory. In § 6, when H is 
singular, the linear integrals of the system (6) are determined. 

In the final section it is shown that the dynamical system, corrésponding 
to the equations of variation of the small vibrations about an equilateral 
Lagrangian libration point in the restricted problem of three bodiés, has, for 
all values of the masses, in addition to the energy éntegral only one quadratic 
integral; and this integral is determined. 

The methods employed throughout the paper are purely algebraic, and the 


t Aurel Wintner, loo. oit., page 108. — | e 
SE. T. Whittaker, op. cit., pag 311. ô . 
° E.g, L. E. Dickson, Modern Algebraic Theories, Chicago (1926), pp. 25-27. 
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proofs, to a large extent, ave based on results, previously proved by the author,’ 
which give normal forms for a pencil of matrices, whose base consists of a 
symmetric and a non-singular skew symmetric matrix. These results, for 
convenience of reference, are given in § 1. 


1. Normal forms. If H is a symmetric and G a non-singular skew- 
symmetric matrix, we shall say that the pair A, B is equivalent to the pair H, G, 
if there exists a non-singular matrix P, such that 


PHP =A and PGP’ =B. 


In normal form the matrices A and B of the pair equivalent to H, G are simi- 
larly partitioned diagonal block ® matrices 


A = [A1, 4a: k ', Ax], B= [Ba Ba: ` Br], 


the blocks being determined by the elementary divisors of the matrix pencil 
H — zG. The elementary divisors of this pencil are subject to the following 
restrictions; ® if (æ— a)", a40, occurs s times amongst the elementary 
divisors of the pencil, then so does the elementary divisor (z + a)" and the 
elementary divisor 27, where r is odd, if it does occur, must occur an even 
number of times. Since the field of operations is the real field, there are four 
distinct forms for the matrices A; and By. 


Type (a). The pencil A— zB has the single pair of real elementary 
divisors (æ—p}", (x + p)". Then? 


0° Er L; 0 0 L; 
(10) B= ( "p, de 4, = (4 ae) B= (i, ay 


where 
(11) Ly = pH, + Ur. 


In (11), E» is the unit matrix of order r and U, the auxiliary unit matrix 
of the same order. In particular, if r is odd, p may be zero. 


Type (a). The pencil A — zB has only the four elementary divisors 
(A+ a+ 1b)", b0. The matrices A; and By are still determined by (10) 
ey LI 


TJohn Williamson, “On the algebraic problem concerning the normal forms of 
‘linear dynamical systems,” * American Journal of Mathematics, vol. 58 (January, 1936), 
pp. 141-163. The general fleld Æ is now the fleld R of all real numbers and the 
particular results now required are given in $6, pp. 161-163. 

*The matrices À, fod B, are square matrices of the same order. 

°? John Williamson, loc. cit., page #45 and page t62. 

49 John Williamson, loc. cit, page 158, formula, (59). 
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and (11), if each unit is replaced by the two-rowed unit matrix, each zero by 


the two-rowed zero matrix and pue matrix de 2) : 


Type (b). The pencil A sage has only the two pure imaginary divisors 
(2— ib)", (x + ib)". ‘Then * 


(12) A; = (pE, n eU;)B;, 
where e is the unit matrix of order 2, and 
| ` 0 b 
(13) p=ti=( ie 
(14) ` B; =1X,, 
0 0 0°1 
0 0 —1 0 
i s? a 0 ‘ 
A, 1)r1 0 . 0 0 


Type (bı ). The sl A;— zB; has the ainele Mae dir zr, 
Then A; = Ur and By == Xor. 

For later purposes we require the following. If r is even, X, is skew 
symmetric and, if r is odd, X, is symmetric and therefore for all values of r 
the matrix B; in (14) is skew symmetric. Further 


(16) XU, mm — U’,X, 


. and, since X == + E, 3 
(17) ; UX, = — X,U’,. 


- e | 
In type (a), when p 0, any matrix D commutative with 4,B;"* is of 


Dy 0 
the form ( 0 Da where 


r-1 j r-i 
(18) Da = > feU” and Dag — > gU w. 
k=0 k=0 


If p = 0, the matrices defined by (18) are certainlg commutative with 4;B;1 
but of course are not the only ones. 
In type (a,) D. has the same form except that f$ and gz are both poly- ` 


u John Williamson, loc. cit. page 155, formula (55). Formula (55) is of course 
simplified for this special case as indicated on page 162. The fact that B, is not unique 
` but may be replaced by — B j does not alter the form of a matrix commutative with B,. 
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nomials in p, i. e., are of the form = *): In type (b), D has the form. 
| Da in (18), where fys is again a polynomial ** in p. 
| 2.. We now consider the purely algebraic problem of determining the 
number m of lineary independent symmetric matrices F; of order 2n, which 
satisfy | ta È 

(19): . F4GH = HGF, (i— 1,2," --,m), 

0 +E 

—E 0 
singular skew symmetric matrix mentioned in the introduction. If (19), is 
satisfied, 
f > F GHG = HGF,G, 

or, since G = — Q7, 5 

(20) Fi@? HG = HGF; Q. 


Hence, if F, satisfies (19), FQ is commutative with HG. Since the 
number and nature of the linearly independent matrices /;G*, commutative 
with HG, are known," it is only necessary to determine for which of these 
known matrices F,G-1 the matrix F; is- symmetric. 

The number of linearly independent matrices commutative with H os 
depends on the number and the nature of the elementary divisors of the matrix 
pencil H.— zG. Hence, in considering the general case, it is necessary to 
reduce H and G to the normal forms given in section 1. However, if HG1 
is not derogatory, i.e., if the minimal equation of HG is the same as its ` 
characteristic equation, any matrix commutative with HG-1 is a polynomial 4 
in HG", A maximal set of linearly independent matrices commutative with 
HG# therefore contains exactly 2n members; and one such set consists of the 
2n distinct powers of HG, i. e., of the 2n matrices 


(HG), (k =0,1,2,- + -,2n—1). 


where H is a given symmetric matrix and a—( is the non- 


We may therefore suppose that 
| F.G@+ — (Ha) ‘ 


e 

12 J. H. M. Wedderburn, “ Lectures on matrices,” American Mathematical Sooiety 
- Colloquium Publications, vol. 17 (1934), page 124; John Williamson, “The idempotent 
and nilpotent elements of a matrix,” American Journal of Mathematics, vol. 58 (1936), 
p. 477. 

J. H. M. Weddesburn, op. cit., page 106. 
_ - MJ. H. M. Wedderburn, op. òit., page 27; C. CeMacDue, The Theory of Afatrices, 
Berlin (1933), page 94. ; 
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or that 
= (HŒ) H. 
Consequently, F’; the transposed of F satisfies 
Py = (—1) “H (GH) = (—1) (HG )H = (LS 


If F; is symmetric, i — 1 must be even and + must be odd. Therefore, if HG 
is not derogatory, there exist exactly n linearly independent symmetrié matrices 
F; which satisfy (19). One such set consists of the n matrices 15 


* (21) Fy = (HG) OX, | (i=1,2 ++, 0). 


Since the matrices PG, where F4 is defined i in (21), are all polygonal in 
HG, it follows that 


FGF, = FGF, (i,7—=1,2,-- +5), 
and hence, that the n quadratic integrals ‘corresponding to the etr F; 


form a set of integrals in involution. Consequently, we have 


THEOREM I. If HG is not derogatory, there exist n linearly inde- 
pendent quadratic integrals of the system (6). These n quadratic integrals 
form a set in involution and may be so chosen that the corresponding matrices 
are the matrices F; in (21). 


‘Tt will be shown later (8 3). that these n quadratic integrals are not only 
linearly independent but also functionally independent. 
If a matrix F, which satisfies (8), is not symmetric, we find, on taking 
the transposed of both sides of (8), that . 
FUE = H'CF", 
or, since H is symmetric and G skew symmetric, that 
EGH = HGF. 


Therefore we have 


Lemma 1. If a matrix F satisfies (8), so does the matris F’, the trans- 
posed of F. 


If the pencil of matrices H—x@ is congruest to the pencil À —xB, 
so that there exists a non-singular matrix P satisfying both of the equations, 


PHP’ =A’ and PGP’=B, 
NES: i p e | . 
18 Since G-1 =— G the matrix S, = (HG@)2050 A, The matrix @-1 is used instead 


of —@ to emphasize the fact, that it is the matrix pencil H—aG@ which is the 
dominating factor throughout this discussion. | . 
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then 
| AB#Q;B = Q,B°AB4, 
where Qi = PFP. L 
Consequently, we may replace the matrices H and G in (19) or (20) ‘by 
any pair A, B equivalent to H, G or, what is equivalent to this, we may suppose 
that the pair H, G is already in the normal form described in $ 1. We there- 
fore do this and assume that H and @ are the matrices A and B respectively 
of $1. We let . | 
= (Fy), (4, 7 = 1,2, i nk), 


be a partition of F similar to that of À, or B and therefore, as a consequence 
of (20), have 
(22) Py By AjByt = A BoOPyBe, (4, j= 1, 2,° +k). 


If FG" is the most general matrix commutative with HG, by Lemma 1, 
we obtain the most general symmetric matrix satisfying (8) from this by 
putting Fy; = F’4;, à < 3, and restricting Fa to be symmetric. If A,B, is 
non-derogatory, it is a consequence of. Theorem 1, that the number of linearly 
independent matrices FiB? commutative with A,By;*, for which Fi, is sym- 
metric, is exactly one half the order of Aj, i.e. is one half the number of 
linearly independent matrices commutative with A:B;1. Consequently, if 
each of the matrices A;B,* is non-derogatory, the number of linearly in- 
dependent symmetric matrices F, which satisfy (8), is exactly one half the 
total number of linearly independent matrices commutative with HG and 
as remarked earlier, this number is known.i® A maximal set of symmetric 
matrices F,, which satisfy (9), must consist entirely of oe blogs 
matrices s 
[Pa Fos : ', Fa]; 


e 
and, as a consequence of Theorem 1, the number of linearly independent 
matrices in such a set isn. It is apparent from § 1, that A,B; is derogatory, 
if, and only if, the pencil A; — zB; is of type (a) with p = 0, and that then 
H is singular. Accordingly, we have proved 


THeorEM 2. If H is non-singular, there exist n nait independent 
quadratic integrals of the system (6), which form a set in involution. The 
number of linearly independent quadratic integrals of the system (6) ts 
exactly one half the fumber of linearly independent matrices commutative. 
with HG. 


If A;B is derogatory, the elementary glivisors of the pencil A; — zB; 


16 J. H. M. Wedderburn, op. oit., page 105. 


y 
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are z", x where r is odd. . On dropping the suffix r, we obtain from equa- 
tion (10), | 
U 0 
j = eee i 
(23) AB; = e = $ ‘ 
Pyy = DBs", 
as a consequence of (22), we have 


(24) D{U, — U'] = [U,— 0"]D. 


Finally, if D = (Dj;), i, j= 1,2, is a partition of D similar to that of A,B; 
in (23), then (24) yields the equations 


(25) DyU = UDua, DeU’ = U'Dos, — DU’ = UD», — DhU = UD. 
The matrix Fy; therefore has the form 


| ( Da r) 
eray Dı Di: 
and is symmetric, if, and only if, Da; and Da: are symmetric and D,, == — D'az. 


By (16), U’ = — XUX and therefore by (25), Dı:X UX == UD,:z; 80 that 
Dı:X is commutative with U and is therefore a polynomial in U. Hence 


: rot 
Diz = > fiUi X. 
4=0 
Since r is odd, X is symmetric and therefore, 
r-1 r-1 | 
Du (X>) DAU? = X (—1) fi. 
4=0 i=0 . 4 


If Dıs is symmetric, f;=0 when + is odd and therefore, if r == 2m + 1, 
Dis depends on the m + 1 parameters fi, i == 0,2, 4,: --,2m. Similarly, if 
Da is symmetric, D., depends on m -+ 1 parameters.’ Finally, if D,, is the 
most general matrix commutative with U, then D,, depends on 7 == 2m +-1 
. parameters and D’;, is the most general matrix commutative with U’. Hence, 
if Dı = — Dn, the matrix pair Dı and D, depends on only 2m +1 
parameters. Therefore the matrix F'j;, when it is symmetric, depends on 


mtl+m+1i+2m+1=—4m+3 


parameters. The general matrix F,;,B;1, commutative with 4;B;+, however, ` 
depends on 4r = 8m -+ 4 parameters and 4m + 8 — 4(4r) + 1. For example, 
in the simplest case, r = 1, A; isghe zero matrix and F 3j is of course arbitrary. 
To restrict Fy; to be symmetric, imposes only one condition and there. are, 


13 
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therefore, in this simple case 3 = 44 -+ 1 linearly independent symmetric 
matrices Fj;. Consequently, if A;By* is derogatory, the number of linearly 
independent symmetric matrices Fj; is one more than half the number of 
linearly independent matrices commutative with AB; Moreover, the 
matrices F,,, for which Dı: = 0, form a set in involution, since any two such | 
matrices #,,B;1 are obviously commutative. Their number is r = 2m + 1 
and we therefore have | 


THEOREM 3. The number of linearly independent symmetric matrices 
Fi, which satisfy (19), is exactly one half the number of linearly independent 
matrices commutative with HG, unless the pencil H— zG has a pair of 
elementary divisors of the form (2, 22"), For each such pair of ele- 
mentary divisors, the number of linearly independent symmetric matrices Fi 
ts increased by one. 


It is obvious that two matrices FG for which Fis — 0, +54 j, are always 
commutative and it is known that a maximal set of matrices FG-1, commutative 
in pairs, consists solely of matrices?’ for which F4; = 0, i4 j. The number 
of en independent symmetric matrices F in such a set is, therefore, 


d = PL where d; is the number of linearly independent matrices Fis 
Since we have just shown that ee ra, where n; is the, order of Fis 


d= Š $n, = $2n — n. ` We have accordingly proved 
i 


THEOREM 4. Every linear conservative dynamical system with n degrees 
of freedom has at least n linearly independent quadratic integrals in involution, 


3. While the results obtained up to the present all deal with linear in- 
dependence, we now determine, from the known linearly independent quadratic 
integrals or forms, all the functionally independent quadratic integrals. We 
first show that n quadratic integrals in involution, which are linearly in- 
dependent, are necessarily functionally independent. 
| We note that, if 

F = [Fii Fos, j "Fal, 


‘is a diagonal block symmetric matrix, the quadratic forms corresponding to 
the matrices 
[Fu 0, 0, *» 01, [0, Pog, -0l ><, [0, 0, e Fa]: 


are not only linearly, but also functionally independent, since each involves ` 
itis e hd 


11 J. H. M. Wedderburn, op. oit., page 108. e 


wt 
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a different set of variables. Accordingly, we need only consider four simple 
cases—those corresponding to the types (a), (a1); (b) and (b:) of section 1. 


Type (a). Let H—2xG have the single pair of real elementary divisors 
(A+ p)". Then we may take H in the normal form given by (10), 


. (L 0 
H-(% C y)e 
where L is the matrix L; of (11). Then the n linearly independent matrices 
F of Theorem 4 may be chosen to be 1$ 


0 U* i 
(26) Ra (b= 1,2, > +n), 
and those of Theorem 1 as 
0. ZX 
(27) ee D ; (k = 0,1,2,---+,n—1). 


The quadratic forms corresponding to the symmetric matrices (26) are 
(8) fp BB xp sn (jan k= 1,2, + +n). 
+ FF x 


The n quadratic forms (28) obviously are functionally independent, since each : 
of the forms fi, fe,‘ * *, fn contains a variable, which does not occur in any | 
of its predecessors. Since the matrices (26) are all linear combinations of 
the matrices (27), the n quadratic forms corresponding to the matrices (27) 
are also functionally independent. It should be noted that the above results 
are true, even if p == 0; in this case, however, there do exist in addition other 
symmetric matrices F, which satisfy (8) but are not of the above form. 


Type (a). Yet H—xG have only the four elementary divisors 
(x £a 1b)", b0, so that H is now of order 4n. We may take H and G 
in the normal form described in section 1. Then the 2n linearly ae ean 
matrices F of Theorem 1 are given by (27), with k = 0,1,2,---,2n—1, 
while the matrices of Theorem 4 are the matrices (26) together with the 
matrices 


(29) | (Lin owe ei | 


For convenience we relabel the 4n variables v in the order 2, Éi, Lay Ên °° , Dan; Éan- 
Then the 2n linearly independent quadratic forms corresponding to the 
matrices (26) and (29) are regpectively 


1s J. H. M. Wedderburn, op. cib, page 104. 





(30) efi = 2 À (ZpTon-jap + Epfon-ssp) » (j =1,2,:-- ? n), | 
and i 


| ; ; 

(31) 2g; = 2 > (2pbon-jap — ÉpTan-j+p) - 
5 r= 

On arranging these 2n quadratic forms in the order, 


fis 91 fa, g2; or , fns Qn 


we see that they are functionally independent; for fx and ga are functionally 
independent and each pair fr, gx contains two variables which do not occur 
in any of the preceding pairs. 


Type (bı). Let the pencil H —'zG have the single pair of elementary 
divisors (+ 1b)", n 540. Then we may take H and G in the normal forms 
given by (12), (13) and (14), (15) respectively. On dropping the suffix r, 
we can easily show that any matrix F satisfying (8) is of the form D::X, 
where D; is given by (18), and the fx are two-rowed matrices of the form 
Es) If F is to be symmetric, d == 0 when n—& is odd, while c—0 
when n— k is even. If the 2n variables x are relabelled in the order, 
1, Éis La, E2, © * +» Ln, En, the n linearly independent quadratic forms of Theorem 
4 may be taken as those n of the 2n quadratic forms, 


(32) fi > (— LP (aptys1-» + Épbin-n); (—1,2,;::,n), | 
and j i 
G g= (CDA btm) F ALE n), 


which do not vanish identically. In (82), f; is zero, if j is even; similarly, 
in (83), g; is zero, if j is odd. If we write the forms in the order 
fis 92) fs» as © < it follows, as in the previous cases, that those n of the forms 
(32) and (33), which are not identically zero, are functionally independent. 


_ Type (b2). Let H— zG have the single elementary divisor z**. Then, 
with the notation of 81 fype (bı) we may take G— X and H—U. The 
matrices of the n linaarly independent quadratic forms are then UX- 
t= 0,1,2,---;n—1, by Theorem 1. The corresponding quadratic forms 
are proportional to : i . : 


2 R ` >. 
(34) a f= È (Pata, à (j= 1,3, 5,:::,èn—1). 
= : 
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As in previous cases, the n quadratic forms (34) are functionally independent. 
Combining the separate results of this section we have: 


Turora 5. Every linear conservative dynamical system with n degrees’ 
of freedom has at least n functionally independent quadratic integrals in in- 
volution. If HG is not derogatory, the quadratic forms, whose matrices are 
(HG*)2G-D A, j= 1, 2,- > +, n, are n functionally independent quadratic in- 
tegrals in involution of the linear system (6). 

4. In determining the maximal number.of functionally independent 
quadratic integrals (not necessarily in involution), we shall once again start 


from the known set of linearly independent ones. As in the previous me à 
it will only be necessary to consider four special cases: 


Type (a). Let H — sG have the elementary divisors 
(t— p)" (e + p)*; (i= 1,2, >>, k5; er = eps Sex); 


p real and different from zero. Then we may take H and G in the normal 
form of $ 1, where A; and By are given by (10) and (11) for all values of j. 
Therefore, if F is a symmetric matrix which satisfies (8), 


(y o e where W= (Fu), Gt) 
and : : 
(35) a (4%), 64 = ej; Wij — (0, Gay), & = Cj. 


The matrix Gi; is of the form * 
| e-1 
(36) f Gij = È gija U”, 
s=0 ; . 

where e is the minimum of e; and ej, and U is the auxiliary unit matrix of 
order e. If all gavo are zero except a particular one, gijs, which has the value 
unity, we denote the corresponding matrix F by F'ije-s and the corresponding 
quadratic form by 2fije-s. With this notation the linearly independent quad- 
ratic forms are 


i ; . 
(37) fist = È Graptnso-teps (4,7 =1, 8," "č; t= 1,2, > ', 6—1), 

a 1 
where r = 6, + ea +: + - He; and o= e, + ea +'e - He. The quadratic 


13 If AB = BA and A is the diagonal block matrix E4,,4,1 where the minimal 
equations of À, and A, are relatively prime, p is also a diagonal block matrix par- 
titioned similarly to A. ase =. Xa 

xJ, H. M. Wedderburn, op. c®., page 104. - 


Bole My Rano TIE Se TREER SUR Ay yee woe ee 
sets of variables involved are no longer 2, T3, ` `, Zn and Tany ` ©, Zne In 
particular, if Tr, = Y; and Gao = Zj, 

(88) fij 1 — DrnBnao = Ya. 


In order to determine the functionally independent quadratic integrals, 
we make use of linear differential operators reminiscent of the Aronhold opera- 
tors of classical invariant theory.’ We define the linear differential operators 
Q, and Q; by 


(39) MS prog +S (pen. 
orme F e H: + ++ ei and 
Dj 3 pra HSP) o 
m1 P+p p= Taspsps1 
p— e, +égt- +--+ es Then 


t 
Qifijt = X PlraperTnso-tapy 
rl 
and 


t 
Afije = = (t—p+ 1) TrspTnio-t-13p 
2° re 
since o = p + ej. Therefore 


#42 | t 
(w + 04) faye — È ( p= 1) ErspTaro-t-14p + 2 G—pt 1) etwas 
r= 


E 


=t 5 Drapnao-t-1199 
and fénally by (37) : | 
(40) (Ri + 05) fast = titre 


We note that the 2n — e, quadratic forms 
(41) fus; (tj) = 1,2,° + -,6;3 j—=iori+i1;i=1,2,"::,k), 


are functionally independent; for, if we arrange them in the order 


frufiss, VE fase 312, re > fizes3 “feat, i.e > fezes3 aes ty > Sixers 
each form contains a variable which does not occur in aiy of its predecessors. 


These variables are in order 


Ti, To,’ Te; Triton ` ` y Tnet) Toris ` "5 Testess © ° l Tne 
S : 


n Cf, L. E. Dickson, Modern Algebraio Theorie, pp. 25-27. 
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Next we show that every quadratic form (37) is a function of the quad- 
ratic forms (41). It is a consequence of (38) that 25/2541. = fjn/fj and 
therefore that 
(42) Zi = fiji 


where qi; is a rational fanction of the quadratic forms fre: and fesur Where 
(43) either jSsSi—1 ortSsSj—l. 


Consequently we have 


(44) | fisi = Qasfein 
and are in a position to prove, 


LEMMA 2. The quadratic form fi: is a rational function of the quadratic 
forms fer, fessir, Where rt, and 8 satisfies one of the inequalities (43). 


We shall prove this lemma by induction on £ and observe that, as a con- 
sequence of (44), it is true when t= 1. We assume the lemma true for the 
value ¢ and therefore have 


(45) fast R=R(fosr3 four), r St, s defined by (43). 


Let Q =» ZQ, where Qg is defined in a similar manner to Q; in (39) and 
the summation extends from i to j or from j toi. Since, by definition, Q is a 
linear operator, OR is a sum of terms, each term being a product of the partial 
derivative of R with respect to one of its variables fspr by Qfspr. By (40) 


Qf ijt = tfi jt+ and Qf apr == rf pret 


and therefore by operating with © on both sides of equation (45) we have 
tfijtu == W, where W is a rational function of fer and fseur; s$ is defined by 
(43) and *<¢-++1. Hence our lemma is proved and consequently all°the 
quadratic forms (37) are functions of the 2n — e, functionally independent 
quadratic forms (41). It should be noted, for later reference, that e, is the 
nighest exponent of (x + a) in the elementary divisors of the pencil H — zG 
or of HG? — aH, and that accordingly the minimal equation of HG is of 
degree 2e. 


Type (a,). Let H — zG have the elementary @ivisors 
(za ib)“; (i= 1,2, :,k; b0; fZ e2: Ze). 


Then, if we let G be of order 4n and relabel the variables z,, fas Lay Las" * * > Erny Eon 
we find, by an argument similarsto that applied to the forms Gaya and (31) 
of the previous section, that the forms 
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A . t 
(46) Wy = > (GrspTnso-tep + Érspénso-tsp), 
mi 
and 
$ 
(47) Vije == = (Sripénso-tap = ExspEnro-tsp) 
. 3 #1 


are linearly independent if t, o and r have the values given in (37). In 
particular re may write 
(48) : Wine YZ; yaks and vij = Yii — mes. 


The 4n — 2e, quadratic forms (46) and (47), for which j =i or ++ 1, are 
functionally independent, for they can be so arranged in pairs that each pair 
contains two variables which do not occur in any of the preceding pairs. 

On forming the Jacobian of the eight forms ` 


(49) f Wikis — Viet, Wifi, — Vigas Wier, — Viki, Wjjis — Vis 


- we obtain, with the notation of (48), the eight-rowed square matrix 


Zx 0 Y:0 


Z0 0 FY; w €) | Yi m 
K= 0 Zs ¥; 0 , Where n=( 5 J and ri (5 Le. 
; 0 Z; 0 Y; 
| O\ _ fu +) 
f W; = F = 
1 ur (o 2) (y na}? 


the rank of K is the same as that of the matrix P obtained from K by replacing 
Y, by W; and Y; by W;. Since the matrices W, and Z; are commutative, 
a simple calculation shows that the product of the two by four matrix 
° Tin Vide, + Vibs, — Vide) 

by P is the zero matrix. Hence, at most six of the eight forms (49) are 
functionally independent and, since any three of the pairs wan, Von in (49) / 
are functionally independent, each member of the fourth pair is a function - 
of the other six forms. In particular 


(50) © Wm mol (Wij, Viji; Witr Vikis 03313 Viji) 
and, if k = 4, x | 

í À 
(51) Win = G (Wiji Vija) Wiio Vii; Wj Viji) 


with similar results for the corresponding forms vy, and Vine 
| . Asa consequence. of (51), if a > b, wom is a function of forms Wij aa 
Vij, Where tj. As a consequence’ of ys when b—a>2, Wad is a 
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function of forms wij, and vi where j—1< ba. The same is true of ` 

the forms ter, and therefore we have ; 

(52) Wiji "= F (Was, Vasi; Ws, s41,13 Vs, 841,1) Viji = G (Wss; Ussi) Wa,s41,13 Ve,041,1) . 
t 


Let O1 and O; be the differential operators obtained from 0; and Qj, 
defined in (39), by replacing x by é Then 


(53) (94 + Os + Qj + Os) wage = Wijt 
and : | 
(54) (4 + Os + Q; + Oj) vii = tigna 


If O == 0), where the summation extends from + to j or 7 to i, we may 
prove a lemma, which is the analogue of Lemma 2. In the latter the operator 
Q is replaced by Q + 0, (44) by (52), and (40) by (53) or by (54). It 
then follows that any of the quadratic forms (46) or (47) is a function of-the 
4n — 2e, quadratic forms for which j =t or i+ 1. Therefore, in this par- 
ticular case, where G is of order 4n and the minimal equation of HG is of 
degree 4e,, the number of functionally independent quadratic integrals of the 


system (6) is 4n — 2e. 
Type (b:). Let the pencil H —-zG have the elementary divisors 
(z + b)”, @ 0, j= 1,2 k; ae Ze). 


Then we may take H and G in the normal form of § 1, where A ; and By are 
given by equations (12), (13), and (14). If F is a symmetric matrix, which. 
satisfies (8), and, if F = (Fij), 1,7 == 1,2,- - -,k is a partition of F similar 
to that of H or G, 


Fy = (Wy); 
where Wy, is defined by (35) and (36), with the addition that ° 


eau (e t). 
an — tijs Tija 
Since F is symmetric, F’;; = F44, so that we need only concern ourselves with 
the case in which +S j. If Foy = 0 for all a and b except when a = t, b =m 4 
and a= j, i= b, the corresponding quadratic form involves two sets of 
variables, one containing 2e4, and the other 2e; vêriables. If for convenience 
of notation we write e; == e and e; = d, we can denqie these sets by 


2 D, é, To, Ég 7 Tos be and Yis Mis Yo Mer" * ` > Ya, Na 
respectively. The correspondigg linearly independent, quadratic forms are ther 
P y p es AP q i i 


t $ ; a 
(55) Wijt =e à (— 1) tyin + Epntsi-p), | (t =s ? d), | 
pur 


(56) vie => (— 1) (tpness-p — ÉpYtui-p), (t—1, 8, : *,4). 
#1 


If t— j, the variables x, é are the same as y, 7; and Wijt = 0, if t is even, 
while Vit == 0, when ¢ is odd. Let 


o= $ p(o emi) and 0 Sr dus 2) 
A 0m 0 o P (I yg 7 
Then ; 
t 
QeViyi = > (— 1)?*"p (EpurYtar-p — Tpytu-p) 
! pl 
t+1 
== 2 (— 1)? (p — 1) (EoY/t+2-p — Lpytse-p)» 
pas 


while 


t À 
Qu igt = 2 (— 1) (t + 1— p) (2pntse-p — EY t2-p). 
| E 
Therefore 


t+1 

(57) (Qs + Qy) Wijt =t > (— 1)? (Lpytse-p = ÉpYt+2-p) = Vijita 
p1 

Similarly 


t 
QaVijt = = (— 1)?**p (Epsintst-p + ZpsiY/te1-p) > 


and 
; : 
Oyvige = 2 (— 1)74 (t + 1— P) (— ToYt+2-p — Épnt+2-p) 
p= < 
80 that 
_ (58) (Qe + Oy) Vije = — basta. 


The Jacobian of the four forms Wit, Wijs Wija and vij, is the matrix 


Tt & 00 
0 0 Yı m 
yı M Ti & 
Mm — Y — Ê Tı 


The determinant of this matrix is zero while the matrix of its first three rows 
has rank three. Therefore v4;, is a function of Wit, Wjy and Wiji Since 
Qs + Q, is a linear operator, we may employ an argument similar to that 
used in the proof of Lergma 2, and from (57) and (58) deduce that w:52 is a 
function of Wit, Wj41, Wijr, Viin, Vise, Vase By repeating this argument we 
finally have the result that, for any value of k, Wijs and Vijexs1 are both func- 
tions of forms of the ‘type fae, where e 


(59) : fabo = Wade, if c is odd, and fabo = Vane, if c is even. 
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If we denote the variables associated with F,, by z and ¢, the Jacobian 
matrix of the six forms i 


(60) furs fits fro fus fims firs 


formed with respect to the variables 21, 41, 2, &, m1, &:, is the matrix 


z00&6&0 0 
0 y: 0 O m 0 
0 0 z 0 0 & 
Yi Ti O m & 0 
0 z Yı 0 & Mı 
TOE ii 0 4 


If X is the column vector with components é, mr, tu — is — Y1; — #1, the 
vector QX is the zero vector. Hence Q is singular. The first five of the forms 
(60) are functionally independent: and therefore firı is a function of these 
five. Therefore, if i+ 1< r, 
(61) , fin = F (fari), 
where b —a < r— 1. 

` As a consequence of (61) we have 


(62) fers = G (favs), 


where b == a or a-+-1. By operating on both sides of (62) with an operator, 
which is the sum of all operators Qe and Qy for all sets of variables, it follows 
that, for all values of ¢ and 7, the form fij: == F (favs), where b = 4 or a +1, 
s = 1,2,- `,» Hence, the quadratic forms wij: of (55) and (56) are 
functions of the forms f 


Q—_ 


(63) fase, (1,2, :,k;j—tiori+i,t—1,2, : :,6;), 
e 


where fi; is defined by (59). There are 2n —e, forms (63) and they are 
functionally independent, as they may be so arranged that each involves a 
variable which does not appear in any of its predecessors. Once again, e, is 
one half the degree of the minimal equation of HG. 

If H is non-singular, we can take H and G@ in diagonal block form 
H = (Hy, Hz: 3 , A), G= [G,, Ga: : Gl, where H; and G; are of 
order 2n; and the elementary divisors of H; —x@; are of one of the three 
types considered above. The number of functionally, independent quadratic 


| à ; 
forms F is therefore $, (2n,——2m,/2), where 2m; is the degree of the 
iz 
t j t . 
minimal equation of HiG. Bat È n; =n, and > 2m, = 2m is the degree 
: 4-1 . #=1 


of the minimal equation of HGe. Since the remaining case, in which H is 
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singular, is rather complieated, it is Povi to Sum up our i in the 
form of 


THEOREM 6. Let H be the mairiz of the Hamiltonian function of a linear 
conservative dynamical system with n degrees of freedom. If the degree of. 
the minimal equation of HG is 2m, the system has 2n — m independent 
quadratic integrals unless H is singular. 


5. In order to obtain corresponding results for the case of a singular H, 
it is only necessary to consider the case in which every elementary divisor of 
H — zG is of the form 2”. We therefore consider the pencil H — zG, in which 
the elementary divisors are | 


x; & ax a? (1,2, °°, k; 6 = 6: = ++ + = ey). 


When r is odd, the elementary divisor «7 must occur an even number of times; 
hence, -if e, is odd, either e, has the same value as ei, or the same value as 
eu. From §1 we see that a norinal form for HG is the diagonal block 
matrix 

[Wi Ws,° °°, Wa], 


where W, = U; if e; is even, while, if e, is odd, either W: == U, and : 
Win == — U’, or Wia = U; and W; = — U’';,. The normal form for G is 
also a diagonal block matrix with a block X, (defined by (15)) corresponding 
to each even e; and a block ( > A corresponding to each pair of odd 84: 
AT 
In the above the matrices, E;, U; and X, are the matrices of order e, de- 
fined in $ 1.. 

In order to determine thẹ form of the most general eres matrix: F, 
whigh satisfies (8), we let 
(64) C == PG=. 


f 


Since C is-commutative with HG", if C = (Ci), t, j = 1,2,: > +, k, we have 
(65) Wilis = Cay Wy 


The matrices Cy; are of four types depending on the structure of W; and Wj. 
However, as we shall only*be interested in symmetric matrices F, we need only 
consider the different possibilities for Cy when i < 7, so that e; = ej. There- 
fore, in what follows, i is always less than or equal to j. These possibilities are: 


Type (i). Wi= Un Wy= Us: Then Cu (Y) where Gy is a 


polynomial in U}. x : l 


THE INVOLUTORY INTEGRALS OF LINEAR DYNAMICAL SYSTEMS. 901 


Type (ii). Wi=— Us, W);——U';. Then Cu=(x where Kij 
tj 
is a polynomial in U’;. 
Type (ili). W; == Ui, W; = — U’; Since UX; a= — Y,U;, (65) 


becomes 
UO Cy Xj = Cis X5 0; 


G 
Ci = ( wt) x, 
where va is of type (i). 


Type (iv). Wi = — U’;, W == U;. Since UX; = — X,U’; (65) 
becomes 


and 


TCX y = CX; U’;, 


Cy == (=) Xj, where en is of type (ii). 


Symbolically we may denote a matrix Ci, of type (p) by Tp, p = 1, 2, 3, 4 and 
therefore symbolically have the result 


so that 


(66) Tı=T,X and Dis POG 


If F = (P;;) is a partition of the matrix F, defined by (64), similar to 
that of C, the matrix F4; is one of four distinct types. 

Tf W, = U; and e; is even, Fii = CiiX;. Since Cr is of type (i), by 
(66), Fu is of type (iii). 

If Wi == U: and €; is odd, Fu = — Uria: Since gi is odd, Win = —— T'i 
and Cii. and, therefore, Fi; is of type (iii). | ° 

If Wi = U’; Fi; == Cii Since Wi = Ui, the matrix Ciia and, 
therefore Fi, is of type (iv). 

We accordingly have the lemma, 


Lexa 3. The matric Fu is either of type (iii) or of type (iv). It is 
of type (iv) if, and only if, Wı =— U':. If Fu ts of type (iv), the matrices 


Fiai and Finin are both of type (iii). ` 
e; even: Then Fij == Ci Xj. If Wi = Ui, Ci; is od type (i) and, by (66), 
Pi; is of type (iii). If W, = — U%, Ca is of type (iv) and, by (66), Fi; is 


of type (ii). On replacing the matrices of & . ) by their corresponding 


types, we may express the above results conveniently by the two diagrams 


on CF) où (2) 


ej odd, W; = U;: Then Fi; == — Ci, ji If Wi — U; Fy ig of type (iii) 
and, if W, = — U's, Fy is of type (ii). These two results lead to the same ` 
` diagrams (67). i 

€; odd, W; == — U’;: Then Fi; — Ci g-is E W; = Ui Fi; is of type (i) 
and, if W: =— — U’;, Fy; is of type (iv). These results may be expressed by 
means of the two new diagrams | ‘ 


T: Ty {Ta Tu 

(68) | ( | and Ç 7 

It is apparent from the diagrams (67) and (68) that the type of Fij 
uniquely determines the types of Fi; and Fy;, and conversely. Since Ts and 
‘T4 involve. the matrix X, while T, and T, do not, we shall call the matrices of 
types (i)-and (ii) posttive, and those of types (iii) and (iv) negative. For 
brevity we shall say that F'.; has sign e, where e= + 1 or — 1, according as 
Fi; is of positive or negative type. : | 

4 + 1 less than j: If Fi; is positive, either Fu i is of type (iv) and F;; of 
type (iii) or Fy; is of type (iv) and Fi, of type (iii). In the first of these 
cases, as a consequence of Lemma 3, Fis: in is of type (iii), so that Fi a is of | 

type (ii) and Fi, s is of type (iii). In the second case, F;, 3-1 is of type (iii), 

` Fija of type (iii) and F;, of type (i). We have therefore proved 


Lemma 4. Ift+1<j and if Fi; is positive, there exists an integer k, 
i <k <j, such that Fix and Py; are of opposite sign. 


On the other hand, if Fy; is negative, Fi, and Fy; are of the same type. 
Therefore, if i < k <j, Fi and Fy; are both of the same sign. We may 
` combine this last result with that of Lemma 4 to have 


` Lemma 6. Let Fy; have the sign e= + 1 and let i+ 1 < 7. Then there 
exists an integer k, i< k < j, such that, if the sign of Fu is 8, the sign of 
Fi; is — e. 

We next obtain explicit formulae for the linearly independent quadratic 
forms. If Fix is of type gü), Fu = Qu X, where. Gi is a polynomial in U4. 
Since F, and therefore F4, is symmetric, Fi; is an even or an odd polynomial 
in U4, according as e; is even or odd. Let 

Fin = UX. 
Then, if e; =e, and æ is the vegtor with components Tı, Ta,' ' ', Te 


OR yt = guh 
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where 
(69) giu = À (ans Ram se 


It is obvious from (69) that gi; is zero, if t is even, so that there are only | 
[4 (e + 1)] linearly independent quadratic forms gs, for a fixed value of 1. 
Similarly, if Fu is of type (4), Fin = (Ui ty X, and d'Fiut = hist, where 


; 4 
(70) hit = 2 (— 1) Dos Teint. 
= . 


On comparing (69) and (70) we see that his: is of the same form as 8 qu, if 
æ; is replaced by Ze. 
Since F is symmetric, the linearly independent quadratic forms corre- 
sponding to the matrix Fy; are those whose matrices are of the form 
0 Fy 
Fy 0 
e and d Fe the corresponding quadratic form is 


(71) en (p, TE) =F. 


Since, according to assumption, e = d, if Fi; is of type (iii), the linearly 
independent quadratic forms obtained from (71) are 


). If & =e and a= d and z and y are vectors of dimensions 


(12) g= È (1) tea (1,2, d). 
I£ Fi, is of type (iv) they are 

(13) hie > (—1)eestert (#2), 
If Fi; is of type (i) they aré 

(4) > ay Severs | (1,2, -:,d), 
and, if F4; is of type (ii), they 2 | 


(75) l Diye = È tunit e (t= 1,2,- +, d). 
Although at first sight it appears that there are four distinct types, there are 
really only two. If, in (73) and (74), we replace y; Éy Ya and in (73) and 
(75), zy by Zeus, the forms hij: becomes the same as gije while vase and Wijt 
both become : G 


de : 
; (76) Viji = DaTpYts1-p) (t =1, 2, *,4). 
| pat | 
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The 2n original variables z may be placed in sets of e1, €2,° * * , x 
- corresponding respectively to the symmetric matrices Fir, Fo, +, 
‘when Fi, is of type (iv), the e; variables associated with Fu are : 
in the reverse order, this relabelling is equivalent to the replacement 
Yau-s and of x; by Te, the replacement by which (76) was obtañ 
such replacements may be made simultaneously and, if this is done, th 
independent quadratic forms corresponding to the matrices F; 
i, j =1,2,---,& are all of two types; the forms gije of (72) and £ 
of (76). It is of course apparent that, if i = 7, the formula (72 
to (69). In conformity with our previous convention, the forms 
called negative, and the vis: positive. For convenience we shall der 
forms by fist, so that j 

(77) either fut = vist OT fije — Jijte 


Further, when there is no risk of confusion, we shall drop the suffixe 
and write.vz, ge and fi for vist, gije and fije respectively. 
We now define two other linear differential operators, X and H, 


x (78) X= 3 (— Horu a A 
A 41 6x5” 41 bys 
en 
Í t t 
(X + Hu =X D Tofi1-p + H D ypTtar-p 
pel 1 ` 
t t 
= X, (— 1) panaÿtan + eZ (— 1) pyoutt1-p 
pal pal 


t t-1 
= X (— 1) pra) + eZ (— 1) P(t — p) tpt 
pel - pao 


t 

, =t È (— 1) payt- | 

° = if e= fé 
= t $, (— 1)PtpYtar-p = tftn 

pr 


and therefore 
(79) (X + (—1)'H) vr = tye. 
Further, . 


t à j . 
(X + eH) g: = XD (— 1) tpytnip + e(— LH TE (— 1) tts» 
pel £ ri 
: n ; 
— x (— 1) 2P OD t41-p + e(— 1) t-t > (— 1) PH Nyy, 
wel e p= 
e 


t t 
= — Ñ Pl pYtup — D (t — p) TpaYta 
pi p=0 
° } if e= (— 


t 
=— t5 Xpy trip =e — rs 
g= 


(X + (— 1) OH) gi = — Win. 


4% BSA ; 
Since 24, = Vi = — 91, MT Fipe? Hy R 


(X—H) (2191) = 92, l 

(X — HE) (ag) = (X — E) g: = — Wa | 

(X— H)? (1141) = — 2 (X — Hu = — 3! g4, 
n general f 

(X — B) (Ty) = (— 1): (2s + 1) I geese, 

(X — E)” (a) = (— 1)” (28) vao: 
larly f 
(X+ H) (ay) = (—1)*(28 + 1) ana 
(X + E) (ay) = (— 1) (28) ! gass. 


It was remarked earlier that the forms fij: of (77) are of two types, 
ive and negative. However, for a fixed + and 7 and variable ¢ all fiye are 
e same type, the type being determined by the sign of Fiz. 

Since v+ is positive and g: is negative, we may write (81) and (82) more 
actly in the form 


(X — H)” (2,41) = afessa, (X + eH)? (241) = bfreu 


© —e is the sign of f: and a and b are numerical constants. If i= j, 
is the gis, which is defined by (69); and, on dropping the suffix 1, we 
, as the analogue of (83), 


x” (titi) = Ass m= afzaris 


‘e a is a numerical constant. . ' 

We shall now prove that all the quadratic forms f4;: are functions of those 

vhich j =i or j==t-+-1. In so doing we shall say that fis: is reducible, 
ig a function of quadratic forms fac, where either b— a< j—i or 

j, a —1and c<t. Clearly, fij = tY 154 f, is reducible, since it is a 

tion of 2,7 == fin and of y? = fins The reducibility of fis, will be ez- 

sed by writing : 

f in = 0. a 


s, if f= h, f — h ts reducible. E 

In showing that fis: is reducible, when j > 1+ 1, we shall use an in- 

ion proof. The proof consists of two essentially different parts, since the 

3 of even and odd values of t have to be treated gepårately. In fact, for 

t the restriction j > i -+ 1 may be replated by the weaker inequality j > i. 
e 


14 
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We first assume’ that fije, j > à is reducible for << 2m and. under this 
assumption prove that then fij em. is reducible.’ Since (2141) (12191) = 217417, 
(X + Ht)" (tags) (Gays) = (X + H) Mey. | 
Therefore by Leibnitz Theorem, | 


en) SP) xp dread} + anna) 
=F (7) cree. 


-IÉ r is even, X°(2:?) and H“)(¥,?) are both-reducible by (84). Moreover, 
if Fa has the sign —e, and r is even but different from 0 or 2m, 

(X + H)" (myi) and (X + H)?™ (syi) are also reducible by (83) and our 
induction assumption. Accordingly, with this value of e, (85) reduces to 
(86) 2) (X + M) (ag) =V — D, 

where . 
en v= g PP) (+ an) aiy) 
and | | - 
ee. pes (PP) pera) qu). 

‘ >. rodd \ T . 
On writing | 
X2(r,) = X* and H?(y:) = HP, 


‘we have in place of (87) 





) r 2m-r ret 
e) T= 5 (7) & (2) xcanre E (Mr), 
~ rodd \ T a=0 \G v= 
< © ` r PA 5 X2X°H eH 2e-r-b ath 
ABS) CT ear esa) Lene 


If U* is obtained from U in (87) by replacing « by — e, since r is odd, as a 
consequence of (83) and our induction assumption, U* is reducible. Hence, 


(90) U=U—U*. 


In (88) U is expressed fa terms of powers of e In the difference U — U* 
all even powers will disappear and each odd: power will occur with a factor 
two. Therefore, U — U* ig equal to twice the sum of those terms on the 
right of (89) for which @+ 6 is odd. Jn this summation therefore each 
term has the factor she For fixed values of a and b, the coefficients of 
2 (2m) 1eXeX?/alb! iè BHreH?™-#/(r—a)!(2m—r—b)!, where the 
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summation extends over all odd values of r, for which r = a, 2m—r=b. 
If a is odd, r— a-is even and, since a+ 6 is also odd, 2m — r —b is odd; 
while, if a is even, r—a is odd and 2m—r—b is even. Hence in Ÿ each 


term H?He/plaql, for which p+ q = 2m—a— b, occurs me once, and ` 


therefore 
SY ogm- (r— a)! (2m—r— b)! 
at 3 a pa ‘) BPH (2m —a — b)! 


prato 


== fH"? (y1y1)/ (2m — a — Ñ) l= pHa (y,2)/ (2m —a—b)!. 


Therefore, on changing the order of summation in the sum for U — U*; 
we have i 


U—U* = 2e(2m)! D (X°X/a!b Dan b)! 
atb 


a 


“Therefore, by (90), U=V; while-by (86), AEE T TE But, 


by (83), (X + H)” (ayy) = afitj emu and accordingly fij sms is reducible. 


. Incidentally, we have shown that fiy oma i8 a function of fis, fiz, and fus 


where s < 2m. 

We now show that, if 441 < 7j and uf fije is reducible for t = 2m +1, 
then f;j2m:2 ig reducible. Let the sign of Fs; be e and let z, be the variable 
associated with the integer k of Lemma 4.. Let 8 be the sign of Fi. Then, 
by Lemma 5, — & is the sign of Fay. If Z is the differential operator obtained 
from X in (78) by replacing x by z, as a consequence of Oa and our induction 
assumption we have the following results : : 


(X + E) (mg) = afina = 0, if £< m, > 5 
(91) (X—eH)* (ay) = bfijzn = 0, if t< m, 
(X + 82)?" (x121) == (X — 82)" (2121). 
= (H — èZ) (yz) = = (H + io == 0. 


Since (21y1) 213 = t£e) (4121), 
(X + H+ 82) (ay, a? = (X4 a yey (aye) (ym). > 
Therefore, by Leibnitz’ Theorem, if g==2m-+1;, | 
$ (2) x + meaa) 
| =$ (9 } (x + wlan) art merge). 








908 
On using. 
equation, 


where 


and 


If y 


- reducible 


. On expan 


Therefore 
Sine 


j=i+] 


` forms fij 
e 


(92) 


If e; == 2 
of forms 
The total 


64. Acco: 


` where e: 


Hen 


ER 


in the order 


18° "73 fiaz isa, Frs) feos,” Rag frac, ° us 


forms contains at least one variable which does nc 
essors. The minimal equation of HG“ is of degre: 
yer À can be expressed in terms of e, and the order 
ased to prove Theorem 5 we can now complete i 


Let H be the matrix of the Hamiltonian funct 
dynamical system with n degrees of freedom. I) 


imal equation of HG, the system has 2n — |; 
wt quadratic integrals. | 


° 
2n 
tegrals. If yT = >) yiz; = l is a linear integra 
41 


tces to 
y GHz =0 or HGy—0. 


j= 0, lis a linear integral of (6). Since G is nor 
2a linear integral of the system (6) exists if, 
‘integral, 1? is a quadratic integral, for 

(le?) GHz = 2y gHz = 0. 


early independent linear integrals is k, where ?n- 
#7 or H. We may express this by 


is The number of linearly independent linear 
if the elementary divisors of the form af ‘belongi 


ition of the previous section the linearly independ 


Vu; SR S 
only one integral for each value of i. If e, is eve 


fui = Dou, o= b + ee +: -P eia; 


s not new. See Aurel Wintner, “On,4he linear c 
Annali di M@tematica pure ed applicto, ser. 4, tomo 


| š 
is proved by Wintner. See reference **. 
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if e, is odd but W; = U; (so that ejn = 6; 


Vfin = ton and V fingja = Ern p= 


Since G is now in the normal form of g1 
matrix, and since Zon is the only linear : 
in G, Ton is in involution with all other | 
of tp. and z, Further, the integrals Tp 
‘ej==1. In this last case, e; — 1, they are 
the theorem 


THEOREM 6 bis. The number of lin 
volution consists of k — f members, where . 
divisors of the pencil H — zG which have 
`of those for which r—1. _ 


As a consequence of the &bove theorer 


COROLLARY 1. All linear integrals oj 
if, and only if, the pencil H—zxG has n 
` form r —0. | 


Further, if there exist f linearly.ind 

which are not in involution, the pencil Z 

mentary divisors of the form æ— 0, æ—1 

| Obviously linear independence of lines 
tional independence, 


| 7. In the particular case of the sm 
-Lagrangian libration point in the restrici 
matrix H of the Hamiltonian function is 


1 0 0 1 
0 1—1 0 

ae e 1/4 —z °F 
1 0 —z — 5/4 


This matrix can be written more convenien 
z =- : 


Ud 
jz + i 01 
01? e —1 07? 
™ Gyldén, Bull. Astr., vol. 1 (1884), pp. 361- 


Masse im problème restrèsat und über des prob 
tioner og mindre meddelelser fra Købehavns O 


a 
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A The characteristic equation of H Gi is 


feos) 
{ 


a ot + E afa) =o, 


and the roots of this equation are given by 


(94) 


g? 





ie VI EE 


If the radical in (94) is negative the four roots of (93) are all complex and 

distinct while, if the radical is positive, the four roots, while still distinct, are 
“all purely imaginary. When the radical is zero the roots of (94) are all 
_ imaginary but are equal in pairs. In this, the critical case, the general solution 
‘contains secular terms.’ Nevertheless, as far as quadratic integrals are con- 

‘cerned, this case is the same as the other two. For, in the critical case, the 
“characteristic equation of H Gris j is. (2? +4) =p. Since — 


(HG) — Gan a) 


+. —e—k 


the minimal equation of HG“ is not z? + 40. Accordingly; for all values 
of p, the minimal equation of HG is the same as its characteristic equation. 
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Therefore, by Theorem 1, there are exactly two independent quadratic integrals: 


the energy integral whose matrix is H and the integral whose matrix is 
(HG")?H. A simple calculation shows that 
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